Jobs
>
Dallas

    Senior Platform Reliability Engineer - Dallas, United States - G-Research

    Default job background
    Technology / Internet
    Description
    Do you want to tackle the biggest questions in finance with near infinite compute power at your fingertips?

    G-Research is a leading quantitative research and technology firm, with offices in London and Dallas.

    We are proud to employ some of the best people in their field and to nurture their talent in a dynamic, flexible and highly stimulating culture where world-beating ideas are cultivated and rewarded.


    This is a hybrid role based in our new Dallas infrastructure hub where we work on the latest technologies in a cutting-edge environment.

    The role


    The Reliability Engineering team, part of our Platforms as a Service (PaaS) function, works with a variety of technologies, including multiple Kubernetes clusters, multiple database technologies, low latency networks and big data warehouse across multiple regions around the globe.


    We are actively seeking an experienced Site Reliability Engineer (SRE) with a proven track record in building up and bootstrapping SRE functions across multiple teams.

    We want an individual who excels in ensuring the robustness, scalability, and fault tolerance of large-scale infrastructure.

    The ideal candidate will have a comprehensive understanding of the intricacies involved in architecting, deploying, and maintaining high-performance solutions, coupled with a track record of implementing and enhancing reliability measures across all infrastructure ecosystems.


    This role demands hands-on experience in orchestrating resilient systems, fine-tuning performance, and implementing proactive strategies to mitigate potential downtimes or disruptions.

    The successful candidate will play a pivotal role in driving the reliability, efficiency, and scalability of infrastructure platform through innovative solutions and best-in-class practices.


    In return, you will gain exposure to the latest hardware and software technologies in a forward-thinking company, which values innovation, personal development and training.


    Key responsibilities of the role include:
    Leading efforts to enhance existing practices across teams, fostering collaboration and synchronization to optimize system reliability and scalability
    Driving strategies for enhancing systems performance, leveraging innovative approaches to improve efficiency and streamline processes
    Implementing best practices for system reliability, fault tolerance, and scalability, ensuring alignment with evolving industry standards
    Cultivating a culture of continuous improvement, encouraging regular reviews and iterative enhancements to tools, methodologies, and processes
    Enhancing incident response processes by conducting comprehensive reviews, implementing improvements, and integrating learned lessons into future strategies
    Leading efforts to optimize capacity planning strategies, ensuring systems are prepared for future scaling while maximizing resource utilization
    Collaborating with security teams to fortify and enhance security measures within systems, ensuring compliance with evolving policies and standards
    Collaborating effectively with other SRE's within PaaS, and colleagues in different time zones.(Dallas and London)

    Who are we looking for?

    The successful candidate will be an experienced Platforms Reliability Engineer who is enthusiastic about contributing to an automated, scalable, reliable and high-performing Infrastructure and Platform as a Service:

    A strong desire to continually learn about new technologies, approaches, and systems, along with the agility to work across multiple teams
    A strong communicator with excellent written communications to technical and non-technical audiences
    A self-starter with excellent problem-solving skills
    Proficient in Go or other programming language such as Python, Rust or Java for automation and development tasks
    Extensive Linux, Networking and Infrastructure knowledge
    Experience with CI/CD (preferably Jenkins and ArgoCD) and Configuration Management tools, such as Ansible and Terraform
    Experience deploying and running applications on Docker and Kubernetes, including the creation of Helm charts
    Familiarity with monitoring tools like Prometheus, Grafana, Open Telemetry and the ELK stack (Elasticsearch, Logstash, Kibana), or similar
    Understanding of core SRE concepts and their implementation in platform engineering


    Beneficial experience would include:
    Experience building and bootstrapping an SRE organization across multiple teams
    Experience working on large-scale infrastructure to improve performance, stability and efficiency

    Why should you apply?

    Market-leading compensation plus annual discretionary bonus
    Informal dress code and excellent work/life balance
    Excellent paid time off allowance of 25 days
    Sick days, military leave, and family and medical leave
    Generous 401(k) plan
    16-weeks' fully paid parental leave
    Medical and Prescription, Dental, and Vision insurance
    Life and Accidental Death & Dismemberment (AD&D) insurance
    Employee Assistance and Wellness programs
    Generous relocation allowance and support
    Great selection of office snacks, and hot and cold drinks
    On-site gym and car parking% % %%techsoftware%%


  • Epsilon Grand Prairie, United States

    Sonova · Radolfzell am Bodensee, BW 78315 · posted 04/27/2024 · More... · front runner · Buyer/Planner Fertigungsdisponent:in (d/w/m) · Danaher · Bodman-Ludwigshafen, Baden-Württemberg 78351 · posted 04/27/2024 · More... · front runner · Manager: in Software Team (d/f/m) ...


  • Mass Staffing Projects Dallas, United States

    Job Description Looking for something reliable?One of our top mining clients is looking for a skilled and experienced Reliability Engineer to join their team in Free State.Requirements Include: · •Senior Certificate · •Degree in Mechanical / Electrical Engineering · •GCC Mines & ...


  • Mass Staffing Projects Dallas, United States

    Job Description · Looking for something reliable? One of our top mining clients is looking for a skilled and experienced Reliability Engineer to join their team in Free State. · Requirements Include: · Senior Certificate · Degree in Mechanical / Electrical Engineering · GCC ...


  • AllSTEM Connections Plano, United States

    SITE RELIABILTY ENGINEER · ON W2 · PLANO,TX/HOUSTON,TX/DELAWARE · HYBRID REPORTING: 3DAYS ONSITE · SKILLSET NEEDED: · AWS · BIG DATA · SPARK · PYTHON · SCRIPTING · SHELL · PERL · CONTROL-M · AUTOSYS · GRAFANA · ...


  • Global Mobility Services Dallas, United States

    ** We are not looking for C2C profiles at the moment** · Site Reliability Engineer · 7 months contract to start (potential for CTH) · Looking for hybrid resources local to Denver or Dallas. · Summary · Our client is actively seeking a skilled and experienced Site Reliability En ...


  • WestRock Dallas, United States

    The Electrical Reliability Engineer is considered the local expert on the equipment within their area of assignment and must either have demonstrated capability unique to the equipment or be capable of quickly assimilating new information to become e Reliability Engineer, Electri ...


  • Global Mobility Services Dallas, United States

    ** We are not looking for C2C profiles at the moment** · Site Reliability Engineer · 7 months contract to start (potential for CTH) · Looking for hybrid resources local to Denver or Dallas. · Summary · Our client is actively seeking a skilled and experienced Site Reliabilit ...


  • Global Mobility Services Dallas, United States

    ** We are not looking for C2C profiles at the moment** · Site Reliability Engineer · 7 months contract to start (potential for CTH) · Looking for hybrid resources local to Denver or Dallas. · Summary · Our client is actively seeking a skilled and experienced Site Reliability Eng ...


  • Highbrow Dallas, United States

    System Reliability Engineer (SRE) 2 · —> 5 to 7 years experience · Location :- Kansas City, Mi or Atlanta, GA or Dallas, Texas · Job Description: · We are seeking an experienced System Reliability Engineer (SRE) 2 to join our team. The ideal candidate will have 5 to 7 years of ...

  • Juniper Networks

    Reliability Engineer 4

    17 hours ago


    Juniper Networks Dallas, United States

    Job Description · Site Reliability Engineer 4 · Location: Anywhere in the United States · Juniper is changing what's possible in networking. We're going beyond building the networks customers expect - we're building the networks customers deserve. And the world is taking note. B ...


  • Diverse Lynx Dallas, United States

    Job Title: Site Reliability Engineer · Location: Dallas, TX//Onsite · Duration: Full Time-Only · JOB DESCRIPTION: · Responsible for ensuring the reliability of systems, minimizing downtime, and maintaining service-level objectives (SLOs). · •Developing, automation and imple ...


  • Goldman Sachs Dallas, United States

    MORE ABOUT THIS JOB: · Who we are: · At Goldman Sachs, our culture is one of teamwork, innovation and meritocracy. We often say our people are our greatest asset and we take pride in supporting each colleague both professionally and personally. From collaborative work spaces an ...


  • Avetta Dallas, United States

    Join Avetta as a Site Reliability Engineer · Site Reliability Engineers are pioneers of the production systems, we believe in proactive discovery and analysis of our entire stack, continually optimizing, tuning, and scaling the system for maximal end-user experience on a globally ...


  • Avature Dallas, United States

    WestRock (NYSE :WRK) is a global leader in sustainable paper and packaging solutions. We are materials scientists, packaging designers, mechanical engineers and manufacturing experts with a shared purpose: Innovate Boldly. Package Sustainably. Guided by our values of integrity, r ...


  • Suncaptech Dallas, United States

    Role: Site Reliability Engineer · Location: Dallas, TX (Onsite) · 2 Positions available · Implement SRE practices · Identify, craft, and maintain SLIs and SLOs for teams, as well as metrics such as MTTR, Lead time for change, Deployment Frequency and Change Failure Rate · Work ...


  • Avature Dallas, United States

    WestRock – · Electrical Reliability Engineer, Dallas, TX · WestRock is a company where each of us genuinely belongs, is respected and valued, can do our best work, and where diversity, inclusion and equity are competitive advantages · The Opportunity: · The Electrical Reliabilit ...


  • PMG, Inc. Dallas, United States

    PMG is a digital company that helps marketers connect people with their brand. Focused on people and grounded in data, our award-winning culture fosters meaningful careers. Partnering with the most iconic brands in the world, we put people at the center of everything we do to del ...


  • PMG Dallas, United States

    Job Description · Job DescriptionPMG is a digital company that helps marketers connect people with their brand. Focused on people and grounded in data, our award-winning culture fosters meaningful careers. Partnering with the most iconic brands in the world, we put people at the ...


  • Copart Dallas, United States

    Posted Wednesday, January 10, 2024 at 12:00 AM · Copart is seeking a Site Reliability Engineer for our Dallas HQ office specializing in Systems and application monitoring and troubleshooting. This position will be part of a 24/7 Global Network Operations team that monitors and pr ...


  • Priceline Long Distance LLC Dallas, United States

    This role is eligible for our hybrid work model: Two days in-office. · Site Reliability Engineer (SRE) · Our Technology team is the backbone of our company: constantly creating, testing, learning and iterating to better meet the needs of our customers. If you thrive in a fast-p ...