Jobs
>
Dallas

    Senior Platform Reliability Engineer - Dallas, United States - G-Research

    Default job background
    Description
    Do you want to tackle the biggest questions in finance with near infinite compute power at your fingertips?

    G-Research is a leading quantitative research and technology firm, with offices in London and Dallas.

    We are proud to employ some of the best people in their field and to nurture their talent in a dynamic, flexible and highly stimulating culture where world-beating ideas are cultivated and rewarded.


    This is a hybrid role based in our new Dallas infrastructure hub where we work on the latest technologies in a cutting-edge environment.

    The role


    The Reliability Engineering team, part of our Platforms as a Service (PaaS) function, works with a variety of technologies, including multiple Kubernetes clusters, multiple database technologies, low latency networks and big data warehouse across multiple regions around the globe.


    We are actively seeking an experienced Site Reliability Engineer (SRE) with a proven track record in building up and bootstrapping SRE functions across multiple teams.

    We want an individual who excels in ensuring the robustness, scalability, and fault tolerance of large-scale infrastructure.

    The ideal candidate will have a comprehensive understanding of the intricacies involved in architecting, deploying, and maintaining high-performance solutions, coupled with a track record of implementing and enhancing reliability measures across all infrastructure ecosystems.


    This role demands hands-on experience in orchestrating resilient systems, fine-tuning performance, and implementing proactive strategies to mitigate potential downtimes or disruptions.

    The successful candidate will play a pivotal role in driving the reliability, efficiency, and scalability of infrastructure platform through innovative solutions and best-in-class practices.


    In return, you will gain exposure to the latest hardware and software technologies in a forward-thinking company, which values innovation, personal development and training.

    Key responsibilities of the role include:

    Leading efforts to enhance existing practices across teams, fostering collaboration and synchronization to optimize system reliability and scalability
    Driving strategies for enhancing systems performance, leveraging innovative approaches to improve efficiency and streamline processes
    Implementing best practices for system reliability, fault tolerance, and scalability, ensuring alignment with evolving industry standards
    Cultivating a culture of continuous improvement, encouraging regular reviews and iterative enhancements to tools, methodologies, and processes
    Enhancing incident response processes by conducting comprehensive reviews, implementing improvements, and integrating learned lessons into future strategies
    Leading efforts to optimize capacity planning strategies, ensuring systems are prepared for future scaling while maximizing resource utilization
    Collaborating with security teams to fortify and enhance security measures within systems, ensuring compliance with evolving policies and standards
    Collaborating effectively with other SREs within PaaS, and colleagues in different time zones.(Dallas and London)

    Who are we looking for?

    The successful candidate will be an experienced Platforms Reliability Engineer who is enthusiastic about contributing to an automated, scalable, reliable and high-performing Infrastructure and Platform as a Service:

    A strong desire to continually learn about new technologies, approaches, and systems, along with the agility to work across multiple teams
    A strong communicator with excellent written communications to technical and non-technical audiences
    A self-starter with excellent problem-solving skills
    Proficient in Go or other programming language such as Python, Rust or Java for automation and development tasks
    Extensive Linux, Networking and Infrastructure knowledge
    Experience with CI/CD (preferably Jenkins and ArgoCD) and Configuration Management tools, such as Ansible and Terraform
    Experience deploying and running applications on Docker and Kubernetes, including the creation of Helm charts
    Familiarity with monitoring tools like Prometheus, Grafana, Open Telemetry and the ELK stack (Elasticsearch, Logstash, Kibana), or similar
    Understanding of core SRE concepts and their implementation in platform engineering


    Beneficial experience would include:
    Experience building and bootstrapping an SRE organization across multiple teams
    Experience working on large-scale infrastructure to improve performance, stability and efficiency

    Why should you apply?

    Market-leading compensation plus annual discretionary bonus
    Informal dress code and excellent work/life balance
    Excellent paid time off allowance of 25 days
    Sick days, military leave, and family and medical leave
    Generous 401(k) plan
    16-weeks fully paid parental leave
    Medical and Prescription, Dental, and Vision insurance
    Life and Accidental Death & Dismemberment (AD&D) insurance
    Employee Assistance and Wellness programs
    Generous relocation allowance and support
    Great selection of office snacks, and hot and cold drinks
    On-site gym and car parking


  • Epsilon Grand Prairie, United States

    Sonova · Radolfzell am Bodensee, BW 78315 · posted 04/27/2024 · More... · front runner · Buyer/Planner Fertigungsdisponent:in (d/w/m) · Danaher · Bodman-Ludwigshafen, Baden-Württemberg 78351 · posted 04/27/2024 · More... · front runner · Manager: in Software Team (d/f/m) ...

  • Mass Staffing Projects

    Reliability Engineer

    3 weeks ago


    Mass Staffing Projects Dallas, United States

    Job Description · Looking for something reliable? One of our top mining clients is looking for a skilled and experienced Reliability Engineer to join their team in Free State. · Requirements Include: · Senior Certificate · Degree in Mechanical / Electrical Engineering · GCC ...

  • Mass Staffing Projects

    Reliability Engineer

    2 weeks ago


    Mass Staffing Projects Dallas, United States

    Job Description Looking for something reliable?One of our top mining clients is looking for a skilled and experienced Reliability Engineer to join their team in Free State.Requirements Include: · •Senior Certificate · •Degree in Mechanical / Electrical Engineering · •GCC Mines & ...


  • AllSTEM Connections Plano, United States

    SITE RELIABILTY ENGINEER · ON W2 · PLANO,TX/HOUSTON,TX/DELAWARE · HYBRID REPORTING: 3DAYS ONSITE · SKILLSET NEEDED: · AWS · BIG DATA · SPARK · PYTHON · SCRIPTING · SHELL · PERL · CONTROL-M · AUTOSYS · GRAFANA · ...


  • Global Mobility Services Dallas, United States

    ** We are not looking for C2C profiles at the moment** · Site Reliability Engineer · 7 months contract to start (potential for CTH) · Looking for hybrid resources local to Denver or Dallas. · Summary · Our client is actively seeking a skilled and experienced Site Reliability En ...


  • RTX Dallas, United States

    Date Posted: · :00 · Country: · United States of America · Location: · TX360: Dallas - North Bldg 13510 North Central Expressway North Building, Dallas, TX, 75243 USA · The Whole Life Engineering (WLE) Department is made up of several disciplines whose main objective is to ...


  • Saicon Consultants Dallas, United States

    Site Reliability Engineer (Buffer) · Location:Dallas, TX · Posted On: 11/08/2023 · Requirement Code: 66074 · Requirement Detail · Job Description: Site Reliability Engineer (Buffer) · • Bachelor's Degree in Computer Science or related; or equivalent combination of education and ...


  • Diverse Lynx Dallas, United States

    Job Title: Site Reliability Engineer · Location: Dallas, TX//Onsite · Duration: Full Time-Only · Job Description · Responsible for ensuring the reliability of systems, minimizing downtime, and maintaining service-level objectives (SLOs). · Developing, automation and implemen ...


  • Diverse Lynx Dallas, United States

    Job Title: Site Reliability Engineer · Location: Dallas, TX//Onsite · Duration: Full Time-Only · JOB DESCRIPTION: · Responsible for ensuring the reliability of systems, minimizing downtime, and maintaining service-level objectives (SLOs). · •Developing, automation and imple ...


  • Oldcastle BuildingEnvelope Dallas, United States

    Reliability Engineer, Remote · Who We Are · At OBE, together, we build excellence every day.We are driven by our passion to lead our industry and build a sustainable future, we focus on exceeding customer expectations and delivering innovative solutions. We succeed through the ...


  • Suncaptech Dallas, United States

    Role: Site Reliability Engineer · Location: Dallas, TX (Onsite) · 2 Positions available · Implement SRE practices · Identify, craft, and maintain SLIs and SLOs for teams, as well as metrics such as MTTR, Lead time for change, Deployment Frequency and Change Failure Rate · Work ...


  • K-Tek Resourcing LLC Dallas, United States

    Job Overview: · We are looking for a motivated Junior Operations Engineer to ensure the smooth operation of our software and systems. This role combines technical expertise with problem-solving skills to automate operational processes, enhance system functionality, and maintain t ...


  • WestRock Dallas, United States

    The Electrical Reliability Engineer is considered the local expert on the equipment within their area of assignment and must either have demonstrated capability unique to the equipment or be capable of quickly assimilating new information to become e Reliability Engineer, Electri ...


  • PMG, Inc. Dallas, United States

    PMG is a digital company that helps marketers connect people with their brand. Focused on people and grounded in data, our award-winning culture fosters meaningful careers. Partnering with the most iconic brands in the world, we put people at the center of everything we do to del ...


  • Juniper Networks Dallas, United States

    Job Description · Site Reliability Engineer 4 · Location: Anywhere in the United States · Juniper is changing what's possible in networking. We're going beyond building the networks customers expect - we're building the networks customers deserve. And the world is taking note. B ...


  • KTek Resourcing Dallas, United States

    Job Overview: · We are looking for a motivated Junior Operations Engineer to ensure the smooth operation of our software and systems. This role combines technical expertise with problem-solving skills to automate operational processes, enhance system functionality, and maintain t ...


  • Saxon Global Dallas, United States

    As a member of the Production Support/SRE team you will work cross-functionally amongst a variety of teams and be a core contributor in every significant engineering service or solution that we deliver to our stakeholders. You'll excel if you have enthusiasm for digging deep, and ...


  • Avature Dallas, United States

    WestRock – · Electrical Reliability Engineer, Dallas, TX · WestRock is a company where each of us genuinely belongs, is respected and valued, can do our best work, and where diversity, inclusion and equity are competitive advantages · The Opportunity: · The Electrical Reliabilit ...


  • RTX Dallas, United States

    Date Posted: · :00 · Country: · United States of America · Location: · TX360: Dallas - North Bldg 13510 North Central Expressway North Building, Dallas, TX, 75243 USA · The Whole Life Engineering (WLE) Department is made up of several disciplines whose main objective is to influe ...


  • Global Mobility Services Dallas, United States

    ** We are not looking for C2C profiles at the moment** · Site Reliability Engineer · 7 months contract to start (potential for CTH) · Looking for hybrid resources local to Denver or Dallas. · Summary · Our client is actively seeking a skilled and experienced Site Reliabilit ...