Jobs
>
Dallas

    Senior Platform Reliability Engineer - Dallas, United States - G-Research

    Default job background
    Description

    Do you want to tackle the biggest questions in finance with near infinite compute power at your fingertips?

    G-Research is a leading quantitative research and technology firm, with offices in London and Dallas. We are proud to employ some of the best people in their field and to nurture their talent in a dynamic, flexible and highly stimulating culture where world-beating ideas are cultivated and rewarded.

    This is a hybrid role based in our new Dallas infrastructure hub where we work on the latest technologies in a cutting-edge environment.

    The role

    The Reliability Engineering team, part of our Platforms as a Service (PaaS) function, works with a variety of technologies, including multiple Kubernetes clusters, multiple database technologies, low latency networks and big data warehouse across multiple regions around the globe.

    We are actively seeking an experienced Site Reliability Engineer (SRE) with a proven track record in building up and bootstrapping SRE functions across multiple teams.

    We want an individual who excels in ensuring the robustness, scalability, and fault tolerance of large-scale infrastructure. The ideal candidate will have a comprehensive understanding of the intricacies involved in architecting, deploying, and maintaining high-performance solutions, coupled with a track record of implementing and enhancing reliability measures across all infrastructure ecosystems.

    This role demands hands-on experience in orchestrating resilient systems, fine-tuning performance, and implementing proactive strategies to mitigate potential downtimes or disruptions. The successful candidate will play a pivotal role in driving the reliability, efficiency, and scalability of infrastructure platform through innovative solutions and best-in-class practices.

    In return, you will gain exposure to the latest hardware and software technologies in a forward-thinking company, which values innovation, personal development and training.

    Key responsibilities of the role include:

    • Leading efforts to enhance existing practices across teams, fostering collaboration and synchronization to optimize system reliability and scalability
    • Driving strategies for enhancing systems performance, leveraging innovative approaches to improve efficiency and streamline processes
    • Implementing best practices for system reliability, fault tolerance, and scalability, ensuring alignment with evolving industry standards
    • Cultivating a culture of continuous improvement, encouraging regular reviews and iterative enhancements to tools, methodologies, and processes
    • Enhancing incident response processes by conducting comprehensive reviews, implementing improvements, and integrating learned lessons into future strategies
    • Leading efforts to optimize capacity planning strategies, ensuring systems are prepared for future scaling while maximizing resource utilization
    • Collaborating with security teams to fortify and enhance security measures within systems, ensuring compliance with evolving policies and standards
    • Collaborating effectively with other SRE's within PaaS, and colleagues in different time zones.(Dallas and London)
    Who are we looking for?

    The successful candidate will be an experienced Platforms Reliability Engineer who is enthusiastic about contributing to an automated, scalable, reliable and high-performing Infrastructure and Platform as a Service:
    • A strong desire to continually learn about new technologies, approaches, and systems, along with the agility to work across multiple teams
    • A strong communicator with excellent written communications to technical and non-technical audiences
    • A self-starter with excellent problem-solving skills
    • Proficient in Go or other programming language such as Python, Rust or Java for automation and development tasks
    • Extensive Linux, Networking and Infrastructure knowledge
    • Experience with CI/CD (preferably Jenkins and ArgoCD) and Configuration Management tools, such as Ansible and Terraform
    • Experience deploying and running applications on Docker and Kubernetes, including the creation of Helm charts
    • Familiarity with monitoring tools like Prometheus, Grafana, Open Telemetry and the ELK stack (Elasticsearch, Logstash, Kibana), or similar
    • Understanding of core SRE concepts and their implementation in platform engineering
    Beneficial experience would include:
    • Experience building and bootstrapping an SRE organization across multiple teams
    • Experience working on large-scale infrastructure to improve performance, stability and efficiency
    Why should you apply?
    • Market-leading compensation plus annual discretionary bonus
    • Informal dress code and excellent work/life balance
    • Excellent paid time off allowance of 25 days
    • Sick days, military leave, and family and medical leave
    • Generous 401(k) plan
    • 16-weeks' fully paid parental leave
    • Medical and Prescription, Dental, and Vision insurance
    • Life and Accidental Death & Dismemberment (AD&D) insurance
    • Employee Assistance and Wellness programs
    • Generous relocation allowance and support
    • Great selection of office snacks, and hot and cold drinks
    • On-site gym and car parking
    G-Research is committed to cultivating and preserving an inclusive work environment. We are an ideas-driven business and we place great value on diversity of experience and opinions.

    We want to ensure that applicants receive a recruitment experience that enables them to perform at their best. If you have a disability or special need that requires accommodation please let us know in the relevant section


  • Epsilon Grand Prairie, United States

    Sonova · Radolfzell am Bodensee, BW 78315 · posted 04/27/2024 · More... · front runner · Buyer/Planner Fertigungsdisponent:in (d/w/m) · Danaher · Bodman-Ludwigshafen, Baden-Württemberg 78351 · posted 04/27/2024 · More... · front runner · Manager: in Software Team (d/f/m) ...


  • Mass Staffing Projects Dallas, United States

    Job Description · Looking for something reliable? One of our top mining clients is looking for a skilled and experienced Reliability Engineer to join their team in Free State. · Requirements Include: · Senior Certificate · Degree in Mechanical / Electrical Engineering · GCC ...


  • Mass Staffing Projects Dallas, United States

    Job Description Looking for something reliable?One of our top mining clients is looking for a skilled and experienced Reliability Engineer to join their team in Free State.Requirements Include: · •Senior Certificate · •Degree in Mechanical / Electrical Engineering · •GCC Mines & ...


  • AllSTEM Connections Plano, United States

    SITE RELIABILTY ENGINEER · ON W2 · PLANO,TX/HOUSTON,TX/DELAWARE · HYBRID REPORTING: 3DAYS ONSITE · SKILLSET NEEDED: · AWS · BIG DATA · SPARK · PYTHON · SCRIPTING · SHELL · PERL · CONTROL-M · AUTOSYS · GRAFANA · ...


  • Global Mobility Services Dallas, United States

    ** We are not looking for C2C profiles at the moment** · Site Reliability Engineer · 7 months contract to start (potential for CTH) · Looking for hybrid resources local to Denver or Dallas. · Summary · Our client is actively seeking a skilled and experienced Site Reliability En ...


  • WestRock Dallas, United States

    The Electrical Reliability Engineer is considered the local expert on the equipment within their area of assignment and must either have demonstrated capability unique to the equipment or be capable of quickly assimilating new information to become e Reliability Engineer, Electri ...


  • Global Mobility Services Dallas, United States

    ** We are not looking for C2C profiles at the moment** · Site Reliability Engineer · 7 months contract to start (potential for CTH) · Looking for hybrid resources local to Denver or Dallas. · Summary · Our client is actively seeking a skilled and experienced Site Reliabilit ...


  • Avature Dallas, United States

    WestRock – · Electrical Reliability Engineer, Dallas, TX · WestRock is a company where each of us genuinely belongs, is respected and valued, can do our best work, and where diversity, inclusion and equity are competitive advantages · The Opportunity: · The Electrical Reliabilit ...


  • Global Mobility Services Dallas, United States

    ** We are not looking for C2C profiles at the moment** · Site Reliability Engineer · 7 months contract to start (potential for CTH) · Looking for hybrid resources local to Denver or Dallas. · Summary · Our client is actively seeking a skilled and experienced Site Reliability Eng ...


  • Highbrow Dallas, United States

    System Reliability Engineer (SRE) 2 · —> 5 to 7 years experience · Location :- Kansas City, Mi or Atlanta, GA or Dallas, Texas · Job Description: · We are seeking an experienced System Reliability Engineer (SRE) 2 to join our team. The ideal candidate will have 5 to 7 years of ...


  • Diverse Lynx Dallas, United States

    Job Title: Site Reliability Engineer · Location: Dallas, TX//Onsite · Duration: Full Time-Only · JOB DESCRIPTION: · Responsible for ensuring the reliability of systems, minimizing downtime, and maintaining service-level objectives (SLOs). · •Developing, automation and imple ...


  • Avature Dallas, United States

    WestRock (NYSE :WRK) is a global leader in sustainable paper and packaging solutions. We are materials scientists, packaging designers, mechanical engineers and manufacturing experts with a shared purpose: Innovate Boldly. Package Sustainably. Guided by our values of integrity, r ...


  • Avetta Dallas, United States

    Join Avetta as a Site Reliability Engineer · Site Reliability Engineers are pioneers of the production systems, we believe in proactive discovery and analysis of our entire stack, continually optimizing, tuning, and scaling the system for maximal end-user experience on a globall ...


  • Suncaptech Dallas, United States

    Role: Site Reliability Engineer · Location: Dallas, TX (Onsite) · 2 Positions available · Implement SRE practices · Identify, craft, and maintain SLIs and SLOs for teams, as well as metrics such as MTTR, Lead time for change, Deployment Frequency and Change Failure Rate · Work ...


  • Avetta Dallas, United States

    Join Avetta as a Site Reliability Engineer · Site Reliability Engineers are pioneers of the production systems, we believe in proactive discovery and analysis of our entire stack, continually optimizing, tuning, and scaling the system for maximal end-user experience on a globally ...


  • Juniper Networks Dallas, United States

    Job Description · Site Reliability Engineer 4 · Location: Anywhere in the United States · Juniper is changing what's possible in networking. We're going beyond building the networks customers expect - we're building the networks customers deserve. And the world is taking note. B ...


  • PMG, Inc. Dallas, United States

    PMG is a digital company that helps marketers connect people with their brand. Focused on people and grounded in data, our award-winning culture fosters meaningful careers. Partnering with the most iconic brands in the world, we put people at the center of everything we do to del ...


  • Goldman Sachs Dallas, United States

    MORE ABOUT THIS JOB: · Who we are: · At Goldman Sachs, our culture is one of teamwork, innovation and meritocracy. We often say our people are our greatest asset and we take pride in supporting each colleague both professionally and personally. From collaborative work spaces an ...


  • MegaMex Foods LLC Dallas, United States

    Job Description · Job DescriptionMaintenance Reliability & Projects Engineer will execute continuous improvements related to reliability and capital excellence efforts resulting in increased uptime and running rates. They are also responsible for the development, strategy, and le ...


  • Priceline Long Distance LLC Dallas, United States

    This role is eligible for our hybrid work model: Two days in-office. · Site Reliability Engineer (SRE) · Our Technology team is the backbone of our company: constantly creating, testing, learning and iterating to better meet the needs of our customers. If you thrive in a fast-p ...