Site Reliability Engineer, ML Compute SRE - Durham

Only for registered members Durham, United States

2 days ago

Default job background
Full time $141,000 - $202,000 (USD)
Minimum qualifications: · Bachelor's degree in Computer Science, a related field, or equivalent practical experience. · 2 years of experience with software development in one or more programming languages (e.g., Golang). · Experience with debugging and troubleshooting software is ...
Lorem ipsum dolor sit amet
, consectetur adipiscing elit. Nullam tempor vestibulum ex, eget consequat quam pellentesque vel. Etiam congue sed elit nec elementum. Morbi diam metus, rutrum id eleifend ac, porta in lectus. Sed scelerisque a augue et ornare.

Donec lacinia nisi nec odio ultricies imperdiet.
Morbi a dolor dignissim, tristique enim et, semper lacus. Morbi laoreet sollicitudin justo eget eleifend. Donec felis augue, accumsan in dapibus a, mattis sed ligula.

Vestibulum at aliquet erat. Curabitur rhoncus urna vitae quam suscipit
, at pulvinar turpis lacinia. Mauris magna sem, dignissim finibus fermentum ac, placerat at ex. Pellentesque aliquet, lorem pulvinar mollis ornare, orci turpis fermentum urna, non ullamcorper ligula enim a ante. Duis dolor est, consectetur ut sapien lacinia, tempor condimentum purus.
Get full access

Access all high-level positions and get the job of your dreams.



Similar jobs

  • Work in company

    Site Reliability Engineer, ML Compute SRE

    Only for registered members

    Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. · Experience driving progress, solving problems, and mentoring more junior team members; · Bachelor's degree in Computer Scien ...

    Durham $141,000 - $202,000 (USD)

    21 hours ago

  • Work in company

    Site Reliability Engineer, ML Compute SRE

    Only for registered members

    Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, · massively distributed, fault-tolerant systems. · ...

    Durham, NC

    1 day ago

  • Work in company

    Site Reliability Engineer, ML Compute SRE

    Only for registered members

    +Job summary · Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. · +Design/develop new features to help us support ML operations across Technical Infrastructure (TI) and Google ...

    Raleigh $141,000 - $202,000 (USD)

    2 days ago

  • Work in company

    Senior Software Engineer, Storage, Site Reliability Engineering

    Only for registered members

    Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. · Design, plan, and execute on software engineering projects that help products operate efficiently and reliably inside of Goo ...

    Durham $166,000 - $244,000 (USD) Full time

    1 month ago

  • Work in company

    Senior Software Engineer, Storage, Site Reliability Engineering

    Only for registered members

    Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. · ...

    Durham $166,000 - $244,000 (USD)

    1 month ago

  • Work in company

    Senior Software Engineer, Storage, Site Reliability Engineering

    Only for registered members

    Durham, NC, USA; Raleigh, NC, USA. · Design software engineering projects that help products operate efficiently and reliably inside of Google's data center. · ...

    Durham, NC

    1 month ago

  • Work in company

    Director, Site Reliability Engineering

    Only for registered members

    We are looking for a Systems Thinking, SRE Leader who has helped teams scale through production insights, operational automation, developer guidance, real-time metrics, · automation, automation, automation. · Bachelor's degree or higher in a technology related field required. · 1 ...

    Durham, NC

    3 weeks ago

  • Work in company

    Director, Site Reliability Engineering

    Only for registered members

    This position will be managing & leading SREs & Production Support Engineers. The team comes from diverse technical backgrounds, and the responsibilities provide the opportunity for a variety of challenges. · ...

    Durham Full time

    4 weeks ago

  • Work in company

    Manager, OCI AI Data Platform

    Only for registered members

    +Job summary · Toshiba Global Commerce Solutions is seeking a Manager to serve as the platform owner for TGCS's enterprise data lakehouse and AI/analytics environment on Oracle Cloud Infrastructure. · +ResponsibilitiesDefine and own the AIDP platform roadmap service catalog archi ...

    Durham Full time

    1 week ago

  • Work in company

    Systems Support Engineer

    Only for registered members

    The System Support Engineer will support application support, cloud enablement and provide critical production support.You will have the opportunity to demonstrate all your skills in engineering modernizing and testing applications. · ...

    Durham

    2 weeks ago

  • Work in company

    Manager, OCI AI Data Platform

    Only for registered members

    Toshiba Global Commerce Solutions seeks a Manager for its AI Data Platform (AIDP) on Oracle Cloud Infrastructure. This role oversees the platform's strategy, governance, operations and cost management to ensure secure and scalable data services supporting analytics and enterprise ...

    Durham, NC

    1 week ago

  • Work in company

    Site Reliability

    Only for registered members

    Aspida is seeking a Site Reliability & Infrastructure Automation Engineer to support and modernize our environment across both traditional infrastructure and modern cloud/SRE practices. · ...

    Durham

    1 month ago

  • Work in company

    Senior Production Engineer

    Only for registered members

    This is a production engineering leadership role focused on distributed system correctness, · resilience, and performance engineering.You will partner closely with existing SRE · and Operations leaders, but your charter is to engineer prevention by building · production readiness ...

    Durham

    2 weeks ago

  • Work in company

    Senior Site Reliability Engineer

    Only for registered members

    This is a production engineering leadership role focused on distributed system correctness, resilience, and performance engineering. As AI accelerates development velocity, the bottleneck shifts from writing code to verifying correctness, performance, and safe behavior under fail ...

    Durham

    1 week ago

  • Work in company

    Site Reliability

    Only for registered members

    We're seeking a Site Reliability & Infrastructure Automation Engineer to support and modernize our environment across both traditional infrastructure and modern cloud/SRE practices. · ...

    Durham, NC

    4 days ago

  • Work in company

    Site Reliability

    Only for registered members

    Aspida is seeking a Site Reliability & Infrastructure Automation Engineer to support and modernize our environment across both traditional infrastructure and modern cloud/SRE practices. · Design and maintain synthetic monitoring for critical applications, services, and APIs. · Bu ...

    Durham, NC

    1 week ago

  • Work in company

    Senior Production Engineer

    Only for registered members

    This is a production engineering leadership role focused on distributed system correctness,resilience,and performance engineering. · Bachelor's degree in Computer Science,Engineering, · or equivalent practical experience. · ...

    Durham, NC

    2 weeks ago

  • Work in company

    Oracle Database Administrator Hybrid

    Only for registered members

    The application window is expected to close on: 02/15/2026 · Job posting may be removed earlier if the position is filled or if a sufficient number of applications are received. · This role is required to be hybrid in Raleigh North Carolina Research Triangle Park Days onsite in t ...

    Durham $118,700 - $222,600 (USD)

    1 week ago

  • Work in company

    Senior Backend Software Engineer, OpenShift Managed Services

    Only for registered members

    We're looking for a Senior Backend Software Engineer to join the ROSA Fleet Manager team. You will develop new features and maintain our Red Hat OpenShift Container Platform customer services. · Play an active part in developing various projects around OpenShift Customer services ...

    Durham, NC

    2 weeks ago

  • Work in company

    Director, Site Reliability Engineering

    Only for registered members

    Fidelity's Site Reliability Engineering group combines operations excellence with development experience to deliver services at high scale and availability using automation and infrastructure code. · ...

    Durham

    3 weeks ago