-
Site Reliability Engineer
12 hours ago
Goldman Sachs Salt Lake City, United StatesMORE ABOUT THIS JOB: · Site Reliability Engineering (SRE) is an engineering discipline that combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. At Goldman Sachs, SRE is responsible for the availability and reliab ...
-
Fixed Equipment Reliability Engineer
12 hours ago
Big West Oil North Salt Lake, United StatesExperienced Fixed Equipment Reliability engineer to develop and support a developing reliability system. Position is responsible for daily support activities for the refinery asset, as well as developing philosophies, work processes, and special emphasis programs. Responsible for ...
-
Site Reliability Engineer
3 weeks ago
Technology Search Group, Inc. Salt Lake City, United StatesAbout the job Site Reliability Engineer (SRE) · Responsibilities · Responsible for collaborating with businesspeople to have a real time understanding of business problems and expected to focus on agile methodology of development. Deliver high quality change within the deadlines ...
-
Senior Systems Engineer and Reliability
6 days ago
General Electric Company Salt Lake City, United StatesJob Description Summary · At GE HealthCare, our passionate people are creating the products, solutions and services our customers need to deliver the best patient care possible. · As part of the Engineering Organization, the Reliability Architect is primarily responsible for su ...
-
Senior Systems Engineer and Reliability
2 weeks ago
GE Healthcare Salt Lake City, United StatesJob Description Summary · At GE HealthCare, our passionate people are creating the products, solutions and services our customers need to deliver the best patient care possible. · As part of the Engineering Organization, the Reliability Architect is primarily responsible for sup ...
-
Fixed Equipment Reliability Engineer
3 weeks ago
Big West Oil Salt Lake City, United StatesExperienced Fixed Equipment Reliability engineer to develop and support a developing reliability system. Position is responsible for daily support activities for the refinery asset, as well as developing philosophies, work processes, and special emphasis programs. Responsible for ...
-
Battelle Applied Solutions, LLC Salt Lake City, United StatesRequisition Id 11976 · Overview: · Are you looking for a way to use your hard-earned SRE skills in a more ambitious environment where you can also help protect national security? The National Center for Computational Sciences (NCCS) at Oak Ridge National Lab (ORNL), which hosts ...
-
Site Reliability Engineer
3 weeks ago
Breeze Airways Midvale, United StatesWorking at Breeze Airways is an exciting endeavor and a serious commitment to bring "The World's Nicest Airline" to life. We work cross-functionally with truly awesome Team Members to deliver on our mission: · "To make the world of travel simple, affordable, and convenient. Impr ...
-
Battelle Applied Solutions, LLC Salt Lake City, United StatesRequisition Id 11979 · Overview: · The National Center for Computational Sciences (NCCS) at Oak Ridge National Lab (ORNL), which hosts several of the world's most powerful computer systems, is seeking highly qualified individuals to play a key role in improving the security, pe ...
-
Site Reliability Engineer
1 week ago
Avetta Lehi, United StatesJoin Avetta as a Site Reliability Engineer · Site Reliability Engineers are pioneers of the production systems, we believe in proactive discovery and analysis of our entire stack, continually optimizing, tuning, and scaling the system for maximal end-user experience on a globall ...
-
Senior Site Reliability Engineer
2 weeks ago
Collective Health Lehi, United StatesWhat you'll do: · Establish service level indicators and data-driven objectives, and develop SRE standards and processes to uphold and improve uptime, latency, and system health. · Define and execute initiatives to continuously improve our deployed cloud footprint in areas such a ...
-
Sr. Site Reliability Engineer
2 weeks ago
Vivint Lehi, United States Full timeJob Description · Responsibilities · Improve and maintain infrastructure for containerized microservice environments · Troubleshoot and debug issues with a focus on resolving problems quickly with minimal impact to customers and developers · Manage processes, systems, and infr ...
-
Sr. Site Reliability Engineer
2 weeks ago
Vivint Lehi, United StatesJob Description · Welcome to the intersection of energy and home services. At NRG, we're driven by the idea of a smarter, cleaner, more connected future-and the possibilities that will bring to the world and to the 7.3 million customers we serve. · Vivint Smart Home, an NRG-owne ...
-
Final thesis process development
1 week ago
Nexus Innovations Salt Lake City, United StatesAbout the Company · We are an essential partner in the field of engine management for combustion engines, generators, and turbines, with our experience and innovative power in control and drive technology. With dedication, we develop the perfect solution for emission reduction an ...
-
Sr. Site Reliability Engineer
2 weeks ago
Vivint Lehi, United StatesWelcome to the intersection of energy and home services. At NRG, were driven by the idea of a smarter, cleaner, more connected futureand the possibilities that will bring to the world and to the 7.3 million customers we serve. Vivint Smart Home, an Reliability Engineer, Liability ...
-
Staff Site Reliability Engineer
2 weeks ago
Vivint Lehi, United StatesWelcome to the intersection of energy and home services. At NRG, were driven by the idea of a smarter, cleaner, more connected futureand the possibilities that will bring to the world and to the 7.3 million customers we serve. Vivint Smart Home, an Reliability Engineer, Liability ...
-
Staff Site Reliability Engineer
2 weeks ago
Vivint Lehi, United StatesJob Description · Welcome to the intersection of energy and home services. At NRG, we're driven by the idea of a smarter, cleaner, more connected future-and the possibilities that will bring to the world and to the 7.3 million customers we serve. · Vivint Smart Home, an NRG-owne ...
-
Reliability Superintendent
1 week ago
Orion Talent Salt Lake City, United StatesThe overall objective of the Reliability Superintendent is to ensure the safe and reliable operation of production facilities at the lowest life cycle cost. You will be responsible for managing a team of Industrial Mechanics, Electrical Technicians, and Instrumentation techs. · ...
-
Quality Engineering Manager
2 weeks ago
Safeguard Global Recruiting Salt Lake City, United StatesQuality Engineering Manager · Salt Lake City · About Us: Join our rapidly growing medical device design house and manufacturing facility, where innovation and dedication to high-quality products drive our success. Our energetic team is committed to exceeding customer and patient ...
-
Reliability Data Analyst
1 week ago
JLL Salt Lake City, United States Full timeJLL supports the Whole You, personally and professionally. · Our people at JLL are shaping the future of real estate for a better world by combining world class services, advisory and technology to our clients. We are committed to hiring the best, most talented people in our ind ...
Intermediate Site Reliability Engineer - Salt Lake City, United States - ARCS
Description
Join our client's vibrant team in Cape Town as an Intermediate Site Reliability Engineer (SRE II). Operating mostly remotely, their team occasionally collaborates in the office for direct engagement. Your role involves achieving operational excellence through automation tooling (e.g., Terraform). You'll contribute to architectural discussions, keeping your skills current for impactful contributions.Their Infrastructure & Software Stack:
Kubernetes running on Google Kubernetes Engine (GKE)
Prometheus, Grafana, Elastic, Kibana
CI/CD with Jenkins
Kong API Gateway
LogDNA
Falco
MongoDB Atlas
Microservice Architecture with Event Sourcing and CQRS
Containers running Kotlin, Python, Javascript (and a bit of Golang)
Your responsibilities will include:
Being part of their security incident response team
Managing their identity platform and enabling enterprise user and system authentication and authorization using OAuth2
Working effectively with the development team to plan and deploy required infrastructure changes or new capabilities ahead of time and unblocking the development team when unforeseen infrastructure blockers arise
Performing high-quality, ego-free code reviews drive visibility, testing, and improvement initiatives
Writing operational tooling to automate otherwise manual processes (e.g., Golang, Bash)
Writing, testing, and executing change control plans for production changes with an eye for detail to spot potential issues
Debug production issues
Being part of their on-call rotation. When on-call, you will work on repaying technical debt and deal with operational incidents as and when they occur. This will require you to have or acquire a good general knowledge of production operations for technical support.
#J-18808-Ljbffr