- OpenShift or GKE (Google Kubernetes Engine)
- Expertise in SRE principles and know how to apply them to infrastructure (bridge between infrastructure and dev)
- SRE > reactionary, dealing with optimizations and issues once the applications are running
- Prometheus
DevOps/Site Reliability Engineer - Birmingham, United States - Saxon Global
Description
Must Have Technical Skills:
SUMMARY
The Site Reliability Engineer (SRE) is responsible for improving system reliability and
resilience. This role focuses on building automation to reduce manual effort and prevent
service-impacting incidents. The SRE combines software and systems engineering to
build and support large-scale, distributed, fault-tolerant systems. This role ensures that
critical platforms are available, reliable, and able to support a fast rate of improvement.
This role relies on monitoring platforms and is continually taking a holistic view of system
health and performance. The SRE will enhance and support cloud-based
transformations, and is focused on pushing capabilities forward, staying ahead of
customer needs and innovating for continuous improvement. The SRE provides
operational support and engineering for multiple large-scale distributed software
applications.
JOB DUTIES
•Gathers and analyzes metrics from monitoring platforms to assist in performance tuning
and fault tolerance.
•Partners with development teams to improve services through testing and release
procedures.
•Participates in system design, platform management and capacity planning.
•Balances feature development speed and reliability with service-level objectives.
•Works closely with the incident response team to restore service to normal operation.
•Understands debugging and applies troubleshooting skills.
•Investigates, blocks and rate-limits unwanted traffic.
•Utilizes monitoring systems and dashboards for proactive changes and alerting.
•Establishes continuous process improvement cycles where the process, performance,
and supporting technologies are reviewed and enhanced where applicable.
•Performs other duties as assigned.
EDUCATION & EXPERIENCE
Typically requires a bachelor's degree and five (5) to seven (7) years of experience in a
technology and/or software engineering role or an equivalent combination.
KNOWLEDGE, SKILLS, ABILITIES
•Understanding of Kubernetes, containers, clusters and elastic scalability.
•Expertise in SRE principles.
•Mindset of continually finding ways to drive scalability, stability, and performance.
•Cloud Services experience with Google Cloud Platform (GCP).
•Experience with API, service-based or microservice-based architecture.
•Proficiency in infrastructure, network, database, operating systems or security
troubleshooting and remediation.
•Architecture-level knowledge of Windows, Linux and infrastructure systems.
•Experience with production deployment, monitoring and operational support for enterprise-class applications (Dynatrace a plus).
•Experience working with Continuous Integration/Continuous Deployment (CI/CD) tools.
•Experience in performance diagnostics, capacity planning, performance architecture
design, performance tuning and performance monitoring.
•A strong mix of software engineering and operational support skills.
•Knowledge of web technologies - HTTP, proxies, Java, etc.
•Experience with Azure DevOps (ADO), Dynatrace, Prometheus, Terraform and Grafana.