Site Reliability Engineer - Denver, United States - Remotely

Remotely Denver, United States

2 weeks ago

Description

This is a remote position.

Site Reliability Engineer (1 year experience, remote)

Be part of our future This job posting builds our talent pool for potential future openings. We'll compare your skills and experience against both current and future needs. If there's a match, we'll contact you directly. No guarantee of immediate placement, and we only consider applications from US/Canada residents during the application process.

Hiring Type:
Full-Time

Base Salary:
$61K-$71K Per Annum.

About The Job

As an SRE, you'll troubleshoot and resolve technical issues, optimize performance, and establish reliability-based release management processes.

The SRE role is the practical implementation of DevOps principles, where speed and stability are carefully balanced, and the team acts as versatile problem solvers, filling gaps in knowledge and expertise to ensure efficient software operations.

You will:
Apply SRE principles to maintain the reliability, availability, and performance of software systems.
Automate deployment processes, configuration management, and CI/CD pipelines to streamline software development and delivery.
Planned and assisted with the migration of Windows and Linux-based machines to containerized machines.
Plan and Assist with the overall Disaster Recovery (DR) of the infrastructure and operations (InfraOps).
Manage and maintain software infrastructure, ensuring proper configuration, security, and scalability.
Perform system administration tasks, monitor system performance, troubleshoot issues, and apply necessary fixes.

Act as a versatile problem solver, filling gaps in team knowledge and expertise to ensure smooth and efficient software operations.

Facilitate smooth team and project transitions, providing guidance, training, and support for development teams to manage their infrastructure independently.

Develop a reliability rating system to assess team and project performance, collecting and analyzing metrics to evaluate adherence to best practices.

Respond quickly and effectively to critical incidents, conducting post-incident reviews to identify root causes and implement preventive measures.
Develop and maintain automation tools and scripts to improve operational efficiency.
Identify performance bottlenecks and implement optimizations to enhance system response times and resource utilization.

Stay up to date with the latest industry trends, technologies, and best practices related to SRE, DevOps, and infrastructure management.

Collaborate effectively with cross-functional teams and communicate technical concepts and recommendations clearly to both technical and non-technical stakeholders.
Implement a reliability-based release management process, allowing teams with higher reliability scores to perform quick and frequent releases.
Proactively identify potential issues and implement preventive measures to reduce incidents and outages.
Implement observability practices to detect abnormal behaviors in the software and collect information for effective problem resolution.
Set and monitor critical metrics to gain insights into system reliability, including latency, traffic, errors, and saturation levels.
Establish Service-Level Objectives (SLOs) and measure Service-Level Indicators (SLIs) to assess the quality-of-service delivery and reliability.
Planned, participated, and managed on-call rotations to ensure prompt response to reported software issues.
Utilize incident response tools to categorize the severity of reported cases and handle them promptly.
Implement configuration management tools to automate software workflows and enhance team productivity.

Projects you could work on:
Implementing automated CI/CD pipelines for smooth software deployment.
Setting up and maintaining a reliable and scalable cloud infrastructure.
Designing and implementing the migration of physical machines to virtual machines.
Designing incident response procedures and post-incident review processes.
Developing automation tools to streamline repetitive tasks and improve team productivity.
Analyzing system performance metrics and optimizing resources for better efficiency.
Establishing observability practices to detect and resolve software issues proactively.
Defining SLOs and SLIs to assess service quality and reliability across projects.
Planning and managing on-call rotations to ensure timely issue resolution.
Configuring and maintaining software workflows using configuration management tools.

#J-18808-Ljbffr

Site Reliability Engineer

3 weeks ago

AXON-Networks Denver, United States

AXON Networks delivers a robust AI-driven, analytics-based orchestration platform and a wide portfolio of next-gen high-speed routers that leverage the newest Wi-Fi technologies. Together, these technologies give ISPs the ability to manage and troubleshoot their networks in real ...
Site Reliability Engineer @

3 weeks ago

Tilt Denver, United States

Site Reliability Engineer @ Tilt · Salary, Exempt - Remote (USA) · Tilt (check us out here) is looking for a Site Reliability Engineer (SRE) to join our team and help us scale our business by ensuring the reliability, scalability, and performance of our systems and services. · ...
Product Reliability Engineer

3 weeks ago

Palantir Technologies Denver, United States

A World-Changing Company · Palantir builds the world's leading software for data-driven decisions and operations. By bringing the right data to the people who need it, our platforms empower our partners to develop lifesaving drugs, forecast supply chain disruptions, locate missi ...
Service Reliability Engineer

1 week ago

The Bank of America Corporation Denver, United States

At Bank of America, we are guided by a common purpose to help make financial lives better through the power of every connection. Responsible Growth is how we run our company and how we deliver for our clients, teammates, communities and shareholders Reliability Engineer, Liabilit ...
Site Reliability Engineer

3 weeks ago

Cisco Denver, United States

#WeAreCisco and we're so happy you're thinking of joining us. Follow us on social · @WeAreCisco to learn more about what employees say about why we love where we work, or check Cisco out on Glassdoor for the latest reviews. · What You'll Do · Think back on the latest significa ...
Site Reliability Engineer

3 weeks ago

Entrust Denver, United States

Career Growth, Flexibility and Collaboration · Entrust is dedicated to keeping the world moving safely by enabling trusted identities, payments, and data protection around the globe. Headquartered in Minnesota, we offer our colleagues the ability to work globally, in a flexible ...
Site Reliability Engineer @

4 days ago

Tilt Denver, United States

Site Reliability Engineer @ Tilt · Salary, Exempt - Remote (USA) · Tilt (check us out here) is looking for a Site Reliability Engineer (SRE) to join our team and help us scale our business by ensuring the reliability, scalability, and performance of our systems and services. · ...
Senior Engineer, Reliability

5 days ago

The AES Corporation Denver, United States

Are you ready to be part of a company that's not just talking about the future, but actively shaping it? Join The AES Corporation (NYSE: AES), a · Fortune 500 company · that's leading the charge in the global energy revolution. With operations spanning · 14 countries , AES is ...
Product Reliability Engineer

3 weeks ago

Palantir Technologies Denver, United States

A World-Changing Company · Palantir builds the world's leading software for data-driven decisions and operations. By bringing the right data to the people who need it, our platforms empower our partners to develop lifesaving drugs, forecast supply chain disruptions, locate missi ...
Associate Reliability Engineer

2 days ago

AES Corporation Denver, United States

Are you ready to be part of a company that's not just talking about the future, but actively shaping it? Join The AES Corporation (NYSE: AES), a Fortune 500 company that's leading the charge in the global energy revolution. With operations spanning 14 countries, AES is committed ...
Site Reliability Engineer

3 weeks ago

Fruition Denver, United States

Fruition is a leader in software development with a focus on delivering high-quality web solutions for clients across various sectors. Our projects involve a mix of content management systems, including Drupal, WordPress, and custom Python and applications. We are currently seek ...
Site Reliability Engineer

2 days ago

Fruition Denver, United States

Fruition is a leader in software development with a focus on delivering high-quality web solutions for clients across various sectors. Our projects involve a mix of content management systems, including Drupal, WordPress, and custom Python and applications. We are currently seek ...
Site Reliability Engineer

4 days ago

Entrust Denver, United States

Career Growth, Flexibility and Collaboration · Entrust is dedicated to keeping the world moving safely by enabling trusted identities, payments, and data protection around the globe. Headquartered in Minnesota, we offer our colleagues the ability to work globally, in a flexible ...
Principal Engineer, Reliability Engineering

3 weeks ago

Western Digital Capital Denver, United States

Principal Engineer, Reliability Engineering · Full-time · Job Type (exemption status): Exempt position - Please see related compensation & benefits details below · Salary Range: 108, ,700.00 · Business Function: Reliability Engineering · At Western Digital, our vision is to powe ...
Service Reliability Engineer

1 week ago

Bank of America Denver, United States

Service Reliability Engineer · Denver, Colorado;Chicago, Illinois · **Job Description:** · At Bank of America, we are guided by a common purpose to help make financial lives better through the power of every connection. Responsible Growth is how we run our company and how we d ...
Site Reliability Engineer

3 weeks ago

Dat Services Inc Denver, United States

About DAT · DATis an award-winning employer of choice and a next-generation SaaS technology company that has been at the leading edge of innovation in transportation supply chain logistics for 45 years. We continue to transform the industry year over year, by deploying a suite of ...
Site Reliability Engineer

3 weeks ago

Splunk Denver, United States

Splunk is here to build a safer and more resilient digital world. The world's leading enterprises use our unified security and observability platform to keep their digital systems secure and reliable. While customers love our technology, it's our people that make Splunk stand out ...
Service Reliability Engineer

5 days ago

Hispanic Technology Executive Council Denver, United States

At Bank of America, we are guided by a common purpose to help make financial lives better through the power of every connection. Responsible Growth is how we run our company and how we deliver for our clients, teammates, communities and shareholders every day. · One of the keys t ...
Service Reliability Engineer

1 week ago

Hispanic Technology Executive Council Denver, United States

At Bank of America, we are guided by a common purpose to help make financial lives better through the power of every connection. Responsible Growth is how we run our company and how we deliver for our clients, teammates, communities and shareholders every day. · One of the keys ...
Site Reliability Engineer

3 days ago

Ping Identity Denver, United States

At Ping Identity, we're changing the way people think about enterprise security technology. With our new Identity Defined Security platform, we're building a borderless world where people have total freedom to work wherever and however they want. Without friction. Without fear. · ...

Site Reliability Engineer - Denver, United States - Remotely

Description

Site Reliability Engineer

Site Reliability Engineer @

Product Reliability Engineer

Service Reliability Engineer

Site Reliability Engineer

Site Reliability Engineer

Site Reliability Engineer @

Senior Engineer, Reliability

Product Reliability Engineer

Associate Reliability Engineer

Site Reliability Engineer

Site Reliability Engineer

Site Reliability Engineer

Principal Engineer, Reliability Engineering

Service Reliability Engineer

Site Reliability Engineer

Site Reliability Engineer

Service Reliability Engineer

Service Reliability Engineer

Site Reliability Engineer

brook beyene

Murtuza Syed

Tom Jessiman

for Recruiters

Information

Site Reliability Engineer - Denver, United States - Remotely

Description

Site Reliability Engineer professionals in Denver