- Must be a U.S. citizen or green-card holder
- 8+ years of professional experience
- Proven leadership track record
- Ability to pass a background check and obtain a public trust security clearance
- Proficient with cloud observability, including building monitoring and alerting frameworks (Grafana, Datadog, New Relic, etc.)
- Has built alert escalation plans and disaster recovery infrastructure, and has set up on-call rotations
- Proficient with implementing cloud infrastructure on Azure.
- Proficient with Terraform
- Experience with Linux and Bash scripting.
- Experience with Kubernetes (AKS)
- Substantial experience with programming languages like Python
- Experience with containerization technologies (e.g., Docker, containerd)
- Ability to develop the architecture for continuous integration and deployment as well as continuous monitoring
- Experience supporting scalable and elastic applications on distributed architectures.
- Strong understanding of, and ability to secure, systems at the application, network, and infrastructure layers.
- Experience managing network/compute/database infrastructure with infrastructure-as-code.
- Expert in basic Git operations such as cloning, creating branches, switching between branches, staging changes, committing, resetting, and merging.
- Ability to mentor & support junior members
- Proven ability to work under pressure and in fast-paced environments.
- Ability to operate and manage work, strategically reason, build relationships and influence others.
- Azure Certifications
- Preliminary Screen
- Technical Assessment & Review
- Client Review
Lead Azure Site Reliability Engineer - Washington, United States - Mechanicode
3 weeks ago
Description
We are looking for a Lead Azure Site Reliability Engineer (SRE) to enable efficient monitoring and observability of the CDC Azure infrastructure and applications.
The SRE will lead operations of the cloud environment with observability, IAC, and cloud-native best practices.
The engineer will be part of a larger effort to modernize the CDC DevOps enterprise framework, joining a team of 20 made up of data scientists, software engineers, product owners, and DevOps engineers.
Mechanicode is a remote-first company, and this role will be 100% remote.
W2-Salary: k
Required
Mechanicode's vision is to bring peace of mind with technology.
We do so by building self-healing cloud infrastructure, resilient enough to withstand failures and sufficiently predictable to resolve issues without human intervention.
We do that by making automation the cornerstone of our cloud solutions, significantly reducing workforce attrition, and introducing agile rapid-development conventions that improve the developer experience.
About Mechanicode
Mechanicode is a cloud digital services firm providing comprehensive DevSecOps, Cloud Native Engineering, IT Modernization & Automation services.
Founded by a former USDS engineer, Mechanicode has 13 years of experience developing innovative automation solutions that improve the feedback loop in the developer experience and applying AWS/Azure certified best practices for clients.
Mechanicode has experience in both the public and private sectors, providing modernization services that engage Agile best practices, scalable cloud architectures, and continuous integration & deployment standards.