- Work with stakeholders such as product owners to define service level objectives (SLOs) for system operations.
- Track performance against SLOs in partnership with monitoring teams or other stakeholders, and ensure systems continue to meet SLOs over time.
- Create dashboards and reports to communicate key metrics.
- Create software to improve performance, scalability, and stability of systems.
- Collaborate with development teams to promote the concept of reliability engineering during all phases of the software development lifecycle to detect and correct performance issues and meet availability goals.
- Design, code, test, and deliver software to automate manual operational work (i.e., "toil").
- Conduct blameless post mortems to troubleshoot priority incidents.
- Perform analytics on previous incidents to understand root causes and better predict and prevent future issues.
- Use automation to reduce the probability and/or impact of problem recurrence.
- Identify, evaluate, and recommend monitoring tools and diagnostic techniques to improve system observability.
- Participate in system design consulting, platform management, capacity planning and launch reviews.
- Collaborate and share lessons learned regarding performance and reliability issues with all stakeholders including developers, other SREs, operations teams, and project management teams.
- Participate in communities of practice to share knowledge and foster continuous improvement.
- Remain current with reliability engineering methods and trends such as observability-driven development and chaos engineering.
- Drive continuous improvement in software quality and infrastructure reliability and resilience.
- Strong problem solving and analytical skills.
- Strong interpersonal and written and verbal communication skills.
- Highly adaptable to changing circumstances. Interest in continuously learning new skills and technologies.
- Experience with programming and scripting languages (e.g. C#, JavaScript, Python, Bash, PowerShell).
- Experience with incident and response management.
- Experience with Agile and DevOps development methodologies.
- Experience with container technologies and supporting tools (e.g. Docker, Kubernetes, AKS).
- Experience with working in cloud ecosystems (e.g. VMware vSphere, Microsoft Azure).
- Experience with monitoring and observability tools (e.g. Azure Monitor, Dynatrace, Science Logic SL1).
- Experience with configuration management systems (e.g. Ansible, Bicep, Terraform).
- Experience working with continuous integration/continuous deployment tools (e.g. Git, Jenkin, Artifactory, Azure DevOps Pipelines, GitHub actions).
- Bachelor's degree (or equivalent years of experience).
- 2+ years of relevant work experience. SRE experience preferred.
- Medical, Dental, and Vision Insurance
- Paid Annual Leave, Sick Leave
- Flexible Spending Accounts
- Retirement funds with matching contribution
- Supplemental insurance policies, including legal, Life Insurance and AD&D among others
- Work Perks program including discounted movie and theme park tickets among other great deals
- Opportunities for further advancement within our organization
-
Site Reliability Engineer
1 week ago
Sentara Healthcare Virginia Beach, United StatesCity/State · Virginia Beach, VA · Overview · Work Shift · First (Days) (United States of America) · Role Overview · Site reliability engineers (SREs) are responsible for improving system reliability and resilience to make it faster and easier to develop and deploy new softw ...
-
Reliability Engineer
1 week ago
SubCom Portsmouth, United StatesJob Description · Are you looking for an opportunity with plenty of growth potential? Do you enjoy working in an exciting, fast-paced, collaborative environment? Are you interested in working with the world's most innovative companies to create a more connected world? · Connect ...
-
Reliability Engineer
1 week ago
SubCom Portsmouth, United StatesAre you looking for an opportunity with plenty of growth potential? Do you enjoy working in an exciting, fast-paced, collaborative environment? Are you interested in working with the world's most innovative companies to create a more connected world? · Connecting Continents. Impa ...
-
Site Reliability Engineer
1 week ago
Blu Omega Norfolk, United StatesBlu Omega is seeking a Site Reliability Engineer for a DOD program onsite in Norfolk, VA. · Candidates must have an active DoD Secret Clearance (or above) as a precondition of employment. · Introduction: We are seeking a skilled Site Reliability Engineer with at least 2 years o ...
-
Maintenance Reliability Engineer
2 days ago
Howmet Corporation Hampton, United StatesResponsibilities · Primary Purpose of Job · Under the direction of the Facilities Engineering & Maintenance Manager, this position is responsible for leading and directing maintenance resources to the fulfillment of Howmet Aerospace business objectives. This position leads data ...
-
Maintenance Reliability Engineer
2 days ago
Howmet Aerospace Hampton, United StatesPrimary Purpose of Job · Under the direction of the Facilities Engineering & Maintenance Manager, this position is responsible for leading and directing maintenance resources to the fulfillment of Howmet Aerospace business objectives. This position leads data analysis and improve ...
-
Maintenance Reliability Engineer
8 hours ago
Howmet Aerospace Hampton, United StatesPrimary Purpose of Job · Under the direction of the Facilities Engineering & Maintenance Manager, this position is responsible for leading and directing maintenance resources to the fulfillment of Howmet Aerospace business objectives. This position leads data analysis and improv ...
-
Plant Reliability Engineer
1 day ago
Howmet Aerospace Hampton, VA, United StatesUnder the direction of the Facilities Engineering & Maintenance Manager, this position is responsible for leading and directing maintenance resources to the fulfillment of Howmet Aerospace business objectives. This position leads data analysis and improvement efforts and is direc ...
-
Maintenance Reliability Engineer
1 day ago
Howmet Aerospace Hampton, VA, United StatesPrimary Purpose of Job Under the direction of the Facilities Engineering & Maintenance Manager, this position is responsible for leading and directing maintenance resources to the fulfillment of Howmet Aerospace business objectives. This position leads data analysis and improvem ...
-
Maintenance Reliability Engineer
1 day ago
Howmet Aerospace Hampton, VA, United StatesPrimary Purpose of Job Under the direction of the Facilities Engineering & Maintenance Manager, this position is responsible for leading and directing maintenance resources to the fulfillment of Howmet Aerospace business objectives. This position leads data analysis and improvem ...
-
Secret Senior Reliability Engineer
1 week ago
Insight Global Hampton, United Statesleader will provide in-house consulting activities to advise/counsel personnel on a variety of complicated processes/procedures, supported by detailed calculations and guided by cost-effective decisions and recommendations. · Plans, conducts and directs engineering research and ...
-
AWS GovCloud Senior Site Reliability Engineer
2 weeks ago
Elevance Health Norfolk, United States**AWS GovCloud Senior Site Reliability Engineer** · **Location:** This position will work a hybrid model (remote and in office one day per week). Ideal candidates will live within 50 miles of one of our Pulse Point locations in Atlanta, GA, Indianapolis, IN, Chicago, IL, Richmond ...
-
Logistician Ii
2 days ago
Koam Engineering Systems Norfolk, United States**Koam Engineering Systems, Inc. (KES Inc.) is an employee owned small business specializing in technology innovation and systems integration by combining innovative products and reliable engineering services. Headquartered in San Diego, California and with offices in Gig Harbor, ...
-
Site Inspector
1 week ago
Wallace Montgomery Norfolk, United States**WHY YOU SHOULD JOIN OUR TEAM** · **ABOUT WALLACE MONTGOMERY** · Since 1975, our multi-disciplined engineering organization has grown to become a recognized leader in planning, engineering, and construction management. As an Engineering News-Record (ENR) Top 500 design firm, our ...
-
NAES Corporation Point Comfort, United StatesPort Comfort Power Station is a cutting-edge peaking power plant boasting a capacity of 100MW. At the heart of this facility are two aero-derivative GE LM6000 simple cycle gas turbines, known for their efficiency and reliability. The plant is equipped with state-of-the-art Allen ...
-
Navy Lifecycle Analyst
1 week ago
Ironclad Defense Works Norfolk, United States*This is an actual open billet that we are immediately going to staff* · **Lifecycle Engineer / Senior Systems Analyst** · The Surface Maintenance Engineering, Planning, Program (SURFMEPP) maintains, monitors, and refines Class-wide and Individual-Ship Maintenance Plans for all n ...
-
Data Center Analyst
1 week ago
Hoag Memorial Hospital Presbyterian Newport Beach, United States**Salary Range**: $ $ /hour. Actual compensation may vary based on geographic location, work experience, skill level, and education. · **Primary Duties and Responsibilities** · The Data Center Facilities (Analyst) will provide end users, architects, engineers, contractors, and di ...
-
Raytheon Portsmouth, United States Full time**Date Posted**: · **Country**: · United States of America · **Location**: · RI101: 1847 W Main Rd 1847 W. Main Road Nimitz Building, Portsmouth, RI, 02871 USA · **Position Role Type**: · Hybrid · At Raytheon, the foundation of everything we do is rooted in our values and a highe ...
-
Control Systems Engineer
1 week ago
Amazon Virginia Beach, United StatesJob Description · The Reliability and Maintenance Engineering (RME) team is hiring for Control Systems Engineering Managers · At Amazon we believe that Every Day is still Day One We're working to be the most customer-centric company on earth. To get there, we need exceptionally ...
-
Lab Technician
2 days ago
Testengeer Incorporated Point Comfort, United States**Position Description: Lab Technician** · **About Taurus Industrial Group (Taurus)** · - Taurus is a leading technical services contractor in the energy and industrial sector. Our business has over 60 years' history and service excellence with an unwavering commitment to quality ...
Site Reliability Engineer - Virginia Beach, United States - Sentara
Description
City/State
Virginia Beach, VAOverview
Work Shift
First (Days) (United States of America)Role Overview
Site reliability engineers (SREs) are responsible for improving system reliability and resilience to make it faster and easier to develop and deploy new software capabilities. SREs focus especially on building automation to reduce manual effort and prevent operations incidents.
Key Responsibilities
Skills and Experience
Qualifications
Location
This position is remote for residents of AL, DE, FL, GA, ID, IN, KS, LA, ME, MD, MN, NC, NE, NH, ND, NV, OK, OH, PA, SC, SD, TN, TX, UT, VA, WA, WV, WI and WY.
SentaraOverview
For more than a decade, Modern Healthcare magazine has ranked Sentara Healthcare as one of the nation's top integrated healthcare systems.That's because we are dedicated to growth, innovation, and patient safety at more than 300 sites of care in Virginia and northeastern North Carolina, including 12 acute care hospitals.
SentaraBenefits
As the third-largest employer in Virginia, Sentara Healthcare was named by Forbes Magazine as one of America's best large employers.We offer a variety of amenities to our employees, including, but not limited to:
Sentara employees strive to make our communities healthier places to live. We are setting the standard for medical excellence within avibrant, creative, and highly productive workplace.For information about our employee benefits, please visit:Benefits - Sentara )
Join our teamWe are committed to quality healthcare, improving health every day, and providing the opportunity for training, development, and growth
Sentara Healthcare prides itself on the diversity and inclusiveness of its close to an almost 30,000-member workforce. Diversity, inclusion, and belonging is a guiding principle of the organization to ensure its workforce reflects the communities it serves.
Job Summary
Qualifications:
BLD - Bachelor's Level DegreeSkills
Sentara Healthcare prides itself on the diversity and inclusiveness of its close to an almost 30,000-member workforce. Diversity, inclusion, and belonging is a guiding principle of the organization to ensure its workforce reflects the communities it serves.
Per Clinical Laboratory Improvement Amendments (CLIA), some clinical environments require proof of education; these regulations are posted at for further information. In an effort to expedite this verification requirement, we encourage you to upload your diploma or transcript at time of application.
In support of our mission "to improve health every day," this is a tobacco-free environment.