Sr Site Reliability Engineer - Des Moines, United States - O'Reilly Auto Parts

    O'Reilly Auto Parts background
    Description
    The Senior Site Reliability Engineer brings a software engineering perspective to delivering quality operations at scale. Responsible for the availability and performance of the platforms and services of O'Reilly Auto Parts. Creates and defines monitoring and incident response tools and processes.

    The Senior Site Reliability Engineer will create a bridge between development and operations by applying a software engineering mindset to system administration.

    Provides leadership and expertise to Site Reliability Engineers, Network Operations Center, Application Development and Infrastructure team members. Time will be split between operations/on-call duties and developing systems and software that help increase site reliability and performance.

    Essential Job Functions

    Develop methodologies for building and operating highly available and scalable services.


    Work closely with Network Operations Center to develop monitoring tools, analyze root cause of incidents, and improve the Network Operations Center's ability to independently resolve issues.

    Ensure production services are defined, measured, monitored and maintained.

    Evaluate, build and modify automation for deploying and operating production services.

    Provide leadership in reducing and resolving production incidents.

    Mentor and develop site reliability engineers.

    Create a culture of reliability and high availability in the Information Technology department.

    Develop tools and procedures used to operate scalable services.

    Train application development resources to build more resilient applications.

    Work with infrastructure to create more scalable and resilient infrastructure.

    Identify opportunities to improve all operations processes.

    Proactively monitor and review application performance.

    Monitor specific metrics, set thresholds, and trigger alerts based on those thresholds.

    Collect and analyze logging and diagnostic information.

    Help the Network Operation Center develop better monitoring and incident resolution practices.


    Facilitate effective transition of services into production ensuring that all requirements have been met in accordance with O'Reilly's Change Management standards.

    Troubleshoot business and production issues.

    Properly document all incident responses.

    Provide updates and documentation to runbooks and operational manuals.

    Document mean time to recover (MTTR) and mean time to failure (MTTF).

    Participate in on-call rotations.

    Skills/Education/Knowledge/Experience/Abilities


    Required:
    Bachelor's Degree or equivalent work experience.

    7+ years of professional experience in Site Reliability, Linux Systems Administration, DevOps, or Infrastructure Engineering.

    Systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive.

    Experience with programming languages including Java, JavaScript and SQL.

    Experience with Shell Scripting such as Bash, Python or Ruby.

    Familiarity with automation and configuration management tools and frameworks.

    Excellent analytical and problem solving skills.

    Strong written and verbal communication skills.

    Demonstrate up-to-date expertise and apply this to the development, execution, and improvement of action plans.

    Must be well organized, detail oriented and able to self-prioritize work.

    Must exhibit a high degree of professionalism.

    Composed urgency in stressful situations.

    Desired

    ITIL Foundations Certification.

    CRE or CMRP Certifications.

    O'Reilly Auto Parts has a proven track record of growth and stability.

    O'Reilly is full of successful career stories and believes in a strong promote-from-within philosophy, encouraging you to grow your career along with the organization.

    Total Compensation Package

    Competitive Wages & Paid Time Off
    Stock Purchase Plan & 401k with Employer Contributions Starting Day One
    Medical, Dental, & Vision Insurance with Optional Flexible Spending Account (FSA)
    Team Member Health/Wellbeing Programs
    Tuition Educational Assistance Programs
    Opportunities for Career Growth

    O'Reilly Auto Parts is an equal opportunity employer.

    Hiring decisions are administered without regard to race, color, creed, religion, sex, pregnancy, sexual orientation, gender identity, age, national origin, ancestry, citizenship status, disability, veteran status, genetic information, or any other basis protected by applicable federal, state or local law.


    Reasonable Accommodations:

    Qualified individuals with known disabilities may be entitled to reasonable accommodation under the Americans with Disabilities Act and certain state or local laws.

    #J-18808-Ljbffr