SRE Engineer - Dearborn, MI, United States

Only for registered members Dearborn, MI, United States

3 days ago

Default job background
$90,000 - $170,000 (USD) per year *
* This salary range is an estimation made by beBee
Description · As a Site Reliability Engineer at Ford Motor Company, you will play a pivotal role in elevating the performance and dependability of our Marketing and Sales Tech platform and applications. In this essential position, your responsibilities will include closely collab ...
Job description
Description

As a Site Reliability Engineer at Ford Motor Company, you will play a pivotal role in elevating the performance and dependability of our Marketing and Sales Tech platform and applications. In this essential position, your responsibilities will include closely collaborating with diverse teams across the organization to fortify our systems, ensuring they are not only robust and scalable but also equipped to efficiently manage the complexities of a global customer base. Your expertise in site reliability will be crucial in driving ongoing enhancements to our technology landscape. This continuous improvement effort is vital to maintaining Ford's leadership in innovation within the automotive industry, helping us set standards in service reliability and satisfaction. Your contributions will directly impact the smooth operation and evolutionary growth of our MS Tech capabilities, aligning with Ford's commitment to excellence and innovation.

Responsibilities

Incident Management & Operational Excellence

  • 24/7 Response: Participate in a 24/7 on-call rotation, providing rapid response to critical incidents and ensuring high availability for the NA eCommerce platform.
  • Incident Triage: Act as a decisive member of triage teams to diagnose, troubleshoot, and resolve complex production issues, directly contributing to the reduction of Mean Time to Recovery (MTTR).
  • Standardization: Diligently execute and contribute to the continuous improvement of operational runbooks and Standard Operating Procedures (SOPs) to ensure consistent incident response.

Problem Management & Root Cause Analysis

  • Blameless Culture: Lead and participate in blameless post-mortems and Root Cause Analysis (RCA) sessions to identify systemic weaknesses and implement preventative measures.
  • Strategic Engineering: Partner with cross-functional development and platform teams to architect long-term reliability solutions based on RCA findings.

Service Level Management (SLIs, SLOs, & Error Budgets)

  • Reliability Framework: Define and track meaningful Service Level Indicators (SLIs) and Objectives (SLOs) to measure service availability and performance.
  • Stakeholder Alignment: Collaborate with Product Owners to establish acceptable service levels and manage Error Budgets to balance velocity with reliability.
  • Performance Insight: Provide critical analysis during monthly release reviews, evaluating the impact of changes on service health and SLO adherence.

Modern Observability & Monitoring

  • Full-Stack Visibility: Leverage and optimize Ford's observability suite (Dynatrace, GCP Logging, etc.) to monitor system health and proactively identify anomalies.
  • Gap Remediation: Identify and document observability blind spots, implementing technical solutions to ensure comprehensive system visibility.
  • Monitoring-as-Code: Manage and refine metric collection, dashboard creation, and alert definitions, utilizing Terraform to provision monitoring infrastructure following Ford standards.
  • Alerting Strategy: Design robust notification strategies and thresholds to alert stakeholders of KPI/SLO violations using Error Budget signals.

Automation & Reliability Engineering

  • Toil Reduction: Champion the elimination of manual, repetitive tasks by developing automation scripts, tools, and streamlined workflows.
  • Resilience Engineering: Design and implement self-healing mechanisms to automatically detect and remediate common system failures, reducing the need for manual intervention.
  • Next-Gen Tech: Implement and manage AI-driven observability solutions to enhance proactive system monitoring and predictive maintenance.

Collaboration & Communication

  • Cross-Functional Leadership: Coordinate with platform and engineering teams to resolve production bottlenecks and drive continuous process improvements.
  • Strategic Reporting: Deliver clear, data-driven status reports on system health, incident trends, and SRE initiatives to leadership and program stakeholders.
Qualifications
  • Education: Bachelor's in computer science or related field. 
  • Experience: Minimum of 5+ years of professional experience in Site Reliability Engineering, or DevOps. 
  • Cloud Fluency (GCP): Deep hands-on experience with Google Cloud Platform, specifically Cloud Run, GKE, and OpenShift. You understand the nuances of container orchestration and serverless architectures. 
  • Infrastructure as Code (IaC): Advanced proficiency in Terraform. You must have experience writing reusable modules, managing state, and automating infrastructure provisioning (not just running existing scripts). 
  • Observability Engineering: Experience in comprehensive system observability using primary telemetry – Metrics, Events, Logs and Traces.
    • Hands-on experience with Dynatrace (or similar APM tools like Datadog/New Relic), including distributed tracing, synthetic monitoring, and code-level profiling.
  • Coding & Scripting: Proficiency in at least one high-level programming language (Java, , Python, or Go). You will need to read application code to assist with instrumentation and debug complex production issues. 
  • Incident Management: Proven experience managing high-severity incidents. You understand the lifecycle of an incident: Triage -> Mitigation -> Resolution -> Blameless Post-Mortem (RCA).

Grade 7 or 8.

#LI-On-Site #LI-DS2 


Similar jobs

  • Work in company

    DevOps / SRE Engineer

    Only for registered members

    A Senior DevOps / SRE Engineer is sought to lead CI/CD, DevSecOps, and Site Reliability initiatives for large-scale vehicle and product launches. · ...

    Dearborn, MI

    6 days ago

  • Work in company

    Senior Site Reliability Engineer

    Only for registered members

    Description · Enterprise Technology plays a critical part in shaping the future of mobility. If you're looking for the chance to leverage advanced technology to redefine the transportation landscape, enhance customer experience and improve people's lives, this is the opportunity ...

    Dearborn, MI, United States $145,000 - $230,000 (USD) per year

    3 days ago

  • Work in company Remote job

    Cybersecurity Platform Engineering Manager

    Only for registered members

    Job Description · We are seeking a · Risk Engineering Manager of Others · to lead and scale a team of technically strong Risk Engineers focused on · engineering risk-reducing solutions · , not just defining processes or controls. This role is accountable for · people leadership, ...

    Dearborn, MI

    25 minutes ago

  • Work in company

    Back-end Software Development Engineering Senior Engineer

    Only for registered members

    The Back-end Software Development Senior Engineer is responsible for designing, developing and maintaining the back-end (server-side) components of enterprise applications. · Design develop test and deliver high-quality software solutions using modern tools languages frameworks a ...

    Dearborn, MI

    1 month ago

  • Work in company Remote job

    Senior Full Stack Software Engineer

    Only for registered members

    Senior Full Stack Software Engineer to lead the development of a greenfield telecom platform using Node / Angular / AWS. · ...

    Dearborn, MI

    1 week ago

  • Work in company

    DevOps Specialist

    Only for registered members

    This DevOps Senior Specialist will lead a team supporting cross-enterprise initiatives and large-scale vehicle and product launches. · ...

    Dearborn, MI

    6 days ago

  • Work in company

    DevOps Engineer

    Only for registered members

    We drive enterprise-wide value through data, automation and integrated engineering practices. In this role you will be a part of a team responsible for cross-enterprise initiatives and major product launches. · Implement and maintain next-generation Continuous Integration (CI) to ...

    Dearborn, MI

    6 days ago

  • Work in company

    Cloud DevOps Engineer

    Only for registered members

    +Job summary · We believe freedom of movement drives human progress. We also believe in providing you with the freedom to define and realize your dreams. · +Setting up, maintaining, and optimizing pipelines (Jenkins/GitHub Actions) · Deploy updates and fixes as required to ensure ...

    Dearborn, MI

    6 days ago

  • Work in company

    Architect – Supply Chain

    Only for registered members

    The Enterprise Architect – Supply Chain will collaborate with Product and Software Delivery Teams to define and implement the architecture for Ford's Supply Chain Management. · Define business and technical architecture solutions to modernize Ford's Supply Chain in alignment with ...

    Dearborn, MI

    1 month ago

  • Work in company

    Architect – Supply Chain

    Only for registered members

    The Enterprise Architect – Supply Chain will collaborate with Product and Software Delivery Teams to define and implement the architecture for Ford's Supply Chain Management. · ...

    Dearborn, MI, United States

    1 week ago

  • Work in company

    Customer Identity Platform Engineer

    Only for registered members

    We are the movers of the world and the makers of the future. · We get up every day, roll up our sleeves and build a better world -- together. · At Ford, we're all a part of something bigger than ourselves. ...

    Dearborn, MI

    1 month ago

  • Work in company

    Customer Identity Platform Engineer

    Only for registered members

    Description · We are the movers of the world and the makers of the future. We get up every day, roll up our sleeves and build a better world -- together. At Ford, we're all a part of something bigger than ourselves. Are you ready to change the way the world moves? · Enterprise Te ...

    Dearborn, MI, United States

    3 days ago