Jobs
>
Fountain Valley

    10471 - Manager, SRE - Fountain Valley, United States - Hyundai Autoever America

    Default job background
    Description

    Job Description

    Job Description

    10471 – Manager, SRE

    Purpose:

    The Site Reliability Engineering (SRE) Manager will be working with the development & operations team, focusing on ensuring that connected car systems are working as expected and the underlying infrastructure and network is running smoothly. This role is responsible for the day-to-day operations of the DevOps team and combines a mix of project management, team management, and engineering duties. The DevOps team are subject-matter experts within Telematics domain and provide insight and engineering advice to development and product teams, with a goal to create a highly reliable and scalable software system that can run with minimum failure

    Essential Functions:

    • Act as primary point-of-contact (PoC) on all connected card infrastructure operations and projects
    • Work collaboratively with software engineering to define infrastructure and deployment requirements; be a sounding board and provide recommendations for engineering team around infrastructure design and deployment.
    • They first set a goal to create a highly reliable and scalable software system that can run with minimum failure
    • Gather and analyze metrics from both operating systems and applications to assist in performance tuning and fault finding
    • Be the driving force behind our automation and observability initiatives. Build tools and automation that eliminate repetitive tasks and prevent incident occurrence.
    • Build and maintain operational tools for deployment, monitoring, and analysis of connected car infrastructure and systems
    • Perform infrastructure cost analysis and optimization
    • Provide project management, sprint planning, and road-mapping support to the DevOps team
    • Activities include designing, developing, installing, and maintaining software solutions.
    • Work with engineering teams to refine deployment and release processes.
    • Collaborate with the engineering team on projects as the expert on reliability, performance, and efficiency.
    • Manage on-call rotations across connected car applications, using a follow-the-sun model.
    • Participate in 24x7 operational support and on-call rotation shifts.
    • Ensure that all system design and procedures are documented and up to date.
    • Monitor and stress test systems to collect metrics for tuning and capacity planning.
    • Work to automate detection and resolution of recurring issues.
    • Ensure safety, predictability, repeatability, and auditability of all build and deploy processes.
    • Partner with development teams to improve services through rigorous testing and release procedures
    • Participate in system design consulting, platform management, and capacity planning
    • Create sustainable systems and services through automation and uplifts
    • Balance feature development speed and reliability with well-defined service level objectives

    Basic requirements:

    • Bachelor's or Master's degree or equivalent in the field of computers, information systems or related degree.
    • 2+ experience as a manager or PM or in a Technical Leadership capacity, preferably in automobile industry within the Telematics domain.
    • Programming experience with one or more high level languages, such as Python, Java, C/C++, Ruby, and JavaScript
    • Proven track record of designing, building, optimizing, and maintaining infrastructure on a large scale.
    • Experience with distributed systems in a production operations environment
    • Expertise analyzing complex application, database, network, and OS issues across a distributed large scale customer facing system
    • Strong communication skills and ability to work effectively across multiple business and technical teams
    • Demonstrated ability to deliver results on time with high quality
    • Extensive experience leading customer facing systems in a high uptime 24/7 environment
    • A depth and breadth of experience with server-side Java development, Oracle and distributed databases
    • A well-developed understanding of the theory and principles of operation of the internet and packet data protocols.
    • Exposure to Cloud, SaaS, and virtualization concepts and performance concerns.
    • Working knowledge of operating system design, processes, and threading model.
    • Knowledge of defining and monitoring system quality measures, including SLO and SLA.
    • Built tooling to improve reliability of systems, automated remediation of issues, or improve scalability.
    • Experience with different flavors of Linux, i.e. RedHat, Ubuntu, CentOS, etc.
    • Hands-on experience collecting performance data, analyzing, troubleshooting, and tuning.
    • Experience with the operations of application with high concurrency, scalability, or availability requirements.
    • Experience leading high performing engineering teams.
    • Experience with containers and container orchestration tools (Docker, Kubernetes)
    • Experience with MySQL, Elasticsearch, Couchbase, Mongo and Redis

    Nice to have:

    • Experience with stream-processing open-source frameworks/systems, i.e. Kafka, Spark, etc.
    • Experience with distributed storage technologies like NFS, HDFS, S3 as well as dynamic resource management frameworks (Mesos, Kubernetes, Yarn)

    Salary Range - $112,830 to $173,756

    Powered by JazzHR

    HBkbR0naPy



  • Hyundai AutoEver America Fountain Valley, United States

    Manager, SRE · Purpose: · The Site Reliability Engineering (SRE) Manager will be working with the development & operations team, focusing on ensuring that connected car systems are working as expected and the underlying infrastructure and network is running smoothly. This role ...

  • Avestacs

    Usa - Sre Director

    6 days ago


    Avestacs Los Angeles, United States

    **Job Title**: SRE Director · **Location**: Tempe, AZ or Los Angeles, CA · **Type**: Fulltime · We're on the lookout for visionary individuals to join our pioneering team, tasked with shaping the future of streaming products. Now is your chance to be part of creating and deliveri ...


  • Google Los Angeles, United States

    This role may also be located in our Playa Vista, CA campus. · **Minimum qualifications**: · - Bachelor's degree in Computer Science, Engineering, related technical field, or equivalent practical experience. · - 5 years of experience in a technical project management or a custome ...


  • Cox Automotive Westminster, United States

    This position is · hybrid · and can work from any of the following office locations: · Atlanta, GA; Burlington, VT; Irvine, CA; Mission, KS. · We are seeking a seasoned Site Reliability Engineering (SRE) Manager to lead our team of talented engineers. The SRE Manager will be ...


  • Anduril Costa Mesa, United States Full time

    · Anduril Industries is a defense technology company with a mission to transform U.S. and allied military capabilities with advanced technology. By bringing the expertise, technology, and business model of the 21st century's most innovative companies to the defense industry, And ...

  • First American

    DevOps Manager

    3 days ago


    First American Santa Ana, United States

    Who We AreCome join First American's Digital Title Group, newly formed to re-imagine and digitize the title search and examination process through Big Data, AI, document automation and modern, cloud-native application development. As a market leading title insurance company, powe ...


  • First American Santa Ana, United States

    Who We Are · Join a team that puts its People First Since 1889, First American (NYSE: FAF) has held an unwavering belief in its people. They are passionate about what they do, and we are equally passionate about fostering an environment where all feel welcome, supported, and emp ...


  • First American Santa Ana, United States

    Who We Are · Join a team that puts its People First Since 1889, First American (NYSE: FAF) has held an unwavering belief in its people. They are passionate about what they do, and we are equally passionate about fostering an environment where all feel welcome, supported, and emp ...


  • Kinetic Personnel Group Santa Ana, United States

    Job Description · Job DescriptionKinetic Personnel Group is recruiting for a Systems Administrator for a $5-billion/year Health Agency in the Santa Ana, California area. This health agency is renowned for the work it does in the community and being a great place to work. · System ...

  • O. C. Credit Union

    DevOps Engineer

    1 week ago


    O. C. Credit Union Santa Ana, United States

    Exciting Opportunity at Orange County's Credit Union · CREDIT UNION'S PURPOSE: Simple Banking. For People, Not Profit. · CREDIT UNION'S CORE VALUES: Integrity, Service Excellence, Growth & Development, High Performance, Mutual Respect, Community, and Fun · Workplace Excellence. ...

  • Orange County's Credit Union

    DevOps Engineer

    3 days ago


    Orange County's Credit Union Santa Ana, United States

    Job Description · Job DescriptionExciting Opportunity at Orange County's Credit UnionCREDIT UNION'S PURPOSE: Simple Banking. For People, Not Profit. · CREDIT UNION'S CORE VALUES: Integrity, Service Excellence, Growth & Development, High Performance, Mutual Respect, Community, and ...


  • Chipotle Mexican Grill Newport Beach, United States

    Sr Manager, Quality Engineering (Reliability Engineering · Description · CULTIVATE A BETTER WORLD · Food served fast does not have to be a typical fast-food experience. Chipotle has always done things differently, both in and out of our restaurants. We are changing the face of f ...


  • Scaleway Newport Beach, United States

    Fondée en 1999, Scaleway est la filiale cloud du groupe Iliad, l'un des leaders des télécommunications en Europe. Notre mission est de favoriser une industrie numérique plus responsable en aidant les développeurs et les entreprises à créer, déployer et adapter des applications à ...


  • Scaleway Newport Beach, United States

    Fondée en 1999, Scaleway est la filiale cloud du groupe Iliad, l'un des leaders des télécommunications en Europe. Notre mission est de favoriser une industrie numérique plus responsable en aidant les développeurs et les entreprises à créer, déployer et adapter des applications à ...


  • Scaleway Newport Beach, United States

    Fonde en 1999, Scaleway est la filiale cloud du groupe Iliad, lun des leaders des tlcommunications en Europe. Notre mission est de favoriser une industrie numrique plus responsable en aidant les dveloppeurs et les entreprises crer, dployer et adapter des applications n'importe qu ...


  • Broadcom Corporation Newport Beach, United States

    Please Note: · 1. If you are a first time user, please create your candidatelogin account before you apply for a job. (Click Sign In > Create Account) · 2. If you already have a Candidate Account, please Sign-In before you apply. · Job Description: The Elevator Pitch: Why woul ...


  • Scaleway Newport Beach, United States

    Fondée en 1999, Scaleway est la filiale cloud du groupe Iliad, l'un des leaders des télécommunications en Europe. Notre mission est de favoriser une industrie numérique plus responsable en aidant les développeurs et les entreprises à créer, déployer et adapter des applications à ...


  • Scaleway Newport Beach, CA, United States

    Fondée en 1999, Scaleway est la filiale cloud du groupe Iliad, l'un des leaders des télécommunications en Europe. Notre mission est de favoriser une industrie numérique plus responsable en aidant les développeurs et les entreprises à créer, déployer et adapter des applications à ...


  • Broadcom Corporation Newport Beach, United States

    Please Note: · 1. If you are a first time user, please create your candidatelogin account before you apply for a job. (Click Sign In > Create Account) · 2. If you already have a Candidate Account, please Sign-In before you apply. · Job Description: · The Elevator Pitch: Why woul ...


  • Scaleway Newport Beach, CA, United States

    Fondée en 1999, Scaleway est la filiale cloud du groupe Iliad, l'un des leaders des télécommunications en Europe. Notre mission est de favoriser une industrie numérique plus responsable en aidant les développeurs et les entreprises à créer, déployer et adapter des applications à ...