- Act as the primary point of contact (PoC) for all connected car infrastructure operations and projects
- Work collaboratively with software engineering to define infrastructure and deployment requirements; be a sounding board and provide recommendations to the engineering team on infrastructure design and deployment.
- Set the goal of creating a highly reliable and scalable software system that runs with minimal failures
- Gather and analyze metrics from both operating systems and applications to assist in performance tuning and fault finding
- Be the driving force behind our automation and observability initiatives. Build tools and automation that eliminate repetitive tasks and prevent incident occurrence.
- Build and maintain operational tools for deployment, monitoring, and analysis of connected car infrastructure and systems
- Perform infrastructure cost analysis and optimization
- Provide project management, sprint planning, and road-mapping support to the DevOps team
- Design, develop, install, and maintain software solutions.
- Work with engineering teams to refine deployment and release processes.
- Collaborate with the engineering team on projects as the expert on reliability, performance, and efficiency.
- Manage on-call rotations across connected car applications, using a follow-the-sun model.
- Participate in 24x7 operational support and on-call rotation shifts.
- Ensure that all system design and procedures are documented and up to date.
- Monitor and stress test systems to collect metrics for tuning and capacity planning.
- Work to automate detection and resolution of recurring issues.
- Ensure safety, predictability, repeatability, and auditability of all build and deploy processes.
- Partner with development teams to improve services through rigorous testing and release procedures
- Participate in system design consulting, platform management, and capacity planning
- Create sustainable systems and services through automation and uplifts
- Balance feature development speed and reliability with well-defined service level objectives
- Bachelor's or Master's degree, or equivalent, in computer science, information systems, or a related field.
- 2+ years of experience as a manager or PM, or in a technical leadership capacity, preferably in the automobile industry within the Telematics domain.
- Programming experience with one or more high level languages, such as Python, Java, C/C++, Ruby, and JavaScript
- Proven track record of designing, building, optimizing, and maintaining infrastructure on a large scale.
- Experience with distributed systems in a production operations environment
- Expertise in analyzing complex application, database, network, and OS issues across a distributed, large-scale, customer-facing system
- Strong communication skills and ability to work effectively across multiple business and technical teams
- Demonstrated ability to deliver results on time with high quality
- Extensive experience leading customer-facing systems in a high-uptime 24/7 environment
- A depth and breadth of experience with server-side Java development, Oracle and distributed databases
- A well-developed understanding of the theory and principles of operation of the internet and packet data protocols.
- Exposure to Cloud, SaaS, and virtualization concepts and performance concerns.
- Working knowledge of operating system design, processes, and threading model.
- Knowledge of defining and monitoring system quality measures, including SLOs and SLAs.
- Experience building tooling to improve system reliability, automate remediation of issues, or improve scalability.
- Experience with different flavors of Linux, e.g. RedHat, Ubuntu, CentOS, etc.
- Hands-on experience collecting performance data, analyzing, troubleshooting, and tuning.
- Experience operating applications with high concurrency, scalability, or availability requirements.
- Experience leading high performing engineering teams.
- Experience with containers and container orchestration tools (Docker, Kubernetes)
- Experience with MySQL, Elasticsearch, Couchbase, Mongo and Redis
- Experience with open-source stream-processing frameworks/systems, e.g. Kafka, Spark, etc.
- Experience with distributed storage technologies like NFS, HDFS, S3 as well as dynamic resource management frameworks (Mesos, Kubernetes, Yarn)
10471 - Manager, SRE - Fountain Valley, United States - Hyundai Autoever America
Job Description
Purpose:
The Site Reliability Engineering (SRE) Manager will work with the development and operations team to ensure that connected car systems work as expected and that the underlying infrastructure and network run smoothly. This role is responsible for the day-to-day operations of the DevOps team and combines project management, team management, and engineering duties. The DevOps team members are subject-matter experts within the Telematics domain and provide insight and engineering advice to development and product teams, with the goal of creating a highly reliable and scalable software system that runs with minimal failures.
Essential Functions:
Basic requirements:
Nice to have:
Salary Range - $112,830 to $173,756