-
Liebherr Group Mountain View, United StatesSafety & Reliability Engineer für den Bereich Elektroniksysteme (m/w/d) Lindenberg | Job ID 70278 · Organization · Liebherr-Aerospace Lindenberg GmbH · Country · Deutschland · Entry level · Berufserfahrene · Faszinierendes schaffen: Ihre Aufgaben · Entwicklung, Nachweisführu ...
-
Packaging Reliability Engineer
1 week ago
Yoh - A Day & Zimmerman Company Mountain View, United StatesPackaging Reliability Engineer · As a Packaging Reliability Engineer, you will be responsible for qualifying packaging for consumer electronic products. The company creates iconic packaging that meets a high bar for reliability and demonstrates care for the people who use them an ...
-
Packaging Reliability Engineer
6 days ago
Yoh, A Day & Zimmermann Company Mountain View, United StatesJob Description · Job Description · Packaging Reliability Engineer · As a Packaging Reliability Engineer, you will be responsible for qualifying packaging for consumer electronic products. The company creates iconic packaging that meets a high bar for reliability and demonstrat ...
-
Staff Reliability Engineer
2 weeks ago
Cavnue Mountain View, United StatesWe believe that the future of transportation is automated. Automated travel will be safer, more comfortable, more efficient and a powerful economic enabler for our communities. However, automating driving is a massively complex engineering challenge, requiring vehicles to navigat ...
-
Site Reliability Engineer
4 weeks ago
Wayve Mountain View, United StatesJob Overview · At Wayve, we're not just another autonomous vehicle company. We stand out with our revolutionary approach to self-driving technology, embracing the power of embodied AI to redefine the boundaries of what's possible. While others depend on static maps and rigid rule ...
-
Site Reliability Engineering
1 week ago
NewsBreak Mountain View, United StatesAbout NewsBreak · NewsBreak is redefining the way users interact with local news and their communities. By bridging local users, local content creators, and local businesses, our mission is to foster safer, more vibrant, and authentically connected lives. Through robust collabor ...
-
Packaging Reliability Engineer
1 week ago
Yoh Mountain View, United StatesPackaging Reliability Engineer · As a Packaging Reliability Engineer, you will be responsible for qualifying packaging for consumer electronic products. The company creates iconic packaging that meets a high bar for reliability and demonstrates care for the people who use them an ...
-
Staff Reliability Engineer
2 weeks ago
Cavnue Mountain View, United StatesWe believe that the future of transportation is automated. · Automated travel will be safer, more comfortable, more efficient and a powerful economic enabler for our communities. However, automating driving is a massively complex engineering challenge, requiring vehicles to navi ...
-
Site Reliability Engineer
5 days ago
Intershop Communications AG Mountain View, United States(Senior) Site Reliability Engineer (m/f/d) · Homeoffice · Jena · Senior · We are Intershop - We're built to boost your business · As an e-commerce pioneer, we have been setting standards in the development of software for digital commerce for almost 30 years. With our cloud of ...
-
Site Reliability Engineer
3 weeks ago
Wayve Mountain View, United StatesAt Wayve, we're not just another autonomous vehicle company. We stand out with our revolutionary approach to self-driving technology, embracing the power of embodied AI to redefine the boundaries of what's possible. While others depend on static maps and rigid rules, we believe i ...
-
Site Reliability Engineering
3 days ago
NewsBreak Mountain View, United StatesAbout NewsBreak · NewsBreak is redefining the way users interact with local news and their communities. By bridging local users, local content creators, and local businesses, our mission is to foster safer, more vibrant, and authentically connected lives. Through robust collabor ...
-
Site Reliability Engineering
3 weeks ago
Newsbreakdigest Mountain View, United StatesResponsibilities · Engage in and improve the whole lifecycle of services—from inception and design, through deployment, operation and refinement. · Build and manage systems, infrastructure and applications through automation · Support services before they go live through activ ...
-
Site Reliability Engineering
3 weeks ago
News Break Mountain View, United StatesAbout NewsBreakNewsBreak is redefining the way users interact with local news and their communities. By bridging local users, local content creators, and local businesses, our mission is to foster safer, more vibrant, and authentically connected lives. Through robust collaboratio ...
-
Site Reliability Engineer
1 week ago
Tik Tok Mountain View, United StatesResponsibilities · About TikTok · TikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. TikTok has global offices including Los Angeles, New York, London, Paris, Berlin, Dubai, Singapore, Jakarta, Seoul and Tokyo. · W ...
-
Site Reliability Engineer
3 weeks ago
TikTok Mountain View, CA, United StatesTikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. TikTok has global offices including Los Angeles, New York, London, Paris, Berlin, Dubai, Mumbai, Singapore, Jakarta, Seoul and Tokyo. Our Trust and Safety engineerin ...
-
Site Reliability Engineering
3 weeks ago
NewsBreak Mountain View, United StatesAbout NewsBreak · NewsBreak is redefining the way users interact with local news and their communities. By bridging local users, local content creators, and local businesses, our mission is to foster safer, more vibrant, and authentically connected lives. Through robust collabor ...
-
Site Reliability Engineer
3 weeks ago
Motion Recruitment Partners LLC Mountain View, United StatesThis company is a well-known American image sharing and social media service designed to enable saving and discovery of information (specifically "ideas") on the internet using images and, on a smaller scale, animated GIFs and videos. As a Devops Engineer, You will manage access ...
-
Site Reliability Engineer
2 weeks ago
Amiseq Inc. Sunnyvale, United StatesSite Reliability Engineer · Not sure what skills you will need for this opportunity Simply read the full description below to get a complete picture of candidate requirements. · Sunnyvale, CA - Hybrid · 6-12 Months W2 Contract · Job Description: · Hands on development on bui ...
-
Site Reliability Engineer
2 weeks ago
AMISEQ Sunnyvale, United StatesSite Reliability Engineer · Sunnyvale, CA - Hybrid · 6-12 Months W2 Contract · Job Description: · Hands on development on building n-tier applications using RESTful Services, Java/J2EE, JavaScript, Python, NoSql. · • Working knowledge of one or more cloud technologies such as AZ ...
-
Site Reliability Engineer
1 week ago
Motion Recruitment Mountain View, United States CONTRACTThis company is a well-known American image sharing and social media service designed to enable saving and discovery of information (specifically "ideas") on the internet using images and, on a smaller scale, animated GIFs and videos. As a Devops Engineer, You will manage access ...
Staff Site Reliability Engineer - Mountain View, United States - Synopsys
Description
We are seeking a talented and experienced professional to join our team as Staff SRE Engineer.The successful candidate will have the responsibility of designing, implementing, and maintaining the observability platform that monitors the health of our production systems on-prem and in the Cloud.
The candidate should have an exceptional background in software development, system administration, and monitoring tools, as well as a passion for building scalable and reliable systems.
Key Responsibilities
Design and implement the SRE & Observability platform to monitor the health of our production systems providing a holistic view of the environment.
Ensure that the SRE & Observability platform is scalable, reliable, and can handle large volumes of data.
Design and implement SRE best practices for the team and identify KPIs for various systems, organizations, and stakeholders.
Partner with multiple teams to identify data points needed to define SLA, SLI, SLO, error budgets and KPIs.
Automate the deployment and configuration of monitoring tools to reduce human error and increase efficiency.
Develop custom scripts and tools to extend the functionality of the monitoring platform, including, but not limited to Proactive remediation and Self-Healing.
Perform root cause analysis on incidents, prepare detailed reports to present to the stakeholders, and develop solutions to prevent similar incidents from occurring in the future.
Provide guidance and mentorship to junior members of the team.Drive the design and implementation of major SRE initiatives.
Act as a SME on SRE & Observability, providing guidance to other teams across the organization.
Continuously evaluate and implement new tools and technologies to improve the SRE platform.
Qualifications
Have a proactive approach to identifying problems, performance bottlenecks, and areas for improvement.
Deep knowledge of Linux OS, Networking and NFS technologies.
Experience with data stores and search engines such as Elasticsearch is a must. Other technologies like Prometheus, Grafana, and similar technologies is a plus.
Solid Python programming skills and experience.
Expertise in cloud computing platforms such as Azure, AWS, or GCP.
Experience with EDA workloads and environments including technologies such as Grid engine, LSF, SLURM, Linux, networking and NFS is required.
Familiarity with containerization technologies such as Docker, Swarm and Kubernetes.Excellent problem-solving skills and attention to detail.
Ability to work collaboratively with other teams and stakeholders.
Proven communication skills, both verbal and written.
Ability to work in a fast-paced and dynamic environment.
Bachelor's or Master's degree in Computer Science, Engineering, or a related field.
10+ years of experience in software development, system administration and monitoring tools.
The base salary range across the U.S. for this role is between $117,000-$204,000. In addition, this role may be eligible for an annual bonus, equity, and other discretionary bonuses. Synopsys offers comprehensive health, wellness, and financial benefits as part of a of a competitive total rewards package. The actual compensation offered will be based on a number of job-related factors, including location, skills, experience, and education. Your recruiter can share more specific details on the total rewards package upon request.
#J-18808-Ljbffr