AI Operations Observability Engineer - Milpitas, California, US
5 days ago

Job summary
The DevOps team within Cisco's newly formed AI Software and Platform group designs and operates the infrastructure that powers Cisco's exciting array of AI powered offerings.
+Responsibilities
- Deliver 100% coverage of distributed tracing in partnership with software development teams.
- Own libraries process documents design patterns enabling rapid adoption of OpenTelemetry.
Qualifications
- Bachelors +7 years working in a platform services or DevOps role with specific ownership of observability.
Job description
, consectetur adipiscing elit. Nullam tempor vestibulum ex, eget consequat quam pellentesque vel. Etiam congue sed elit nec elementum. Morbi diam metus, rutrum id eleifend ac, porta in lectus. Sed scelerisque a augue et ornare.
Donec lacinia nisi nec odio ultricies imperdiet.
Morbi a dolor dignissim, tristique enim et, semper lacus. Morbi laoreet sollicitudin justo eget eleifend. Donec felis augue, accumsan in dapibus a, mattis sed ligula.
Vestibulum at aliquet erat. Curabitur rhoncus urna vitae quam suscipit
, at pulvinar turpis lacinia. Mauris magna sem, dignissim finibus fermentum ac, placerat at ex. Pellentesque aliquet, lorem pulvinar mollis ornare, orci turpis fermentum urna, non ullamcorper ligula enim a ante. Duis dolor est, consectetur ut sapien lacinia, tempor condimentum purus.
Access all high-level positions and get the job of your dreams.
Similar jobs
We are seeking a forward-thinking Observability and IaC Engineer to lead the automation and instrumentation of our next-generation cloud-native platforms. · You will not just manage tools but architect an integrated “Observability-as-Code” ecosystem that supports distributed trac ...
2 weeks ago
We are seeking Software Engineers to join our efforts in building, maintaining and optimizing highly scalable reliable and secure systems. · Experience in Software Engineering Site Reliability Engineering DevOps or a related field. · Proficiency in at least one programming or scr ...
4 weeks ago
We are a dynamic team with members from diverse backgrounds designing and implementing distributed backend services at Uber. Join us to design system architecture and deliver a centralized logging system for Uber. · Design system architecture own key components to deliver a centr ...
1 week ago
We are looking for a Software Engineer to join our Observability team at Uber. · The Logging team ingests over 35 million log events per second in near real time and handles over 18 petabytes of data available for search. · We dream big, aim high, and execute with precision. ...
1 month ago
We're hiring an Engineering Manager to lead the team that builds and operates NVIDIA's global observability platform: · the system that carries every metric, log, trace, profile, and event our engineers rely on to understand and debug their services.NVIDIA runs some of the most ...
1 month ago
We re hiring an Engineering Manager to lead the team that builds and operates NVIDIA s global observability platform: the system that carries every metric log trace profile and event our engineers rely on to understand and debug their services. · ...
1 month ago
NVIDIA's Observability team is seeking a Senior/Staff Engineer to compose and build the next-generation, multi-region observability platform. · ...
3 days ago
NVIDIA's Observability team is seeking a Senior/Staff Engineer to compose and build the next-generation multi-region observability platform. · This platform powers NVIDIA's rapidly expanding AI Data and Observability ecosystem operating at an immense scale. · Architecting end-to- ...
1 month ago
Job summary · NVIDIA leads the way in groundbreaking developments in Artificial Intelligence, · High-Performance Computing, · and Visualization. ...
2 weeks ago
NVIDIA's Observability team is seeking a Senior/Staff Engineer to compose and build the next-generation, multi-region observability platform. · ...
1 month ago
+p>Job summary · We are seeking a forward-thinking Observability and IaC Engineer to lead the automation and instrumentation of our next-generation cloud-native platforms. · +li>Design and implement robust pipelines that collect and aggregate telemetry data (logs, metrics, events ...
2 weeks ago
We are hiring an Engineering Manager to lead the team that builds and operates NVIDIA's global observability platform.We pride ourselves on data-driven decision-making, and the data science platform team is at the heart of this initiative. · ...
1 month ago
+NVIDIA has been reinventing computer graphics PC gaming and accelerated computing for 30 years. It is a unique legacy of innovation that's fueled by great technology and amazing people. · + · +Lead the design development and deployment of AIOps & Observability platforms includin ...
1 month ago
Cerebras Systems is building the world's largest AI chip. We are seeking a talented Platform Software Engineer to join our team. · Set Technical Direction: Own the long-term architecture and roadmap for the observability platform across all pillars (metrics, logs, traces, and eve ...
1 week ago
+We are seeking senior engineers who specialize in all pillars of Observability to play a pivotal role in CoreWeave's ability to bring the best performing systems to the market. · +Design build and own core observability infrastructure including highly scalable reliable logging m ...
1 month ago
We're hiring Site Reliability Engineers who want to work on the systems that power everything from large-scale data pipelines to model training clusters to real-time decision making. · Architecting and operating large-scale observability systems that span global regions and suppo ...
6 days ago
We are looking for a highly skilled Principal Software Engineer to design and develop AIOps & Observability platforms at NVIDIA. · The platforms are used by internal teams to monitor, diagnose, and optimize the products, millions of assets and services in cloud, on-prem data cent ...
1 month ago
We are seeking senior engineers who specialize in all pillars of Observability to play a pivotal role in CoreWeave's ability to bring the best performing systems to the market. · Six or more years of experience in software or infrastructure engineering. · Proficiency in Go (our p ...
1 month ago
NVIDIA is hiring Senior Site Reliability Engineers who will help design and run the company's global telemetry backbone. · ...
1 month ago
We're hiring Site Reliability Engineers who want to work on the systems that power everything from large-scale data pipelines to model training clusters to real-time decision making. · We're looking for someone who can design and run NVIDIA's global telemetry backbone, the platfo ...
4 weeks ago