Senior Software Architect, Observability Platform - Santa Clara, CA

Santa Clara, CA, United States

4 weeks ago

Job summary

NVIDIA's Infrastructure organization is seeking a Senior Software Architect for our Observability Platform to architect and implement distributed observability systems for the data centers that enable EDA workflows.

Responsibilities

  • Collaborate with HW and SW engineering teams to deliver observability solutions that meet their needs in EDA clusters.
  • Develop, test, and deploy data collectors, pipelines, visualization, and retrieval services.
  • Define data collection and retention policies to balance network bandwidth, system load, and storage capacity costs with data analysis requirements.
  • Work in a diverse team to provide operational and strategic data to empower our engineers and researchers to improve performance, productivity, and efficiency.

Similar jobs

  • Santa Clara $224,000 - $356,500 (USD)

    We're hiring an Engineering Manager to lead the team that builds and operates NVIDIA's global observability platform: the system that carries every metric, log, trace, profile, and event our engineers rely on to understand and debug their services. NVIDIA runs some of the most ...

  • Santa Clara Full time $224,000 - $356,500 (USD)

    We're hiring an Engineering Manager to lead the team that builds and operates NVIDIA's global observability platform: the system that carries every metric, log, trace, profile, and event our engineers rely on to understand and debug their services. ...

  • US, CA, Santa Clara

    Job summary: NVIDIA leads the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing, and Visualization. ...

  • Santa Clara, CA

    We are hiring an Engineering Manager to lead the team that builds and operates NVIDIA's global observability platform. We pride ourselves on data-driven decision-making, and the data science platform team is at the heart of this initiative. ...

  • Mountain View, California

    Job summary: As the Manager of the Observability Platform team, you will lead the engineers responsible for building and scaling the next generation of Databricks' global observability systems. ...

  • Mountain View Full time $222,000 - $300,000 (USD)

    Databricks is the data and AI company. More than 10,000 organizations worldwide, including Comcast and Condé Nast, rely on the Databricks Data Intelligence Platform to unify and democratize data. The Manager of the Observability Platform team will lead engineers responsible ...

  • Mountain View, CA

    We are passionate about enabling data teams to solve the world's toughest problems, from making the next mode of transportation a reality to accelerating the development of medical breakthroughs. Lead the design and development of the next-generation observability platforms th ...

  • Milpitas, CA

    We are the Catalyst Center Platforms and Capabilities team, responsible for delivering scalable, secure, and high-productivity cloud-native infrastructure and PaaS capabilities that power thousands of enterprise customers. 8+ years of full-stack development experience with focus o ...

  • Santa Clara $248,000 - $391,000 (USD)

    Job summary: NVIDIA is tapping into the unlimited potential of AI to define the next era of computing. We are looking for a highly skilled Principal Software Engineer to design and develop AIOps & Observability platforms at NVIDIA. ...

  • Santa Clara $184,000 - $356,500 (USD)

    NVIDIA's Observability team is seeking a Senior/Staff Engineer to compose and build the next-generation, multi-region observability platform. This platform powers NVIDIA's rapidly expanding AI Data and Observability ecosystem operating at an immense scale. Architecting end-to- ...

  • Santa Clara Full time $248,000 - $391,000 (USD)

    We are looking for a highly skilled Principal Software Engineer to design and develop AIOps & Observability platforms at NVIDIA. The platforms are used by internal teams to monitor, diagnose, and optimize the products, millions of assets and services in cloud, on-prem data cent ...

  • Senior Engineer

    1 month ago

    Santa Clara $224,000 - $356,500 (USD)

    NVIDIA is a pioneer in accelerated computing. We are looking for a Senior AI & HPC Observability Engineer to design and build the next-generation observability platform for large-scale AI workloads, ... Design and implement full-stack observability systems covering metrics, logs, ...

  • Santa Clara $184,000 - $356,500 (USD)

    NVIDIA is hiring Senior Site Reliability Engineers who will help design and run the company's global telemetry backbone. ...

  • Santa Clara $184,000 - $356,500 (USD)

    We're hiring Site Reliability Engineers who want to work on the systems that power everything from large-scale data pipelines to model training clusters to real-time decision making. Architecting and operating large-scale observability systems that span global regions and suppo ...

  • Santa Clara $208,000 - $327,750 (USD)

    This product manager will lead the development of foundational tools dedicated to ensuring the resiliency and observability of large-scale accelerated computing platforms. We have some of the most forward-thinking and hardworking people in the world working for us and, due to o ...

  • Santa Clara Full time $184,000 - $356,500 (USD)

    We're hiring Site Reliability Engineers who want to work on the systems that power everything from large-scale data pipelines to model training clusters to real-time decision making. Architecting and operating large-scale observability systems that span global regions and suppo ...

  • Santa Clara Full time $208,000 - $327,750 (USD)

    NVIDIA has become the platform upon which every new AI-powered application is built. From healthcare research applications to autonomous vehicles or voice-recognition systems, there is a need to simplify and deliver predictability for AI applications and workflows, and NVIDIA is ri ...

  • Santa Clara

    This position is open to these office locations: Santa Clara, CA; Kirkland, WA; San Diego, CA; Orlando, FL; Chicago, IL. The Director of AI Data Center Control Plane leads the engineering implementation of AI-first solutions across Big Data Observability and other data center control pl ...

  • Santa Clara $272,000 - $431,250 (USD)

    We serve and collaborate directly with NVIDIA's rapidly growing AI, HW, and SW engineering and research teams across the company. ...

  • Santa Clara, CA

    NVIDIA's Observability team is seeking a Senior/Staff Engineer to compose and build the next-generation, multi-region observability platform. ...