Principal Firmware Engineer – Server Manageability and Observability - US, CA, Santa Clara

Only for registered members US, CA, Santa Clara, United States

1 day ago

Default job background
NVIDIA data center systems, such as DGX and HGX, have become core to NVIDIA's rapidly growing enterprise and cloud provider businesses. These platforms bring together the full power of NVIDIA GPUs, NVIDIA NVLink, NVIDIA InfiniBand networking, NVIDIA Grace CPUs, and a fully optimi ...
Job description

NVIDIA data center systems, such as DGX and HGX, have become core to NVIDIA's rapidly growing enterprise and cloud provider businesses. These platforms bring together the full power of NVIDIA GPUs, NVIDIA NVLink, NVIDIA InfiniBand networking, NVIDIA Grace CPUs, and a fully optimized NVIDIA AI and HPC software stack. We're looking for a strong technical architect to own the end-to-end architecture of these products, at the system software level.

Including firmware, kernel drivers, operating systems, and user mode drivers. You will work with component leads internally and engage with industry leading cloud service providers on taking these products to market.

What you'll be doing:

  • Serve as the primary technical point of contact for major customers, leading technological discussions, defining KPIs, gathering requirements, and addressing complex technical queries.

  • As a system software architect, lead technical innovation and strategic collaborations with major hyperscalers to architect next-generation data center products.

  • Align NVIDIA's roadmap with major customers' requirements through direct engagement.

  • Develop and drive adoption of new technologies and protocols.

  • Make critical technical decisions in ambiguous situations, mitigating risks through left-shift strategies.

What we need to see:

  • Deep expertise in scalable and performant server system architecture, focusing on SW/HW interfaces.

  • Extensive experience with complex system software for accelerators (GPUs, DPUs, FPGAs).

  • Mastery of system firmware (SBIOS, OpenBMC), embedded systems, and Linux kernel internals.

  • Proficiency in Out-of-Band and In-Band management architectures, device management protocols (e.g., MCTP, PLDM, SPDM, RDE) and system management protocols (Redfish, IPMI).

  • Extensive knowledge of networking technologies and protocols, including TCP/IP, Ethernet, InfiniBand, as well as advanced switching and routing concepts

  • Experience collaborating with platform security experts to define tradeoffs between security and ease of use.

  • Demonstrated success in leading complex, cross-functional projects to completion, showcasing the ability to influence and achieve results without direct authority in large-scale, collaborative environments. Demonstrable experience in implementing left shift strategy to de-risk program execution.

  • BS or MS degree in Computer Science, Electrical Engineering or related field (or equivalent experience).

  • 15+ years in the area of System architecture and design.

Ways to stand out from the crowd:

  • Knowledge of cloud and cluster level deployment and management systems. Participation and contributions in standards bodies such as OCP and DMTF.

  • Familiarity with NVIDIA HPC programming models and libraries (CUDA, cuDNN, DOCA)

  • Knowledge of enterprise storage architectures and distributed parallel processing paradigms

NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. We have some of the most forward-thinking and hardworking people on the planet working for us. If you're creative, passionate and self-motivated, we want to hear from you

NVIDIA's invention of the GPU in 1999 fueled the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern deep learning — the next era of computing — with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. Today, we are increasingly known as "the AI computing company." We're looking to grow our company and establish teams with the most thoughtful people in the world. Are you ready to change the next generation of computing? Join us at the forefront of technological advancement.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 272,000 USD - 431,250 USD

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until February 22, 2026.

This posting is for an existing vacancy. 

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.


Similar jobs

  • At NVIDIA, we pride ourselves on data-driven decision-making, and the data science platform team is at the heart of this initiative. NVIDIA runs some of the most demanding AI, data, and platform workloads on the planet and none of it works without a reliable, high-scale observabi ...

    Santa Clara $224,000 - $356,500 (USD)

    1 day ago

  • At NVIDIA, we pride ourselves on data-driven decision-making, and the data science platform team is at the heart of this initiative. NVIDIA runs some of the most demanding AI, data, and platform workloads on the planet and none of it works without a reliable, high-scale observabi ...

    Santa Clara, CA

    22 hours ago

  • Work in company

    Engineering Manager, Observability Platform

    Only for registered members

    We're hiring an Engineering Manager to lead the team that builds and operates NVIDIA's global observability platform: · the system that carries every metric, log, trace, profile, and event our engineers rely on to understand and debug their services.NVIDIA runs some of the most ...

    Santa Clara $224,000 - $356,500 (USD)

    1 month ago

  • Work in company

    Engineering Manager, Observability Platform

    Only for registered members

    We re hiring an Engineering Manager to lead the team that builds and operates NVIDIA s global observability platform: the system that carries every metric log trace profile and event our engineers rely on to understand and debug their services. · ...

    Santa Clara $224,000 - $356,500 (USD) Full time

    1 month ago

  • Work in company

    Engineering Manager, Observability Platform

    Only for registered members

    At NVIDIA, we pride ourselves on data-driven decision-making, and the data science platform team is at the heart of this initiative. NVIDIA runs some of the most demanding AI, data, and platform workloads on the planet and none of it works without a reliable, high-scale observabi ...

    US, CA, Santa Clara $224,000 - $356,500 (USD) per year

    1 day ago

  • Work in company

    Engineering Manager, Observability Platform

    Only for registered members

    We are hiring an Engineering Manager to lead the team that builds and operates NVIDIA's global observability platform.We pride ourselves on data-driven decision-making, and the data science platform team is at the heart of this initiative. · ...

    Santa Clara, CA

    1 month ago

  • Work in company

    Technical Product Manager – Observability

    Only for registered members

    About Agoda · At Agoda, we bridge the world through travel. Our story began in 2005... · Own the full product lifecycle for observability platforms. · Define the vision for anomaly detection... · ...

    San Jose, CA

    5 days ago

  • NVIDIA data center systems, such as DGX and HGX, have become core to NVIDIA's rapidly growing enterprise and cloud provider businesses. These platforms bring together the full power of NVIDIA GPUs, NVIDIA NVLink, NVIDIA InfiniBand networking, NVIDIA Grace CPUs, and a fully optimi ...

    Santa Clara $272,000 - $431,250 (USD)

    1 day ago

  • We're looking for a strong technical architect to own the end-to-end architecture of NVIDIA's data center systems. You will work with component leads internally and engage with industry leading cloud service providers on taking these products to market.Serve as the primary techni ...

    Santa Clara $272,000 - $431,250 (USD)

    1 month ago

  • NVIDIA is seeking a strong technical architect to own the end-to-end architecture of its data center systems, including firmware, kernel drivers, operating systems, and user mode drivers. The ideal candidate will work with component leads internally and engage with industry-leadi ...

    Santa Clara $272,000 - $425,500 (USD)

    1 month ago

  • NVIDIA data center systems have become core to NVIDIA's rapidly growing enterprise and cloud provider businesses. · ...

    Santa Clara, CA

    6 days ago

  • Work in company

    Engineering Manager, Cloud Networking Observability

    Only for registered members

    We are seeking a seasoned, visionary Engineering Manager to lead and expand a diverse team of highly skilled engineers focused on advancing observability, monitoring, and telemetry capabilities for Apple's critical cloud networking infrastructure. · Work with key stakeholders to ...

    Sunnyvale $228,100 - $342,800 (USD)

    2 weeks ago

  • Work in company

    Engineering Manager, Cloud Networking Observability

    Only for registered members

    +Job summaryTheAppleCloudNetworkingteambuildsandoperatesoftware-definednetworkplatformsthatworkatscale todeliveramulti-cloudnetworkandsecuritywithaglobalfootprint. · Weareafast-movingteamatthe forefrontofcraftingApple'shybridcloudnetworkinfrastructurewhichfuelsApple'sservicesincl ...

    Sunnyvale $228,100 - $342,800 (USD)

    1 month ago

  • Work in company

    Engineering Manager, Cloud Networking Observability

    Only for registered members

    We are seeking a seasoned, visionary Engineering Manager to lead and expand a diverse team of highly skilled engineers focused on advancing observability, monitoring, and telemetry capabilities for Apple's critical cloud networking infrastructure. · Work with key stakeholders to ...

    Sunnyvale, CA

    1 month ago

  • Work in company

    Product Marketing Manager, Cloud Observability

    Only for registered members

    We are looking for a Product Marketing Manager to drive the acquisition of new enterprise customers and developers on Google Cloud. The successful candidate will own the narrative, positioning, and value proposition for our Cloud Observability portfolio. · ...

    Sunnyvale, CA

    2 days ago

  • Work in company

    Sr. Manager, Observability Platform Engineering

    Only for registered members

    Databricks is the data and AI company. More than 10,000 organizations worldwide — including Comcast, Condé Nast are relying on the Databricks Data Intelligence Platform to unify and democratize data. · The Manager of the Observability Platform team will lead engineers responsible ...

    Mountain View $222,000 - $300,000 (USD) Full time

    1 month ago

  • Work in company

    Sr. Manager, Observability Platform Engineering

    Only for registered members

    Job summaryAs the Manager of the Observability Platform team, you will lead the engineers responsible for building and scaling the next generation of Databricks' global observability systems. · ...

    Mountain View, California

    5 days ago

  • Work in company

    Sr. Manager, Observability Platform Engineering

    Only for registered members

    We are passionate about enabling data teams to solve the world's toughest problems — from making the next mode of transportation a reality to accelerating the development of medical breakthroughs. · Lead the design and development of the next-generation observability platforms th ...

    Mountain View, CA

    1 month ago

  • The Apple Cloud Networking team builds and operates software-defined network platforms that work at scale to deliver a multi-cloud network and security with a global footprint. · We are seeking a seasoned, visionary Engineering Manager to lead and expand a diverse team of highly ...

    Sunnyvale $228,100 - $342,800 (USD)

    1 month ago

  • The Apple Cloud Networking team builds and operates software-defined network platforms that work at scale to deliver a multi-cloud network and security with a global footprint. · We are seeking a seasoned, visionary Engineering Manager to lead and expand a diverse team of highly ...

    Sunnyvale, CA

    1 month ago