Senior Software Development Engineer – Distributed Inference - Santa Clara, CA

Only for registered members Santa Clara, CA, United States

3 weeks ago

Default job background
WHAT YOU DO AT AMD CHANGES EVERYTHING · At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progr ...
Lorem ipsum dolor sit amet
, consectetur adipiscing elit. Nullam tempor vestibulum ex, eget consequat quam pellentesque vel. Etiam congue sed elit nec elementum. Morbi diam metus, rutrum id eleifend ac, porta in lectus. Sed scelerisque a augue et ornare.

Donec lacinia nisi nec odio ultricies imperdiet.
Morbi a dolor dignissim, tristique enim et, semper lacus. Morbi laoreet sollicitudin justo eget eleifend. Donec felis augue, accumsan in dapibus a, mattis sed ligula.

Vestibulum at aliquet erat. Curabitur rhoncus urna vitae quam suscipit
, at pulvinar turpis lacinia. Mauris magna sem, dignissim finibus fermentum ac, placerat at ex. Pellentesque aliquet, lorem pulvinar mollis ornare, orci turpis fermentum urna, non ullamcorper ligula enim a ante. Duis dolor est, consectetur ut sapien lacinia, tempor condimentum purus.
Get full access

Access all high-level positions and get the job of your dreams.



Similar jobs

  • Work in company

    Senior Software Development Engineer - LLM Kernel

    Only for registered members

    We are seeking a Senior Member of Technical Staff to join our team as a technical leader in Large Language Model (LLM) inference and kernel optimization for AMD GPUs. · ...

    Santa Clara

    1 month ago

  • Work in company

    Principal Software Engineer

    Only for registered members

    About · NVIDIA Dynamo is an innovative, open-source platform focused on efficient, scalable inference for large language and reasoning models in distributed GPU environments. By bringing to bear sophisticated techniques in serving architecture, GPU resource management, and intell ...

    Santa Clara $272,000 - $431,250 (USD)

    6 days ago

  • Work in company

    Solutions Architect, Inference Deployments

    Only for registered members

    We're forming a team of innovators to roll out and enhance AI inference solutions at scale, demonstrating NVIDIA's GPU technology and Kubernetes. As a Solutions Architect focused on inference, you'll collaborate closely with our engineering, DevOps, and customers to develop enter ...

    Santa Clara $152,000 - $241,500 (USD)

    2 days ago

  • Work in company

    Senior Software Development Engineer – LLM Inference Framework

    Only for registered members

    We are seeking a senior software development engineer to join our LLM inference framework team. As a senior member of the team, you will be responsible for building and optimizing production-grade single-node and distributed inference runtimes for large language models on AMD GPU ...

    Santa Clara

    1 month ago

  • Work in company

    Principal Software Engineer

    Only for registered members

    NVIDIA Dynamo is an innovative platform for efficient inference of large language and reasoning models in distributed GPU environments. · We're searching for engineers to build the next generation of scalable AI systems as a Principal Software Engineer on the Dynamo project. · Ad ...

    Santa Clara $272,000 - $431,250 (USD) Full time

    1 month ago

  • Work in company

    Solutions Architect, Inference Deployments

    Only for registered members

    We're forming a team of innovators to roll out and enhance AI inference solutions at scale, demonstrating NVIDIA's GPU technology and Kubernetes. As a Solutions Architect focused on inference, you'll collaborate closely with our engineering, DevOps, and customers to develop enter ...

    Santa Clara $152,000 - $241,500 (USD) Full time

    3 days ago

  • Work in company

    Staff Software Engineer

    Only for registered members

    We are looking for a versatile Machine Learning Infrastructure Engineer to join XPENG's Fuyao AI Platform team — a core AI infrastructure powering autonomous driving, · robotics, · and intelligent cockpit teams with large-scale data processing, · model training, · and inference a ...

    Santa Clara $179,400 - $303,600 (USD)

    1 month ago

  • Work in company

    Senior System Software Engineer – Dynamo Tools

    Only for registered members

    We are seeking a Senior System Software Engineer to own and advance the AI-Perf analysis, NVIDIA's flagship framework for benchmarking, experimentation, and analysis of LLMs, · Generative AI, · and deep learning inference workloads.In this role, · You'll combinesystems research, ...

    Santa Clara $224,000 - $356,500 (USD)

    1 month ago

  • Work in company

    Senior System Software Engineer – Dynamo Tools

    Only for registered members

    +We are seeking a Senior System Software Engineer to own and advance the AI-Perf analysis, NVIDIA's flagship framework for benchmarking, experimentation, and analysis of LLMs, Generative AI, and deep learning inference workloads. · +Lead the design, development, and roadmap of AI ...

    Santa Clara $184,000 - $356,500 (USD) Full time

    1 month ago

  • Work in company

    AI Cluster Validation Engineer

    Only for registered members

    We push the limits of innovation to solve the world's most important challenges—striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. · The RoleAMD is looking for an AI Cluster Validation Engineer who is passionate ab ...

    Santa Clara

    1 month ago

  • Work in company

    Staff Software Engineer

    Only for registered members

    We are looking for a versatile Machine Learning Infrastructure Engineer to join XPeng's Fuyao AI Platform team — a core AI infrastructure powering autonomous driving, robotics, and intelligent cockpit applications. · Design and optimize large-scale data processing, production and ...

    Santa Clara $179,400 - $303,600 (USD)

    1 month ago

  • Work in company

    AI Cluster Test Automation Engineer

    Only for registered members

    WHAT YOU DO AT AMD CHANGES EVERYTHING · At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progr ...

    Santa Clara $104,000 - $156,000 (USD)

    4 hours ago

  • Work in company

    Senior System Software Engineer

    Only for registered members

    We are looking for a Senior System Software Engineer to work on Dynamo-Triton Inference Server . We are a fast-paced team building a highly-performant AI inference platform to make design and deployment of new AI models easier and accessible to all users. · ...

    Santa Clara $152,000 - $287,500 (USD)

    1 week ago

  • Work in company

    Principal Software Engineer

    Only for registered members

    NVIDIA is the platform for every new AI-powered application. We seek a Principal Software Engineer - AI Inference to advance open-source LLM serving. This role involves contributing to upstream inference engines like vLLM and SGLang. You will ensure they run outstandingly on NVID ...

    Santa Clara $272,000 - $431,250 (USD)

    1 week ago

  • Work in company

    Senior Solutions Architect, Robotics Infrastructure

    Only for registered members

    We're building a group of innovators to assist enterprises in deploying and accelerating NVIDIA's three computer workloads for Physical AI. · We are seeking a hands-on Solutions Architect with deep expertise in backend infrastructure, inference and cloud-native applications to de ...

    Santa Clara $152,000 - $287,500 (USD)

    1 month ago

  • Work in company

    Senior Solutions Architect, Robotics Infrastructure

    Only for registered members

    ++We're building a group of innovators to assist enterprises in deploying and accelerating NVIDIA's three computer workloads for Physical AI. · ++Support customers in building scalable and observable GPU-accelerated pipelines for key robotics workloads using Kubernetes, cloud-nat ...

    Santa Clara $152,000 - $287,500 (USD) Full time

    1 month ago

  • Work in company

    Senior System Software Engineer

    Only for registered members

    We are looking for a Senior System Software Engineer to work on Dynamo-Triton Inference Server. NVIDIA is hiring software engineers for its GPU-accelerated deep learning software team. · Develop world-class GPU-accelerated AI inference serving software. · Contribute to feature de ...

    Santa Clara $152,000 - $287,500 (USD) Full time

    1 week ago

  • Work in company

    Senior Software Development Engineer - LLM Kernel

    Only for registered members

    As a Senior Member of Technical Staff, you will be a technical leader in Large Language Model (LLM) inference and kernel optimization for AMD GPUs. You will play a critical role in advancing high-performance LLM serving by optimizing GPU kernels, inference runtimes, and distribut ...

    Santa Clara, CA

    1 month ago

  • Work in company

    Principal Software Engineer

    Only for registered members

    NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It's a unique legacy of innovation that's fueled by great technology—and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of ...

    Santa Clara $248,000 - $391,000 (USD)

    6 days ago

  • Work in company

    Software Development Engineer - Kernel Development

    Only for registered members

    We are seeking a skilled Software Development Engineer to join our team as we push the limits of innovation to solve real-world challenges.We're looking for someone with strong technical expertise in Python development within Linux environments. · ...

    Santa Clara

    1 month ago