Senior Staff Software Development Engineer- GPU, LLM, AI - Santa Clara, CA

Only for registered members Santa Clara, CA, United States

1 month ago

Default job background
WHAT YOU DO AT AMD CHANGES EVERYTHING · At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progr ...
Lorem ipsum dolor sit amet
, consectetur adipiscing elit. Nullam tempor vestibulum ex, eget consequat quam pellentesque vel. Etiam congue sed elit nec elementum. Morbi diam metus, rutrum id eleifend ac, porta in lectus. Sed scelerisque a augue et ornare.

Donec lacinia nisi nec odio ultricies imperdiet.
Morbi a dolor dignissim, tristique enim et, semper lacus. Morbi laoreet sollicitudin justo eget eleifend. Donec felis augue, accumsan in dapibus a, mattis sed ligula.

Vestibulum at aliquet erat. Curabitur rhoncus urna vitae quam suscipit
, at pulvinar turpis lacinia. Mauris magna sem, dignissim finibus fermentum ac, placerat at ex. Pellentesque aliquet, lorem pulvinar mollis ornare, orci turpis fermentum urna, non ullamcorper ligula enim a ante. Duis dolor est, consectetur ut sapien lacinia, tempor condimentum purus.
Get full access

Access all high-level positions and get the job of your dreams.



Similar jobs

  • Work in company

    Senior Software Development Engineer - LLM Kernel

    Only for registered members

    We are seeking a Senior Member of Technical Staff to join our team as a technical leader in Large Language Model (LLM) inference and kernel optimization for AMD GPUs. · ...

    Santa Clara

    1 month ago

  • We are now looking for a Senior Deep Learning Software Engineer, LLM Performance NVIDIA is seeking an experienced Deep Learning Engineer passionate about analyzing and improving the performance of LLM inference NVIDIA is rapidly growing our research and development for Deep Learn ...

    Santa Clara $184,000 - $356,500 (USD)

    5 days ago

  • We're forming a team of innovators to roll out and enhance AI inference solutions at scale, demonstrating NVIDIA's GPU technology and Kubernetes. As a Solutions Architect (Inference Focus), you'll collaborate closely with our engineering, DevOps, and customer success teams to fos ...

    Santa Clara $152,000 - $218,500 (USD)

    6 days ago

  • NVIDIA Dynamo is a high-throughput, low-latency inference framework for serving generative AI and reasoning models across multi-node distributed environments. Built in Rust for performance and Python for extensibility, Dynamo orchestrates GPU shards, routes requests, and manages ...

    Santa Clara $272,000 - $431,250 (USD)

    5 days ago

  • Work in company

    Senior Deep Learning Software Engineer, LLM Performance

    Only for registered members

    We are now looking for a Senior Deep Learning Software Engineer, LLM Performance NVIDIA is seeking an experienced Deep Learning Engineer passionate about analyzing and improving the performance of LLM inference NVIDIA is rapidly growing our research and development for Deep Learn ...

    US, CA, Santa Clara $184,000 - $287,500 (USD) per year

    6 days ago

  • Work in company

    Senior Deep Learning Software Engineer, LLM Performance

    Only for registered members

    We are now looking for a Senior Deep Learning Software Engineer to join our team. Collaborate with the deep learning community to implement the latest algorithms for public release in TensorRT LLM, VLLM, SGLang and LLM benchmarks. · We specialize in developing GPU-accelerated Dee ...

    Santa Clara, CA

    4 weeks ago

  • Work in company

    Senior Software Development Engineer - LLM Kernel

    Only for registered members

    As a Senior Member of Technical Staff, you will be a technical leader in Large Language Model (LLM) inference and kernel optimization for AMD GPUs. You will play a critical role in advancing high-performance LLM serving by optimizing GPU kernels, inference runtimes, and distribut ...

    Santa Clara, CA

    3 weeks ago

  • Work in company

    Senior Software Engineer, TensorRT-LLM

    Only for registered members

    We are now looking for a TensorRT-LLM Software Development Engineer · NVIDIA is hiring software engineers for its TensorRT-LLM team.Academic and commercial groups around the world are using GPUs to power a revolution in deep learning-powered AI, · enabling breakthroughs in areas ...

    Santa Clara $152,000 - $287,500 (USD)

    2 weeks ago

  • Work in company

    Senior Software Development Engineer – LLM Inference Framework

    Only for registered members

    We are seeking a senior software development engineer to join our LLM inference framework team. As a senior member of the team, you will be responsible for building and optimizing production-grade single-node and distributed inference runtimes for large language models on AMD GPU ...

    Santa Clara

    1 month ago

  • Work in company

    Senior Software Development Engineer, TensorRT-LLM

    Only for registered members

    We are now looking for a TensorRT-LLM Software Development Engineer to join our team. The ability to work on a fast-paced delivery-focused team is required and excellent interpersonal skills are a must. · ...

    Santa Clara $148,000 - $287,500 (USD)

    1 month ago

  • Work in company

    Senior Deep Learning Architect, LLM Inference

    Only for registered members

    We are now looking for a Senior Deep Learning Architect for LLM Inference NVIDIA is at the forefront of the generative AI revolution Our Inference Benchmarking IB team is specifically focused on advanced inference server performance for Large Language Models LLMs If you re passio ...

    Santa Clara $224,000 - $356,500 (USD)

    1 month ago

  • Work in company

    Principal Software Engineer

    Only for registered members

    About · NVIDIA Dynamo is an innovative, open-source platform focused on efficient, scalable inference for large language and reasoning models in distributed GPU environments. By bringing to bear sophisticated techniques in serving architecture, GPU resource management, and intell ...

    Santa Clara $272,000 - $431,250 (USD)

    1 day ago

  • Work in company

    High-Performance LLM Training Engineer - New College Grad 2026

    Only for registered members

    NVIDIA seeks experienced engineers specializing in performance analysis and optimization for high-performance LLM training workloads. This position optimizes NVIDIA's software stack in frameworks like PyTorch and JAX for training on thousands of GPUs.Understand, analyze, profile, ...

    Santa Clara $124,000 - $195,500 (USD)

    1 week ago

  • Work in company

    Principal Software Engineer

    Only for registered members

    NVIDIA Dynamo is an innovative, open-source platform focused on efficient, scalable inference for large language and reasoning models in distributed GPU environments. By bringing to bear sophisticated techniques in serving architecture, GPU resource management, and intelligent re ...

    Santa Clara $272,000 - $425,500 (USD)

    1 month ago

  • Work in company

    Senior Math Libraries Engineer

    Only for registered members

    We are looking for a passionate and energized engineer to accelerate the integration of APIs across the CUDA-X math library software stack into modern LLM and agentic AI workflows. · Drive the architecture and implementation of math libraries APIs and documentation structures des ...

    Santa Clara $152,000 - $287,500 (USD) Full time

    1 month ago

  • Work in company

    High-Performance LLM Training Engineer - New College Grad 2026

    Only for registered members

    + High-Performance LLM Training Engineer - New College Grad 2026 + NVIDIA seeks experienced engineers specializing in performance analysis and optimization to improve efficiency of LLM training workloads on thousands of GPUs. Key responsibilities include understanding AI training ...

    Santa Clara $124,000 - $195,500 (USD) Full time

    1 week ago

  • Work in company

    Principal Software Engineer

    Only for registered members

    NVIDIA Dynamo is an innovative platform for efficient inference of large language and reasoning models in distributed GPU environments. · We're searching for engineers to build the next generation of scalable AI systems as a Principal Software Engineer on the Dynamo project. · Ad ...

    Santa Clara $272,000 - $431,250 (USD) Full time

    1 month ago

  • Work in company

    Senior Staff Software Development Engineer- GPU/AI/ML

    Only for registered members

    At AMD we build great products that accelerate next-generation computing experiences—from AI and data centers to PCs gaming embedded systems. · ...

    Santa Clara

    1 month ago

  • Work in company

    Principal Software Engineer

    Only for registered members

    NVIDIA Dynamo is an innovative platform focused on efficient inference for large language models in distributed GPU environments. · ...

    Santa Clara $272,000 - $425,500 (USD) Full time

    1 month ago

  • Work in company

    Solutions Architect, AI and ML

    Only for registered members

    We are looking for an experienced Cloud Solution Architect to help assist customers with adoption of GPU hardware and Software as well as building and deploying Machine Learning (ML) Deep Learning (DL) data analytics solutions on various Cloud Computing Platforms. · A Solutions A ...

    Santa Clara

    1 month ago