Principal Engineer Inference Stack - Santa Clara
1 month ago

Similar jobs
At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. · Develop techniques for optimizing scale-up and scale-out inference. · Develop methods and tooling to utilize dynam ...
1 month ago
Software Development Engineer- SGLang and Inference Stack
We are seeking a skilled Software Development Engineer to join our team at AMD. As a core member of our team, you will play a pivotal role in optimizing and developing deep learning frameworks for AMD GPUs. · Skilled engineer with strong technical and analytical expertise in GPGP ...
3 weeks ago
Software Development Engineer- SGLang and Inference Stack
As a core member of the team, you will play a pivotal role in optimizing and developing deep learning frameworks for AMD GPUs. · ...
3 weeks ago
Senior Software Development Engineer – SGLang and Inference Stack
We are seeking a skilled Software Development Engineer to optimize and develop deep learning frameworks for AMD GPUs. · ...
3 weeks ago
We are now looking for a Senior DL Algorithms Engineer. NVIDIA is seeking senior engineers who are mindful of performance analysis and optimization to help us squeeze every last clock cycle out of Deep Learning workloads. · This role offers an opportunity to directly impact the ha ...
2 months ago
We are looking for a Senior DL Algorithms Engineer to help us squeeze every last clock cycle out of Deep Learning workloads. · ...
2 weeks ago
We are now looking for a Senior DL Algorithms Engineer to help us squeeze every last clock cycle out of Deep Learning workloads. · PhD in CS, EE or CSEE or equivalent experience. · 5+ years of experience. · ...
1 month ago
We are now looking for a Senior DL Algorithms Engineer. NVIDIA is seeking senior engineers who are mindful of performance analysis and optimization to help us squeeze every last clock cycle out of Deep Learning workloads. · Implement language and multimodal model inference as part ...
2 weeks ago
We are now looking for a Senior DL Algorithms Engineer. NVIDIA is seeking senior engineers who are mindful of performance analysis and optimization to help us squeeze every last clock cycle out of Deep Learning workloads. If you are unafraid to work across all layers of the hardwa ...
2 weeks ago
We are now looking for a Senior DL Algorithms Engineer to help us squeeze every last clock cycle out of Deep Learning workloads. This role offers an opportunity to directly impact the hardware and software roadmap in a fast-growing technology company that leads the AI revolution. ...
2 months ago
Senior Software Development Engineer – LLM Inference Framework
We are seeking a senior software development engineer to join our LLM inference framework team. As a senior member of the team, you will be responsible for building and optimizing production-grade single-node and distributed inference runtimes for large language models on AMD GPU ...
1 month ago
NVIDIA is the platform for every new AI-powered application. We seek a Principal Software Engineer - AI Inference to advance open-source LLM serving. This role involves contributing to upstream inference engines like vLLM and SGLang. You will ensure they run outstandingly on NVID ...
1 week ago
Senior GenAI Algorithms Engineer — Post-Training Optimizations
NVIDIA is at the forefront of the generative AI revolution. Seeking a Senior Deep Learning Algorithms Engineer to improve innovative LLMs, VLMs, and multi-modality models. · Master's or PhD in Computer Science or related field. · 5+ years of experience in deep learning. · ...
1 month ago
We are seeking a Senior Member of Technical Staff to join our team as a technical leader in Large Language Model (LLM) inference and kernel optimization for AMD GPUs. · ...
1 month ago
Senior GenAI Algorithms Engineer — Post-Training Optimizations
Seeking a Senior Deep Learning Algorithms Engineer to improve innovative LLMs, VLMs, and multi-modality models. · You will design, implement, and productionize model optimization algorithms for inference and deployment on NVIDIA's latest hardware platforms. · ...
1 month ago
Senior GenAI Algorithms Engineer — Post-Training Optimizations
We are seeking a Senior Deep Learning Algorithms Engineer to improve innovative LLMs, VLMs, and multi-modality models. · In this role you will design, implement, and productionize model optimization algorithms for inference and deployment on NVIDIA's latest hardware platforms. · De ...
1 month ago
Senior GenAI Algorithms Engineer — Post-Training Optimizations
Job summary · NVIDIA is at the forefront of the generative AI revolution. The Algorithmic Model Optimization Team specifically focuses on optimizing generative AI models such as large language models (LLM) and diffusion models for maximal inference efficiency using techniques ran ...
1 month ago
We are looking for a Senior Technical Product Marketing Manager to join our rapidly growing data center business. You will be focused on working with engineering to understand the technical capabilities of our inference stack from GPUs, CPUs, networking, CUDA libraries, model ...
1 month ago
We are now looking for a Senior Deep Learning Engineer. At NVIDIA, we are at the forefront of advancing the capabilities of artificial intelligence. We are seeking an ambitious and forward-thinking senior deep learning engineer to contribute to the development of next-generation i ...
4 days ago
Senior Deep Learning Software Engineer, Inference and Model Optimization
The Algorithmic Model Optimization Team specifically focuses on optimizing generative AI models such as large language models (LLM) and diffusion models for maximal inference efficiency using techniques ranging from neural architecture search and pruning to sparsity, quantization ...
1 month ago