Principal Engineer Inference Stack - Santa Clara
1 week ago

Job summary
We are looking for a strategic software engineering lead who is passionate about improving the performance of key applications and benchmarks. Join us as we shape the future of AI and beyond.
Similar jobs
Principal Engineer Inference Stack
1 week ago
At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. · Develop techniques for optimizing scale-up and scale-out inference. · Develop methods and tooling to utilize dynam ...
We are seeking a skilled Software Development Engineer to join our team at AMD. As a core member of our team, you will play a pivotal role in optimizing and developing deep learning frameworks for AMD GPUs. · Skilled engineer with strong technical and analytical expertise in GPGP ...
As a core member of the team, you will play a pivotal role in optimizing and developing deep learning frameworks for AMD GPUs. · ...
We are seeking a skilled Software Development Engineer to optimize and develop deep learning frameworks for AMD GPUs. · ...
Senior DL Algorithms Engineer
1 month ago
We are now looking for a Senior DL Algorithms Engineer. NVIDIA is seeking senior engineers who are mindful of performance analysis and optimization to help us squeeze every last clock cycle out of Deep Learning workloads. · This role offers an opportunity to directly impact the ha ...
Senior DL Algorithms Engineer
4 weeks ago
We are now looking for a Senior DL Algorithms Engineer to help us squeeze every last clock cycle out of Deep Learning workloads. · PhD in CS, EE or CSEE or equivalent experience. · 5+ years of experience. · ...
Senior DL Algorithms Engineer
1 month ago
We are now looking for a Senior DL Algorithms Engineer to help us squeeze every last clock cycle out of Deep Learning workloads. This role offers an opportunity to directly impact the hardware and software roadmap in a fast-growing technology company that leads the AI revolution. ...
We are seeking a senior software development engineer to join our LLM inference framework team. As a senior member of the team, you will be responsible for building and optimizing production-grade single-node and distributed inference runtimes for large language models on AMD GPU ...
NVIDIA is at the forefront of the generative AI revolution. Seeking a Senior Deep Learning Algorithms Engineer to improve innovative LLMs, VLMs, and multi-modality models. · Master's or PhD in Computer Science or related field. · 5+ years of experience in deep learning. · ...
We are seeking a Senior Member of Technical Staff to join our team as a technical leader in Large Language Model (LLM) inference and kernel optimization for AMD GPUs. · ...
NVIDIA is at the forefront of the generative AI revolution. The Algorithmic Model Optimization Team specifically focuses on optimizing generative AI models such as large language models (LLMs) and diffusion models for maximal inference efficiency using techniques ran ...
We are seeking a Senior Deep Learning Algorithms Engineer to improve innovative generative AI models like LLMs, VLMs, multimodal, and diffusion models. · ...
We are seeking a Senior Deep Learning Algorithms Engineer to improve innovative LLMs, VLMs, and multi-modality models. · In this role you will design, implement, and productionize model optimization algorithms for inference and deployment on NVIDIA's latest hardware platforms. · De ...
NVIDIA is at the forefront of the generative AI revolution. The Algorithmic Model Optimization Team specifically focuses on optimizing generative AI models such as large language models (LLMs) and diffusion models for maximal inference efficiency using techniques ranging from quan ...
Seeking a Senior Deep Learning Algorithms Engineer to improve innovative LLMs, VLMs, and multi-modality models. · You will design, implement, and productionize model optimization algorithms for inference and deployment on NVIDIA's latest hardware platforms. · ...
We are looking for a Senior Technical Product Marketing Manager to join our rapidly growing data center business. You will be focused on working with engineering to understand the technical capabilities of our inference stack from GPUs, CPUs, networking, CUDA libraries, model ...
Senior DL Algorithms Engineer
1 month ago
We are now looking for a Senior DL Algorithms Engineer who can help us squeeze every last clock cycle out of Deep Learning workloads. · ...
Principal Software Engineer
1 month ago
NVIDIA Dynamo is an innovative, open-source platform focused on efficient, scalable inference for large language and reasoning models in distributed GPU environments. By bringing to bear sophisticated techniques in serving architecture, GPU resource management, and intelligent re ...
NVIDIA is at the forefront of the generative AI revolution. The Algorithmic Model Optimization Team specifically focuses on optimizing generative AI models such as large language models (LLMs) and diffusion models for maximal inference efficiency using techniques ranging from neural ...
Senior DL Algorithms Engineer
6 days ago
We are now looking for a Senior DL Algorithms Engineer to help us squeeze every last clock cycle out of Deep Learning workloads. · Implement language and multimodal model inference as part of NVIDIA Inference Microservices (NIMs). · Contribute new features, fix bugs and deliver p ...