Senior Software Development Engineer – SGLang and Inference Stack - Santa Clara, CA
2 weeks ago

Similar jobs
We are looking for a strategic software engineering lead who is passionate about improving the performance of key applications and benchmarks. Join us as we shape the future of AI and beyond. · ...
3 weeks ago
At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. · Develop techniques for optimizing scale-up and scale-out inference. · Develop methods and tooling to utilize dynam ...
3 weeks ago
Software Development Engineer – SGLang and Inference Stack
We are seeking a skilled Software Development Engineer to join our team at AMD. As a core member of our team, you will play a pivotal role in optimizing and developing deep learning frameworks for AMD GPUs. · Skilled engineer with strong technical and analytical expertise in GPGP ...
2 weeks ago
Software Development Engineer – SGLang and Inference Stack
As a core member of the team, you will play a pivotal role in optimizing and developing deep learning frameworks for AMD GPUs. · ...
2 weeks ago
We are now looking for a Senior DL Algorithms Engineer. NVIDIA is seeking senior engineers who are mindful of performance analysis and optimization to help us squeeze every last clock cycle out of Deep Learning workloads. · This role offers an opportunity to directly impact the ha ...
1 month ago
We are looking for a Senior DL Algorithms Engineer to help us squeeze every last clock cycle out of Deep Learning workloads. · ...
1 week ago
We are now looking for a Senior DL Algorithms Engineer to help us squeeze every last clock cycle out of Deep Learning workloads. · PhD in CS, EE or CSEE or equivalent experience. · 5+ years of experience. · ...
1 month ago
We are now looking for a Senior DL Algorithms Engineer. NVIDIA is seeking senior engineers who are mindful of performance analysis and optimization to help us squeeze every last clock cycle out of Deep Learning workloads. · Implement language and multimodal model inference as part ...
1 week ago
We are now looking for a Senior DL Algorithms Engineer. NVIDIA is seeking senior engineers who are mindful of performance analysis and optimization to help us squeeze every last clock cycle out of Deep Learning workloads. If you are unafraid to work across all layers of the hardwa ...
1 week ago
We are now looking for a Senior DL Algorithms Engineer to help us squeeze every last clock cycle out of Deep Learning workloads. This role offers an opportunity to directly impact the hardware and software roadmap in a fast-growing technology company that leads the AI revolution. ...
1 month ago
Senior Software Development Engineer – LLM Inference Framework
We are seeking a senior software development engineer to join our LLM inference framework team. As a senior member of the team, you will be responsible for building and optimizing production-grade single-node and distributed inference runtimes for large language models on AMD GPU ...
1 month ago
NVIDIA is the platform for every new AI-powered application. We seek a Principal Software Engineer - AI Inference to advance open-source LLM serving. This role involves contributing to upstream inference engines like vLLM and SGLang. You will ensure they run outstandingly on NVID ...
3 days ago
Senior GenAI Algorithms Engineer — Post-Training Optimizations
NVIDIA is at the forefront of the generative AI revolution. Seeking a Senior Deep Learning Algorithms Engineer to improve innovative LLMs, VLMs, and multi-modality models. · Master's or PhD in Computer Science or related field. · 5+ years of experience in deep learning. · ...
1 month ago
We are seeking a Senior Member of Technical Staff to join our team as a technical leader in Large Language Model (LLM) inference and kernel optimization for AMD GPUs. · ...
1 month ago
Senior GenAI Algorithms Engineer — Post-Training Optimizations
Seeking a Senior Deep Learning Algorithms Engineer to improve innovative LLMs, VLMs, and multi-modality models. · You will design, implement, and productionize model optimization algorithms for inference and deployment on NVIDIA's latest hardware platforms. · ...
3 weeks ago
Senior GenAI Algorithms Engineer — Model Optimizations for Inference
NVIDIA is at the forefront of the generative AI revolution. The Algorithmic Model Optimization Team specifically focuses on optimizing generative AI models such as large language models (LLMs) and diffusion models for maximal inference efficiency using techniques ranging from quan ...
1 month ago
Senior GenAI Algorithms Engineer — Post-Training Optimizations
We are seeking a Senior Deep Learning Algorithms Engineer to improve innovative LLMs, VLMs, and multi-modality models. · In this role, you will design, implement, and productionize model optimization algorithms for inference and deployment on NVIDIA's latest hardware platforms. · De ...
3 weeks ago
Senior GenAI Algorithms Engineer — Post-Training Optimizations
Job summary · NVIDIA is at the forefront of the generative AI revolution. The Algorithmic Model Optimization Team specifically focuses on optimizing generative AI models such as large language models (LLMs) and diffusion models for maximal inference efficiency using techniques ran ...
3 weeks ago
We are looking for a Senior Technical Product Marketing Manager to join our rapidly growing data center business. You will be focused on working with engineering to understand the technical capabilities of our inference stack from GPUs, CPUs, networking, CUDA libraries, model ...
1 month ago
We are now looking for a Senior DL Algorithms Engineer who can help us squeeze every last clock cycle out of Deep Learning workloads. · ...
1 month ago