Senior AI Researcher, On-Device LLM Efficiency - Santa Clara, CA
3 weeks ago

Job description
, consectetur adipiscing elit. Nullam tempor vestibulum ex, eget consequat quam pellentesque vel. Etiam congue sed elit nec elementum. Morbi diam metus, rutrum id eleifend ac, porta in lectus. Sed scelerisque a augue et ornare.
Donec lacinia nisi nec odio ultricies imperdiet.
Morbi a dolor dignissim, tristique enim et, semper lacus. Morbi laoreet sollicitudin justo eget eleifend. Donec felis augue, accumsan in dapibus a, mattis sed ligula.
Vestibulum at aliquet erat. Curabitur rhoncus urna vitae quam suscipit
, at pulvinar turpis lacinia. Mauris magna sem, dignissim finibus fermentum ac, placerat at ex. Pellentesque aliquet, lorem pulvinar mollis ornare, orci turpis fermentum urna, non ullamcorper ligula enim a ante. Duis dolor est, consectetur ut sapien lacinia, tempor condimentum purus.
Access all high-level positions and get the job of your dreams.
Similar jobs
As a Qualcomm Machine Learning Researcher, you will conduct fundamental research that creates innovative machine learning methodology that achieves beyond state-of-the-art performance. · ...
3 weeks ago
XPENG is a leading smart technology company at the forefront of innovation, integrating advanced AI and autonomous driving technologies into its vehicles, including electric vehicles (EVs), electric vertical take-off and landing (eVTOL) aircraft, and robotics. With a strong focus ...
4 days ago
Join NVIDIA's TensorRT Edge-LLM team and help shape the next generation of edge AI for automotive and robotics. · ...
3 weeks ago
+Job summary · Xpeng is a leading smart technology company that integrates AI and autonomous driving technologies into its vehicles. · +ResponsibilitiesFine-tune pre-trained LLMs for humanoid robots and autonomous driving cars. · Develop efficient LLM training and deployment pipe ...
1 month ago
Experience · 10–12 years of experience in AI/ML related roles with strong hands-on experience in Large Language Models (LLMs), Generative AI, and Agentic AI architectures. · Key Technical Skills · Generative AI & LLM Expertise · 2–3 years of hands-on experience building Generativ ...
2 days ago
2026 Campus Recruiting Robotics Center Internship Position
Only for registered members
This internship position is part of the Campus Recruiting Robotics Center. · ...
1 month ago
We are seeking a Senior Member of Technical Staff to join our team as a technical leader in Large Language Model (LLM) inference and kernel optimization for AMD GPUs. · ...
1 month ago
2026 Campus Recruiting Robotics Center Internship Position
Only for registered members
The Robotics Center is looking for interns to join their team. · Robotics Foundation Model - LLM/VLM codesign with in-house chips, efficient training and deployment. · Manipulation – Data-driven end-to-end manipulation (e.g., Diffusion Policy, VLA). · ...
1 month ago
We are now looking for a Senior DL Algorithms Engineer to optimize and deploy Large Language Models (LLMs), Vision-Language Models (VLMs), and World Foundation Models (WFMs) in production environments. · Optimize deep learning models for low-latency, high-throughput inference. · ...
3 weeks ago
+Job summary · This is an exciting opportunity to work as a Solution Architect for Elista Global LLC. The role involves designing and architecting GenAI applications, including Retrieval-Augmented Generation (RAG), LLM orchestration, and advanced prompt design strategies. · +Resp ...
1 month ago
This is an exciting opportunity for a Senior Architect to join NVIDIA's applied AI team for chip design. As part of this team, you will have the chance to tap into the unlimited potential of AI and change the landscape of the chip industry.Design large-scale enterprise AI infrast ...
1 month ago
2026 Campus Recruiting Robotics Center Full-Time Position
Only for registered members
The Robotics Center is seeking highly motivated and talented individuals to join its team as Full-time Positions in AI and Machine Learning (VLA, VLM, LLM) and Motion Control roles. Our team focuses on developing cutting-edge robotics technologies. · ...
1 month ago
We are seeking outstanding engineers to join our team and help shape the future of LLM inference. · We constantly reflect on how to improve these systems, developing new inference algorithms and protocols, · improving existing models, and seamlessly integrating improvements to en ...
1 month ago
We are seeking a Senior/Lead AI/ML Engineer with 7+ years of expertise in artificial intelligence and machine learning.You should have strong experience working with large language models (LLMs), multimodal AI, · and developing innovative applications using techniques such as Ret ...
1 month ago
We are now looking for a Senior Machine Learning Engineer for Quantized Inference NVIDIA is seeking machine learning engineers to accelerate the discovery and deployment of efficient inference recipes for LLMs. A recipe defines which operators are transformed into low-precision o ...
1 week ago
AI Software Engineer, LLM Inference Performance Analysis - New College Grad 2026
Only for registered members
+Job summary · NVIDIA is at the forefront of the generative AI revolution. We are looking for a Software Engineer to join our performance engineering team. · +QualificationsMaster's or PhD in Computer Science, Computer Engineering, or a related field · Strong hands-on programming ...
1 month ago
We are seeking a highly experienced Machine Learning Engineer to build deploy and optimize Large Language Model LLM based applications with a strong emphasis on MLOpsLLMOps LLM operations and scalable production systems. · ...
1 month ago
Average salary for Solution Architect in Santa Clara is $140,000 per year. · Design and architect GenAI applications including Retrieval-Augmented Generation (RAG), LLM orchestration (LangChain LangGraph) and advanced prompt design strategies. · ...
1 month ago
We are now looking for a Senior DL Algorithms Engineer to optimize and deploy Large Language Models (LLMs), Vision-Language Models (VLMs), and World Foundation Models (WFMs) in production environments. · ...
1 month ago
DL Algorithms Engineer - Cosmos - New College Graduate 2026
Only for registered members
We are now looking for a DL Algorithms Engineer We are seeking a highly skilled Deep Learning Algorithms Engineer with hands-on experience optimizing and deploying Large Language Models (LLMs), Vision-Language Models (VLMs), and World Foundation Models (WFMs) in production enviro ...
1 month ago