Senior Software Engineer, Quantized Inference - US, CA, Santa Clara

US, CA, Santa Clara, United States

1 day ago

We are now looking for a Senior Software Engineer for Quantized Inference. NVIDIA is seeking software engineers to accelerate the discovery and deployment of efficient inference recipes for LLMs. A recipe defines which operators are transformed into low-precision or sparsified var ...

Similar jobs

  • Senior Machine Learning Engineer, Quantized Inference

    We are now looking for a Senior Machine Learning Engineer for Quantized Inference. NVIDIA is seeking machine learning engineers to accelerate the discovery and deployment of efficient inference recipes for LLMs. A recipe defines which operators are transformed into low-precision o ...

    Santa Clara $152,000 - $287,500 (USD)

    2 days ago

  • Senior Machine Learning Engineer, Quantized Inference

    We are now looking for a Senior Machine Learning Engineer for Quantized Inference. NVIDIA is seeking machine learning engineers to accelerate the discovery and deployment of efficient inference recipes for LLMs. A recipe defines which operators are transformed into low-precision o ...

    US, CA, Santa Clara

    1 day ago

  • Staff Software Engineer

    We are looking for a versatile Machine Learning Infrastructure Engineer to join XPENG's Fuyao AI Platform team, a core AI infrastructure powering autonomous driving, robotics, and intelligent cockpit teams with large-scale data processing, model training, and inference a ...

    Santa Clara $179,400 - $303,600 (USD)

    1 month ago

  • Senior Computer Vision Engineer

    We are seeking a passionate and skilled Senior Computer Vision Engineer to lead the development and deployment of high-performance, large-scale AI models. · Optimize large-scale multimodal models for low-latency inference and efficient memory usage across diverse hardware platform ...

    Santa Clara $174,720 - $295,680 (USD)

    1 month ago

  • Senior Computer Vision Engineer

    We are seeking a passionate and skilled Senior Computer Vision Engineer to lead the development and deployment of high-performance, large-scale AI models. · A fun, supportive and engaging environment · Infrastructures and computational resources to support your work. · Oppo ...

    Santa Clara, CA

    1 month ago

  • Solutions Architect, Inference Deployments

    We're forming a team of innovators to roll out and enhance AI inference solutions at scale, demonstrating NVIDIA's GPU technology and Kubernetes. As a Solutions Architect focused on inference, you'll collaborate closely with our engineering, DevOps, and customers to develop enter ...

    Santa Clara $152,000 - $241,500 (USD)

    12 hours ago

  • Senior GenAI Algorithms Engineer - Post-Training Optimizations

    NVIDIA is at the forefront of the generative AI revolution. Seeking a Senior Deep Learning Algorithms Engineer to improve innovative LLMs, VLMs, and multi-modality models. · Master's or PhD in Computer Science or related field. · 5+ years of experience in deep learning. · ...

    Santa Clara $184,000 - $287,500 (USD)

    1 month ago

  • AI Model Optimization Engineer

    We are looking for a Senior Software Development Engineer to own the end-to-end model execution stack on AMD Instinct GPUs. · The Role: The AMD AI Group is looking for a Senior Software Development Engineer to own the end-to-end model execution stack on AMD Instinct GPUs - spanning ...

    Santa Clara $138,000 - $207,000 (USD)

    1 week ago

  • Staff Engineer, Application Engineering

    As a Staff Application Software Engineer in the WIN Customer Engineering team, you will lead the integration of state-of-the-art AI/ML capabilities into Qualcomm's next-generation Wi-Fi 7 and Wi-Fi 8 Access Point platforms. · ...

    Santa Clara $145,000 - $217,600 (USD) Full time

    2 weeks ago

  • Staff Engineer, Application Engineering

    As a Staff Application Software Engineer in the WIN Customer Engineering team, you will lead the integration of state-of-the-art AI/ML capabilities into Qualcomm's next-generation Wi-Fi 7 and Wi-Fi 8 Access Point platforms. You will bridge the gap between advanced AI research and ...

    Santa Clara $145,000 - $217,600 (USD)

    2 weeks ago

  • Senior Deep Learning Software Engineer

    We are looking for outstanding Deep Learning Software Engineers to develop and productize NVIDIA's deep learning solutions in autonomous driving vehicles. · The base salary range is $152,000 USD - $241,500 USD for Level 3, and $184,000 USD - $287,500 USD for Level 4. ...

    Santa Clara $152,000 - $287,500 (USD)

    1 month ago

  • Solutions Architect, Inference Deployments

    We're forming a team of innovators to roll out and enhance AI inference solutions at scale, demonstrating NVIDIA's GPU technology and Kubernetes. As a Solutions Architect focused on inference, you'll collaborate closely with our engineering, DevOps, and customers to develop enter ...

    Santa Clara $152,000 - $241,500 (USD) Full time

    20 hours ago

  • NVIDIA is at the forefront of the generative AI revolution. The Algorithmic Model Optimization Team specifically focuses on optimizing generative AI models such as large language models (LLMs) and diffusion models for maximal inference efficiency using techniques ranging from quan ...

    Santa Clara

    1 month ago

  • Senior GenAI Algorithms Engineer - Post-Training Optimizations

    Seeking a Senior Deep Learning Algorithms Engineer to improve innovative LLMs, VLMs, and multi-modality models. · You will design, implement, and productionize model optimization algorithms for inference and deployment on NVIDIA's latest hardware platforms. · ...

    Santa Clara $152,000 - $287,500 (USD) Full time

    4 weeks ago

  • Senior GenAI Algorithms Engineer - Post-Training Optimizations

    We are seeking a Senior Deep Learning Algorithms Engineer to improve innovative LLMs, VLMs, and multi-modality models. · In this role, you will design, implement, and productionize model optimization algorithms for inference and deployment on NVIDIA's latest hardware platforms. · De ...

    Santa Clara $152,000 - $287,500 (USD)

    4 weeks ago

  • Senior GenAI Algorithms Engineer - Post-Training Optimizations

    Job summary · NVIDIA is at the forefront of the generative AI revolution. The Algorithmic Model Optimization Team specifically focuses on optimizing generative AI models such as large language models (LLMs) and diffusion models for maximal inference efficiency using techniques ran ...

    Santa Clara $184,000 - $287,500 (USD)

    4 weeks ago

  • Senior Staff Machine Learning Engineer

    At d-Matrix, we are focused on unleashing the potential of generative AI to power the transformation of technology. We are at the forefront of software and hardware innovation, pushing the boundaries of what is possible. · ...

    Santa Clara $155,000 - $250,000 (USD) Full time

    2 weeks ago

  • Senior Software Engineer, Deep Learning

    NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It's a unique legacy of innovation that's fueled by great technology and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of ...

    Santa Clara $184,000 - $356,500 (USD)

    3 days ago

  • Senior Deep Learning Software Engineer

    We are looking for outstanding Deep Learning Software Engineers to develop and productize NVIDIA's deep learning solutions in autonomous driving vehicles. · Train, fine-tune, optimize and customize perception DNNs in low precision (FP16/INT8) · Apply sophisticated quantization of ...

    Santa Clara $152,000 - $287,500 (USD) Full time

    1 month ago

  • Senior DL Algorithms Engineer

    We are now looking for a Senior DL Algorithms Engineer. We are seeking a highly skilled Deep Learning Algorithms Engineer with hands-on experience optimizing and deploying Large Language Models (LLMs), Vision-Language Models (VLMs), and World Foundation Models (WFMs) in production environm ...

    Santa Clara $148,000 - $287,500 (USD)

    1 month ago