Senior Deep Learning Software Engineer, Inference - Santa Clara

Only for registered members Santa Clara, United States

1 month ago

$148,000 - $287,500 (USD)

Job summary

NVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference for our growing team. You will help design, build, and optimize the GPU-accelerated software that powers today's most sophisticated AI applications.

Job description

Lorem ipsum dolor sit amet
, consectetur adipiscing elit. Nullam tempor vestibulum ex, eget consequat quam pellentesque vel. Etiam congue sed elit nec elementum. Morbi diam metus, rutrum id eleifend ac, porta in lectus. Sed scelerisque a augue et ornare.

Donec lacinia nisi nec odio ultricies imperdiet.
Morbi a dolor dignissim, tristique enim et, semper lacus. Morbi laoreet sollicitudin justo eget eleifend. Donec felis augue, accumsan in dapibus a, mattis sed ligula.

Vestibulum at aliquet erat. Curabitur rhoncus urna vitae quam suscipit
, at pulvinar turpis lacinia. Mauris magna sem, dignissim finibus fermentum ac, placerat at ex. Pellentesque aliquet, lorem pulvinar mollis ornare, orci turpis fermentum urna, non ullamcorper ligula enim a ante. Duis dolor est, consectetur ut sapien lacinia, tempor condimentum purus.

Get full access

Access all high-level positions and get the job of your dreams.

Similar jobs

Work in company Remote job

Inference Engineer

Only for registered members

We're looking for engineers who can bridge the gap between ML research and high-performance inference. · JAX / Equinox / Pallas stack · Rust systems programming with a focus on developer experience · ...

San Francisco Bay Area

2 weeks ago

Work in company

AI Inference Engineer

Only for registered members

We are looking for an AI Inference Engineer with a solid background in speech recognition and model inference. · Developing state-of-the-art speech services for Zoom products. · Optimizing ASR inference systems for production deployment. · ...

San Jose (CA)

4 days ago

Work in company

AI Inference Engineer

Only for registered members

We are looking for an AI Inference Engineer with a solid background in speech recognition and model inference. · In this role, you will develop state-of-the-art automatic speech recognition system and ship it to various Zoom products. You will work on the most cutting edge speech ...

San Jose, CA

1 month ago

Work in company

AI Inference Engineer

Only for registered members

+Job summary · We are looking for an AI Inference Engineer with a solid background in speech recognition and model inference.Developing state-of-the-art speech services for Zoom products. · Optimizing ASR inference systems for production deployment. · +We are developing speech re ...

San Jose $151,800 - $332,200 (USD) Full time

1 month ago

Work in company

Principal Engineer Inference Stack

Only for registered members

We are looking for a strategic software engineering lead who is passionate about improving the performance of key applications and benchmarks.Join us as we shape the future of AI and beyond. · ...

Santa Clara

2 weeks ago

Work in company

AI Inference Engineer

Only for registered members

+We are looking for an AI Inference Engineer with a solid background in speech recognition and model inference. · +Developing state-of-the-art speech services for Zoom products. · Optimizing ASR inference systems for production deployment. · + ...

San Jose, CA

1 month ago

Work in company

AI Inference Engineer

Only for registered members

+ Develop state-of-the-art automatic speech recognition system for various Zoom products · + Work on cutting edge speech modeling and inference technologies with world-class speech scientists · + Collaborate with cross-functional teams to deliver high-impact projects from ground ...

San Jose $151,800 - $332,200 (USD) Full time

4 days ago

Work in company

Principal Engineer Inference Stack

Only for registered members

At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. · Develop techniques for optimizing scale-up and scale-out inference. · Develop methods and tooling to utilize dynam ...

Santa Clara, CA

2 weeks ago

Work in company

Inference Software Engineer

Only for registered members

About Etched: building AI chips that are hard-coded for individual model architectures. · Contribute to the architecture and design of the Sohu host software stack · ...

Cupertino, CA

1 week ago

Work in company

Engineering Manager, Deep Learning Inference

Only for registered members

NVIDIA is seeking an exceptional Manager, Deep Learning Inference Software, to lead a world-class engineering team advancing the state of AI model deployment. · ...

Santa Clara $224,000 - $431,250 (USD)

1 month ago

Work in company

Engineering Manager, Deep Learning Inference

Only for registered members

NVIDIA is seeking an exceptional Manager, Deep Learning Inference Software, to lead a world-class engineering team advancing the state of AI model deployment. · ...

Santa Clara $224,000 - $425,500 (USD)

1 month ago

Work in company

Deployment Engineer, AI Inference

Only for registered members

We are seeking a highly skilled Deployment Engineer to build and operate our cutting-edge inference clusters. · These clusters would provide the candidate an opportunity to work with the world's largest computer chip, the Wafer-Scale Engine (WSE), and the systems that harness its ...

Sunnyvale

1 month ago

Work in company

Director of Engineering, Inference Services

Only for registered members

We are looking for a Director of Engineering to own and scale our next-generation Inference Platform. · - Vision & Roadmap - Define and continuously refine the end-to-end Inference Platform roadmap, prioritizing low-latency, high-throughput model serving and world-class developer ...

Sunnyvale, CA

2 weeks ago

Work in company

Engineering Manager, Inference Platform

Only for registered members

We're looking for a deeply technical, hands-on engineering leader for our Inference Service Platform. · You will lead a high performing team to tackle a critical challenge: scaling LLM inference on Cerebras' advanced compute clusters and delivering a world-class, on-prem solution ...

Sunnyvale

1 week ago

Work in company

Senior Software Engineer – Inference Platform Infrastructure

Only for registered members

NVIDIA is seeking a Sr. Software Engineer – Inference Platform Infrastructure to build and automate the foundations for NVIDIA's inference services. · ...

Santa Clara $152,000 - $287,500 (USD) Full time

5 hours ago

Work in company

Senior Deep Learning Software Engineer, Inference

Only for registered members

NVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference for our growing team. · You will play a central role in improving high-performance open-source frameworks facilitating smooth deployment and serving of groundbreaking language models. · Your work will ...

Santa Clara $152,000 - $287,500 (USD)

1 month ago

Work in company

Senior Software Engineer, AI Inference Systems

Only for registered members

We seek highly skilled software engineers to join us building AI inference systems for large-scale models with extreme efficiency architecting high-performance inference stacks optimizing GPU kernels compilers driving industry benchmarks scaling workloads across multi-GPU environ ...

US, CA, Santa Clara

2 days ago

Work in company

Senior Software Engineer, AI Inference Systems

Only for registered members

We are seeking highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme efficiency. · You'll architect and implement high-performance inference stacks, optimize GPU kernels and compilers, drive industry b ...

Santa Clara $184,000 - $356,500 (USD) Full time

4 weeks ago

Work in company

Software Development Engineer- SGLang and Inference Stack

Only for registered members

We are seeking a skilled Software Development Engineer to join our team at AMD. As a core member of our team, you will play a pivotal role in optimizing and developing deep learning frameworks for AMD GPUs. · Skilled engineer with strong technical and analytical expertise in GPGP ...

Santa Clara

1 week ago

Work in company

Senior Software Engineer, Deep Learning Inference

Only for registered members

We are now looking for a Senior Software Engineer for Deep Learning Inference Would you like to make a big impact in Deep Learning by helping build a state-of-the-art inference framework for accelerating Deep Learning models especially Large Language Models on NVIDIA GPUsCraft an ...

Santa Clara $152,000 - $287,500 (USD) Full time

1 week ago

Senior Deep Learning Software Engineer, Inference - Santa Clara

Job summary

Job description

Similar jobs

Inference Engineer

AI Inference Engineer

AI Inference Engineer

AI Inference Engineer

Principal Engineer Inference Stack

AI Inference Engineer

AI Inference Engineer

Principal Engineer Inference Stack

Inference Software Engineer

Engineering Manager, Deep Learning Inference

Engineering Manager, Deep Learning Inference

Deployment Engineer, AI Inference

Director of Engineering, Inference Services

Engineering Manager, Inference Platform

Senior Software Engineer – Inference Platform Infrastructure

Senior Deep Learning Software Engineer, Inference

Senior Software Engineer, AI Inference Systems

Senior Software Engineer, AI Inference Systems

Software Development Engineer- SGLang and Inference Stack

Senior Software Engineer, Deep Learning Inference

Directory

for Recruiters

Information