Senior GenAI Algorithms Engineer — Model Optimizations for Inference - Santa Clara, CA

Only for registered members Santa Clara, CA, United States

1 month ago

Job summary

NVIDIA seeks a Senior Deep Learning Algorithms Engineer to improve innovative generative AI models like LLMs,VLMs,multimodal and diffusion models.

We are looking for passionate engineers with strong foundations in both machine learning and software systems/architecture who are eager to make a broad impact across the AI stack.

Job description

Lorem ipsum dolor sit amet
, consectetur adipiscing elit. Nullam tempor vestibulum ex, eget consequat quam pellentesque vel. Etiam congue sed elit nec elementum. Morbi diam metus, rutrum id eleifend ac, porta in lectus. Sed scelerisque a augue et ornare.

Donec lacinia nisi nec odio ultricies imperdiet.
Morbi a dolor dignissim, tristique enim et, semper lacus. Morbi laoreet sollicitudin justo eget eleifend. Donec felis augue, accumsan in dapibus a, mattis sed ligula.

Vestibulum at aliquet erat. Curabitur rhoncus urna vitae quam suscipit
, at pulvinar turpis lacinia. Mauris magna sem, dignissim finibus fermentum ac, placerat at ex. Pellentesque aliquet, lorem pulvinar mollis ornare, orci turpis fermentum urna, non ullamcorper ligula enim a ante. Duis dolor est, consectetur ut sapien lacinia, tempor condimentum purus.

Get full access

Access all high-level positions and get the job of your dreams.

Similar jobs

Inference Frontend

1 week ago

Only for registered members Sunnyvale Full time

Cerebras Systems builds the world's largest AI chip. Our novel wafer-scale architecture provides AI compute power of dozens of GPUs on a single chip. · ...

Inference Engineer

2 weeks ago

Only for registered members San Francisco Bay Area Remote job

We're looking for engineers who can bridge the gap between ML research and high-performance inference. · JAX / Equinox / Pallas stack · Rust systems programming with a focus on developer experience · ...

Principal Engineer Inference Stack

1 week ago

Only for registered members Santa Clara, CA

At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. · Develop techniques for optimizing scale-up and scale-out inference. · Develop methods and tooling to utilize dynam ...

AI Inference Engineer

4 weeks ago

Only for registered members San Jose, CA

We are looking for an AI Inference Engineer with a solid background in speech recognition and model inference. · In this role, you will develop state-of-the-art automatic speech recognition system and ship it to various Zoom products. You will work on the most cutting edge speech ...

Solutions Architect, Inference Deployments

1 week ago

Only for registered members US, CA, Santa Clara

We're forming a team of innovators to roll out and enhance AI inference solutions at scale demonstrating NVIDIA's GPU technology and Kubernetes. · Help customers craft deploy and maintain scalable GPU accelerated inference pipelines on Kubernetes for large language models LLMs an ...

Head of Inference Kernels

2 weeks ago

Only for registered members San Jose, CA

As a core member of the team, you will play a pivotal role in leading a high-performing team to build optimized kernels and implement highly optimized inference stacks for state-of-the-art transformer models. · Architect Best-in-Class Inference Performance on Sohu: Deliver contin ...

AI Inference Engineer

4 hours ago

Only for registered members San Jose (CA)

We are looking for an AI Inference Engineer with a solid background in speech recognition and model inference. · Developing state-of-the-art speech services for Zoom products. · Optimizing ASR inference systems for production deployment. · ...

Principal Engineer Inference Stack

1 week ago

Only for registered members Santa Clara

We are looking for a strategic software engineering lead who is passionate about improving the performance of key applications and benchmarks.Join us as we shape the future of AI and beyond. · ...

Inference Software Engineer

4 hours ago

Only for registered members San Jose

Etched is building the world's first AI inference system purpose-built for transformers - delivering over 10x higher performance and dramatically lower cost and latency than a B200. · Support porting state-of-the-art models to our architecture. Help build programming abstractions ...

AI Inference Engineer

4 weeks ago

Only for registered members San Jose Full time $151,800 - $332,200 (USD)

+Job summary · We are looking for an AI Inference Engineer with a solid background in speech recognition and model inference.Developing state-of-the-art speech services for Zoom products. · Optimizing ASR inference systems for production deployment. · +We are developing speech re ...

AI Inference Engineer

4 weeks ago

Only for registered members San Jose, CA

+We are looking for an AI Inference Engineer with a solid background in speech recognition and model inference. · +Developing state-of-the-art speech services for Zoom products. · Optimizing ASR inference systems for production deployment. · + ...

Head of Inference Kernels

3 hours ago

Only for registered members San Jose

We are building the world's first AI inference system purpose-built for transformers - delivering over 10x higher performance and dramatically lower cost and latency than a B200. You will play a pivotal role in leading a high-performing team to build a suite of optimized kernels ...

Inference Software Engineer

2 days ago

Only for registered members Cupertino, CA

About Etched: building AI chips that are hard-coded for individual model architectures. · Contribute to the architecture and design of the Sohu host software stack · ...

AI Inference Engineer

1 month ago

Only for registered members San Francisco Bay Area

TheAI Inference Engineer at Quadric will port AI models to Quadric platform; optimize model deployment for efficient inference; profile and benchmark model performance. · Bachelor's or Master's in Computer Science and/or Electric Engineering. · 5+ years of experience in AI/LLM mo ...

Engineering Manager, Deep Learning Inference

1 month ago

Only for registered members Santa Clara $224,000 - $425,500 (USD)

NVIDIA is seeking an exceptional Manager, Deep Learning Inference Software, to lead a world-class engineering team advancing the state of AI model deployment. · ...

Engineering Manager, Deep Learning Inference

4 weeks ago

Only for registered members Santa Clara $224,000 - $431,250 (USD)

NVIDIA is seeking an exceptional Manager, Deep Learning Inference Software, to lead a world-class engineering team advancing the state of AI model deployment. · ...

Manager, Large Language Model Inference

2 weeks ago

Only for registered members Santa Clara, CA

We're seeking a highly skilled and driven Engineering Manager to take the lead in developing the next generation of LLM/VLM/VLA inference software technologies that will define the future of AI. At NVIDIA, we aren't just powering the AI revolution—we're accelerating it. · The Ten ...

Member of Technical Staff, Training and Inference

1 month ago

Only for registered members Santa Clara

Boson AI is an early-stage startup building large audio models for everyone to enjoy and use. · ...

Manager, Large Language Model Inference

1 month ago

Only for registered members Santa Clara $184,000 - $356,500 (USD)

NVIDIA is accelerating the AI revolution by developing cutting-edge deep learning models on every GPU. · We're seeking a highly skilled Engineering Manager to lead in developing the next generation of LLM inference software technologies. · Your work will be collaborative, interfa ...

Manager, Large Language Model Inference

1 week ago

Only for registered members US, CA, Santa Clara

+We're seeking a highly skilled and driven Engineering Manager to take the lead in developing the next generation of LLM/VLM/VLA inference software technologies that will define the future of AI. · +Lead and grow a team responsible for specialized kernel development, runtime opti ...

Director Product Manager – Inference Platforms

1 month ago

Only for registered members Santa Clara

We're seeking a Director Product Manager to manage and mentor a team of product managers responsible for the strategy, roadmap, and execution of AI and ML developer tools that power inference at scale on custom GPUs and heterogeneous hardware platforms. · The RoleThis is a high-i ...

Senior GenAI Algorithms Engineer — Model Optimizations for Inference - Santa Clara, CA

Job summary

Job description

Similar jobs

Inference Frontend

Inference Engineer

Principal Engineer Inference Stack

AI Inference Engineer

Solutions Architect, Inference Deployments

Head of Inference Kernels

AI Inference Engineer

Principal Engineer Inference Stack

Inference Software Engineer

AI Inference Engineer

AI Inference Engineer

Head of Inference Kernels

Inference Software Engineer

AI Inference Engineer

Engineering Manager, Deep Learning Inference

Engineering Manager, Deep Learning Inference

Manager, Large Language Model Inference

Member of Technical Staff, Training and Inference

Manager, Large Language Model Inference

Manager, Large Language Model Inference

Director Product Manager – Inference Platforms

Directory

for Recruiters

Information