AI Inference Engineer - San Jose, CA

San Jose, CA, United States

1 month ago

What you can expect · We are looking for an AI Inference Engineer with a solid background in speech recognition and model inference. In this role, you will develop a state-of-the-art automatic speech recognition system and ship it to various Zoom products. You will work on the most ...


Similar jobs

Inference Software Engineer

About Etched · Etched is building the world's first AI inference system purpose-built for transformers - delivering over 10x higher performance and dramatically lower cost and latency than a B200. With Etched ASICs, you can build products that would be impossible with GPUs, like ...

San Jose $100,000 - $120,000 (USD) per year

5 days ago

AI Inference Engineer

Job summary · We are looking for an AI Inference Engineer with a solid background in speech recognition and model inference. Developing state-of-the-art speech services for Zoom products. · Optimizing ASR inference systems for production deployment. · We are developing speech re ...

San Jose $151,800 - $332,200 (USD) Full time

1 month ago

AI Inference Engineer

What You Can Expect · We are looking for an AI Inference Engineer with a solid background in speech recognition and model inference. In this role, you will develop a state-of-the-art automatic speech recognition system and ship it to various Zoom products. You will work on the most ...

San Jose, CA $90,000 - $170,000 (USD) per year

1 week ago

AI Inference Engineer

Develop a state-of-the-art automatic speech recognition system for various Zoom products · Work on cutting-edge speech modeling and inference technologies with world-class speech scientists · Collaborate with cross-functional teams to deliver high-impact projects from ground ...

San Jose $151,800 - $332,200 (USD) Full time

2 weeks ago

Inference Software Engineer

San Jose, CA

4 days ago

AI Inference Engineer

We are looking for an AI Inference Engineer with a solid background in speech recognition and model inference. · Developing state-of-the-art speech services for Zoom products. · Optimizing ASR inference systems for production deployment. · ...

San Jose, CA

1 month ago

Remote job

Inference Engineer

We're looking for engineers who can bridge the gap between ML research and high-performance inference. · JAX / Equinox / Pallas stack · Rust systems programming with a focus on developer experience · ...

San Francisco Bay Area

1 month ago

Principal Engineer Inference Stack

We are looking for a strategic software engineering lead who is passionate about improving the performance of key applications and benchmarks. Join us as we shape the future of AI and beyond. · ...

Santa Clara

1 month ago

Principal Engineer Inference Stack

At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. · Develop techniques for optimizing scale-up and scale-out inference. · Develop methods and tooling to utilize dynam ...

Santa Clara, CA

1 month ago

Multimodal Model Training and Inference Optimization Engineer

The Vision-Applied Research team focuses on applied research in Generative AI and CV/Multimodal Understanding, and delivering intelligent solutions to ByteDance products. · Optimize large model training pipelines to improve efficiency, speed, and scalability. · Benchmark and prof ...

San Jose $136,800 - $359,720 (USD)

3 weeks ago

Engineering Manager, Deep Learning Inference

NVIDIA is seeking an exceptional Manager, Deep Learning Inference Software, to lead a world-class engineering team advancing the state of AI model deployment. · ...

Santa Clara $224,000 - $425,500 (USD)

1 month ago

Engineering Manager, Deep Learning Inference

NVIDIA is seeking an exceptional Manager, Deep Learning Inference Software, to lead a world-class engineering team advancing the state of AI model deployment. · ...

Santa Clara $224,000 - $431,250 (USD)

1 month ago

High-performance AI inference solutions Engineer

WHAT YOU DO AT AMD CHANGES EVERYTHING · At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progr ...

San Jose $163,360 - $245,040 (USD)

1 week ago

Senior Software Engineer, Quantized Inference

We are now looking for a Senior Software Engineer for Quantized Inference. NVIDIA is seeking software engineers to accelerate the discovery and deployment of efficient inference recipes for LLMs. A recipe defines which operators are transformed into low-precision or sparsified var ...

US, CA, Santa Clara

1 week ago

High-performance AI inference solutions Engineer

San Jose, CA

1 week ago

Multimodal Model Training and Inference Optimization Engineer

We are seeking an experienced Multimodal Model Training and Inference Optimization Engineer with expertise in optimizing AI model training and inference, including distributed training/inference and acceleration. The ideal candidate will work at the cutting edge of AI efficiency ...

San Jose, CA

3 weeks ago

Multimodal Model Training and Inference Optimization Engineer

We are seeking an experienced Multimodal Model Training and Inference Optimization Engineer with expertise in optimizing AI model training and inference, · Optimize large model training pipelines to improve efficiency, speed, and scalability. · Develop distributed training strategies s ...

San Jose $136,800 - $359,720 (USD)

3 weeks ago

Senior Deep Learning Software Engineer, Inference

NVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference for our growing team. As a key contributor, you will help design, build, and optimize the GPU-accelerated software that powers today's most sophisticated AI applications. Our team is responsible for d ...

Santa Clara $152,000 - $287,500 (USD)

1 week ago

Engineering Manager, Inference Platform

We're looking for a deeply technical, hands-on engineering leader for our Inference Service Platform. · You will lead a high performing team to tackle a critical challenge: scaling LLM inference on Cerebras' advanced compute clusters and delivering a world-class, on-prem solution ...

Sunnyvale

3 weeks ago

Multimodal Model Training and Inference Optimization Engineer

We are seeking an experienced Multimodal Model Training and Inference Optimization Engineer with expertise in optimizing AI model training and inference, including distributed training/inference and acceleration. The ideal candidate will work at the cutting edge of AI efficiency, ...

San Jose, CA

3 weeks ago
