Technical Program Manager, Inference Performance

San Francisco, CA

3 weeks ago

Senior Deep Learning Inference Performance Architect

We are now looking for a Senior Deep Learning Inference Performance Architect · NVIDIA is seeking a Senior Performance Architect - a creative engineer who loves to squeeze out every cycle of performance from deep learning software. The Inference Architecture team does groundbreak ...

Santa Clara $184,000 - $356,500 (USD)

1 week ago

Software Engineer, ML Inference Performance

The era of pervasive AI has arrived. In this era, organizations will use generative AI to unlock hidden value in their data, accelerate processes, reduce costs, drive efficiency and innovation to fundamentally transform their businesses and operations at scale. · SambaNova Suite ...

Palo Alto Full time

2 days ago

Senior Deep Learning Inference Performance Architect

We are now looking for a Senior Deep Learning Inference Performance Architect · NVIDIA is seeking a Senior Performance Architect - a creative engineer who loves to squeeze out every cycle of performance from deep learning software. The Inference Architecture team does groundbreak ...

US, CA, Santa Clara $184,000 - $356,500 (USD) per year

1 week ago

Senior Deep Learning Inference Performance Architect

We are now looking for a Senior Deep Learning Inference Performance Architect. NVIDIA is seeking a creative engineer who loves to squeeze out every cycle of performance from deep learning software. · The Inference Architecture team does groundbreaking hardware-software co-design wo ...

Santa Clara $224,000 - $356,500 (USD)

1 month ago

Senior Deep Learning Inference Performance Architect

We are now looking for a Senior Deep Learning Inference Performance Architect. · NVIDIA is seeking a Senior Performance Architect - a creative engineer who loves to squeeze out every cycle of performance from deep learning software. ...

Santa Clara, CA

1 month ago

High-performance AI inference solutions Engineer

WHAT YOU DO AT AMD CHANGES EVERYTHING · At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progr ...

San Jose $163,360 - $245,040 (USD)

1 week ago

Senior Compiler Engineer, AI Inference Performance

NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing — with the GPU acting as the brain of computers ...

Santa Clara, CA

4 days ago

High-performance AI inference solutions Engineer

WHAT YOU DO AT AMD CHANGES EVERYTHING · At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progr ...

San Jose, CA

1 week ago

Senior Compiler Engineer, AI Inference Performance

NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing — with the GPU acting as the brain of computers ...

US, CA, Santa Clara

4 days ago

AI Software Engineer, LLM Inference Performance Analysis - New College Grad 2026

Job summary: NVIDIA is at the forefront of the generative AI revolution. We are looking for a Software Engineer to join our performance engineering team. · Qualifications: Master's or PhD in Computer Science, Computer Engineering, or a related field · Strong hands-on programming ...

Santa Clara $124,000 - $218,500 (USD)

1 month ago

AI Software Engineer, LLM Inference Performance Analysis - New College Grad 2026

NVIDIA is at the forefront of the generative AI revolution. · Analyze the performance of LLMs running on NVIDIA Compute Platforms using profiling, benchmarking, and performance analysis tools. · Understand and find opportunities for compiler optimization pipelines... · ...

Santa Clara $124,000 - $218,500 (USD) Full time

1 month ago

AI Software Engineer, LLM Inference Performance Analysis - New College Grad 2026

We are looking for a Software Engineer, Performance Analysis, and Optimization for LLM Inference to join our performance engineering team. · In this role, you will focus on improving the efficiency and scalability of large language model (LLM) inference on NVIDIA Computing Platfo ...

Santa Clara, CA

1 month ago

Software Engineer

We are looking for a GPU Kernel Engineer to design, build, and optimize low-level compute kernels that power our large-scale, GPU-accelerated AI inference platform. · ...

San Mateo

1 month ago

Senior / Principal Inference Engineer - ML Platform

We're building the tools and platform that empower our community to bring any experience that they can imagine to life. Our vision is to reimagine the way people come together, from anywhere in the world, and on any device. · 4+ years of professional experience. · Bachelor's degr ...

San Mateo $327,000 - $397,460 (USD) Full time

1 month ago

Member of Technical Staff, Performance Optimization

We're looking for a Software Engineer focused on Performance Optimization to help push the boundaries of speed and efficiency across our AI infrastructure. · Optimize system and GPU performance for high-throughput AI workloads across training and inference · Analyze and improve l ...

San Mateo $175,000 - $220,000 (USD)

1 month ago

Senior / Principal Inference Engineer - ML Platform

As a Senior / Principal Inference Engineer on ML Platform you will build the next generation of ML Ecosystem Tooling, specifically around model inference. ML Platform today supports billions of requests per day across our homepage, marketplace, economy, and more. We are looking f ...

San Mateo, CA, United States

1 week ago

Member of Technical Staff, Performance Optimization

We're looking for a Software Engineer focused on Performance Optimization to help push the boundaries of speed and efficiency across our AI infrastructure. · You'll take ownership of optimizing performance at every layer of the stack—from low-level GPU kernels to large-scale dist ...

San Mateo $175,000 - $220,000 (USD)

1 month ago

Software Engineer, Developer Experience

We are seeking a Software Engineer, DevEx & Evals to play a highly impactful role in shaping the Fireworks platform. You will be responsible for defining and building a cohesive developer journey that bridges the gap between experimentation and production. As the public platform ...

San Mateo

1 month ago

Python Developer

FriendliAI is hiring a Python Engineer to build developer tools for our users and internal engineers. You'll design ergonomic & stable APIs, reliable releases to PyPI, and drive top-tier developer experience across documentation, examples, and tooling. Owning the Python SDK lif ...

San Mateo

1 month ago

Software Engineer, ML Inference Performance - San Mateo, CA

