Work in company

Senior Deep Learning Inference Performance Architect, Senior Deep Learning Inference Performance Architect

Only for registered members

We are now looking for a Senior Deep Learning Inference Performance Architect · NVIDIA is seeking a Senior Performance Architect - a creative engineer who loves to squeeze out every cycle of performance from deep learning software. The Inference Architecture team does groundbreak ...

Santa Clara $184,000 - $356,500 (USD)

1 week ago

Work in company

Senior Deep Learning Inference Performance Architect

Only for registered members

We are now looking for a Senior Deep Learning Inference Performance Architect · NVIDIA is seeking a Senior Performance Architect - a creative engineer who loves to squeeze out every cycle of performance from deep learning software. The Inference Architecture team does groundbreak ...

US, CA, Santa Clara $184,000 - $356,500 (USD) per year

1 week ago

Work in company

Senior Deep Learning Inference Performance Architect

Only for registered members

We are now looking for a Senior Deep Learning Inference Performance ArchitectNVIDIA is seeking a creative engineer who loves to squeeze out every cycle of performance from deep learning software. · The Inference Architecture team does groundbreaking hardware-software co-design wo ...

Santa Clara $224,000 - $356,500 (USD)

1 month ago

Work in company

Senior Deep Learning Inference Performance Architect

Only for registered members

We are now looking for a Senior · Deep Learning Inference Performance Architect. · NVIDIA is seeking a Senior Performance Architect - · a creative engineer who loves to squeeze out every cycle of performance · from deep learning software. ...

Santa Clara, CA

1 month ago

Work in company

Senior Compiler Engineer, AI Inference Performance

Only for registered members

NVIDIA's invention of the GPU 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing — with the GPU acting as the brain of computers ...

Santa Clara, CA

4 days ago

Work in company

High-performance AI inference solutions Engineer

Only for registered members

WHAT YOU DO AT AMD CHANGES EVERYTHING · At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progr ...

San Jose, CA

1 week ago

Work in company

High-performance AI inference solutions Engineer

Only for registered members

WHAT YOU DO AT AMD CHANGES EVERYTHING · At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progr ...

San Jose $163,360 - $245,040 (USD)

1 week ago

Work in company

Software Engineer, ML Inference Performance

Only for registered members

The era of pervasive AI has arrived. In this era, organizations will use generative AI to unlock hidden value in their data, accelerate processes, reduce costs, drive efficiency and innovation to fundamentally transform their businesses and operations at scale. · SambaNova Suite ...

San Mateo, CA

2 days ago

Work in company

Software Engineer, ML Inference Performance

Only for registered members

The era of pervasive AI has arrived. In this era, organizations will use generative AI to unlock hidden value in their data, accelerate processes, reduce costs, drive efficiency and innovation to fundamentally transform their businesses and operations at scale. · SambaNova Suite ...

Palo Alto Full time

2 days ago

Work in company

AI Software Engineer, LLM Inference Performance Analysis - New College Grad 2026

Only for registered members

+Job summary · NVIDIA is at the forefront of the generative AI revolution. We are looking for a Software Engineer to join our performance engineering team. · +QualificationsMaster's or PhD in Computer Science, Computer Engineering, or a related field · Strong hands-on programming ...

Santa Clara $124,000 - $218,500 (USD)

1 month ago

Work in company

AI Software Engineer, LLM Inference Performance Analysis - New College Grad 2026

Only for registered members

+NVIDIA is at the forefront of the generative AI revolution. · +Analyze the performance of LLMs running on NVIDIA Compute Platforms using profiling, benchmarking, and performance analysis tools. · Understand and find opportunities for compiler optimization pipelines... · ...

Santa Clara $124,000 - $218,500 (USD) Full time

1 month ago

Work in company

AI Software Engineer, LLM Inference Performance Analysis - New College Grad 2026

Only for registered members

We are looking for a Software Engineer, Performance Analysis, and Optimization for LLM Inference to join our performance engineering team. · In this role, you will focus on improving the efficiency and scalability of large language model (LLM) inference on NVIDIA Computing Platfo ...

Santa Clara, CA

1 month ago

Work in company

Senior Software Development Engineer - LLM Kernel

Only for registered members

We are seeking a Senior Member of Technical Staff to join our team as a technical leader in Large Language Model (LLM) inference and kernel optimization for AMD GPUs. · ...

Santa Clara

1 month ago

Work in company

Senior Software Development Engineer – LLM Inference Framework

Only for registered members

We are seeking a senior software development engineer to join our LLM inference framework team. As a senior member of the team, you will be responsible for building and optimizing production-grade single-node and distributed inference runtimes for large language models on AMD GPU ...

Santa Clara

1 month ago

Work in company

Senior Deep Learning Architect, LLM Inference

Only for registered members

We are now looking for a Senior Deep Learning Architect, LLM Inference · NVIDIA is at the forefront of the generative AI revolution. The Inference Benchmarking (IB) team specifically focuses on inference server performance optimization for Large Language Models (LLMs). If you're ...

Santa Clara $184,000 - $356,500 (USD)

1 day ago

Work in company

Senior DL Algorithms Engineer

Only for registered members

We are now looking for a Senior DL Algorithms Engineer NVIDIA is seeking senior engineers who are mindful of performance analysis and optimization to help us squeeze every last clock cycle out of Deep Learning workloads. · This role offers an opportunity to directly impact the ha ...

Santa Clara $148,000 - $287,500 (USD)

1 month ago

Work in company

Senior DL Algorithms Engineer

Only for registered members

We are looking for a Senior DL Algorithms Engineer to help us squeeze every last clock cycle out of Deep Learning workloads. · ...

Santa Clara $184,000 - $356,500 (USD)

1 week ago

Work in company

Senior DL Algorithms Engineer

Only for registered members

We are now looking for a Senior DL Algorithms Engineer NVIDIA is seeking senior engineers who are mindful of performance analysis and optimization to help us squeeze every last clock cycle out of Deep Learning workloads. · Implement language and multimodal model inference as part ...

Santa Clara $152,000 - $287,500 (USD)

1 week ago

Work in company

Senior DL Algorithms Engineer

Only for registered members

We are now looking for a Senior DL Algorithms Engineer to help us squeeze every last clock cycle out of Deep Learning workloads. · PhD in CS, EE or CSEE or equivalent experience. · 5+ years of experience. · ...

Santa Clara $224,000 - $356,500 (USD)

1 month ago

Work in company

Principal Software Engineer

Only for registered members

NVIDIA is the platform for every new AI-powered application. We seek a Principal Software Engineer - AI Inference to advance open-source LLM serving. This role involves contributing to upstream inference engines like vLLM and SGLang. You will ensure they run outstandingly on NVID ...

Santa Clara $272,000 - $431,250 (USD)

5 days ago

Senior Compiler Engineer, AI Inference Performance - US, CA, Santa Clara

Job description

Similar jobs

Senior Deep Learning Inference Performance Architect, Senior Deep Learning Inference Performance Architect

Senior Deep Learning Inference Performance Architect

Senior Deep Learning Inference Performance Architect

Senior Deep Learning Inference Performance Architect

Senior Compiler Engineer, AI Inference Performance

High-performance AI inference solutions Engineer

High-performance AI inference solutions Engineer

Software Engineer, ML Inference Performance

Software Engineer, ML Inference Performance

AI Software Engineer, LLM Inference Performance Analysis - New College Grad 2026

AI Software Engineer, LLM Inference Performance Analysis - New College Grad 2026

AI Software Engineer, LLM Inference Performance Analysis - New College Grad 2026

Senior Software Development Engineer - LLM Kernel

Senior Software Development Engineer – LLM Inference Framework

Senior Deep Learning Architect, LLM Inference

Senior DL Algorithms Engineer

Senior DL Algorithms Engineer

Senior DL Algorithms Engineer

Senior DL Algorithms Engineer

Principal Software Engineer

Directory

for Recruiters

Information