Senior Software Engineer, AI Inference Systems - Santa Clara

Only for registered members Santa Clara, United States

1 month ago

Default job background
$184,000 - $356,500 (USD)
+

Job summary

We are seeking highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme efficiency.
+

Responsibilities

  • Architect and implement high-performance inference stacks.
  • Optimize GPU kernels and compilers.
  • Schedule containerized large-scale inference deployments on GPU clusters across clouds.

Lorem ipsum dolor sit amet
, consectetur adipiscing elit. Nullam tempor vestibulum ex, eget consequat quam pellentesque vel. Etiam congue sed elit nec elementum. Morbi diam metus, rutrum id eleifend ac, porta in lectus. Sed scelerisque a augue et ornare.

Donec lacinia nisi nec odio ultricies imperdiet.
Morbi a dolor dignissim, tristique enim et, semper lacus. Morbi laoreet sollicitudin justo eget eleifend. Donec felis augue, accumsan in dapibus a, mattis sed ligula.

Vestibulum at aliquet erat. Curabitur rhoncus urna vitae quam suscipit
, at pulvinar turpis lacinia. Mauris magna sem, dignissim finibus fermentum ac, placerat at ex. Pellentesque aliquet, lorem pulvinar mollis ornare, orci turpis fermentum urna, non ullamcorper ligula enim a ante. Duis dolor est, consectetur ut sapien lacinia, tempor condimentum purus.
Get full access

Access all high-level positions and get the job of your dreams.



Similar jobs

  • Only for registered members Santa Clara Full time $184,000 - $356,500 (USD)

    We are seeking highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme efficiency. · You'll architect and implement high-performance inference stacks, optimize GPU kernels and compilers, drive industry b ...

  • Only for registered members Palo Alto Full time

    We are looking for an RL Systems Engineer to build large-scale reinforcement learning systems for post-training, fine-tuning and alignment. · Build large-scale reinforcement learning systems for post-training, fine-tuning and alignmentScales RL workloads across multi-node cluster ...

  • Only for registered members Santa Clara

    We are seeking a Senior Member of Technical Staff to join our team as a technical leader in Large Language Model (LLM) inference and kernel optimization for AMD GPUs. · ...

  • Only for registered members Santa Clara

    We are seeking a senior software development engineer to join our LLM inference framework team. As a senior member of the team, you will be responsible for building and optimizing production-grade single-node and distributed inference runtimes for large language models on AMD GPU ...

  • Only for registered members Santa Clara $272,000 - $425,500 (USD)

    NVIDIA Dynamo is an innovative, open-source platform focused on efficient, scalable inference for large language and reasoning models in distributed GPU environments. By bringing to bear sophisticated techniques in serving architecture, GPU resource management, and intelligent re ...

  • Only for registered members Santa Clara Full time $272,000 - $425,500 (USD)

    NVIDIA Dynamo is an innovative platform focused on efficient inference for large language models in distributed GPU environments. · ...

  • Only for registered members Santa Clara Full time $272,000 - $431,250 (USD)

    NVIDIA Dynamo is an innovative platform for efficient inference of large language and reasoning models in distributed GPU environments. · We're searching for engineers to build the next generation of scalable AI systems as a Principal Software Engineer on the Dynamo project. · Ad ...

  • Only for registered members Santa Clara $184,000 - $287,500 (USD)

    We are now looking for a Senior Deep Learning Software Engineer to develop groundbreaking technologies in the inference systems software stack. · ...

  • Only for registered members Santa Clara $224,000 - $356,500 (USD)

    ++As an NVIDIAN you'll be immersed in a diverse supportive environment where everyone is inspired to do their best work.+ · +Join NVIDIA as a Senior Software Architect for AI Networking and help define the future of AI inference infrastructure.+Research design and implement advan ...

  • Only for registered members Santa Clara $224,000 - $356,500 (USD)

    We are now looking for a Senior Deep Learning Inference Performance ArchitectNVIDIA is seeking a creative engineer who loves to squeeze out every cycle of performance from deep learning software. · The Inference Architecture team does groundbreaking hardware-software co-design wo ...

  • Only for registered members Santa Clara $248,000 - $391,000 (USD)

    We are seeking a Principal Software Engineer to join our Software Infrastructure Team in Santa Clara, CA. · ...

  • Only for registered members Santa Clara Full time $248,000 - $391,000 (USD)

    We are seeking a Principal Software Engineer to join our Software Infrastructure Team in Santa Clara, CA. · We are building and maintaining the core infrastructure that powers our closed and open source AI models. · This team is at the heart of the NVIDIA AI Factory initiative, · ...

  • Only for registered members Santa Clara $224,000 - $356,500 (USD)

    We are seeking a Senior System Software Engineer to own and advance the AI-Perf analysis, NVIDIA's flagship framework for benchmarking, experimentation, and analysis of LLMs, · Generative AI, · and deep learning inference workloads.In this role, · You'll combinesystems research, ...

  • Only for registered members Santa Clara $108,000 - $178,250 (USD)

    We are now looking for a Deep Learning Software Engineer to develop groundbreaking technologies in the inference systems software stack. · We're looking for outstanding AI systems engineers to build innovative AI systems software to accelerate for AI inference. · ...

  • Only for registered members Santa Clara

    We are seeking individuals passionate about tackling challenges and driven by execution to join our team as a Compiler Architect. · BS 15+ Yrs / MS 12+ Yrs / PhD 10+ Yrs Computer Science or Electrical Engineering, · Deep experience in designing or leading compiler efforts using M ...

  • Only for registered members Santa Clara, CA

    As a Senior Member of Technical Staff, you will be a technical leader in Large Language Model (LLM) inference and kernel optimization for AMD GPUs. You will play a critical role in advancing high-performance LLM serving by optimizing GPU kernels, inference runtimes, and distribut ...

  • Only for registered members Santa Clara $200,000 - $322,000 (USD)

    We are seeking a Senior Software Engineer to join our Software Infrastructure Team in Santa Clara CA. This team is at the heart of the NVIDIA AI Factory initiative building and maintaining the core infrastructure that powers our closed and open source AI models. · What You'll Be ...

  • Only for registered members Santa Clara Full time $184,000 - $356,500 (USD)

    +We are seeking a Senior System Software Engineer to own and advance the AI-Perf analysis, NVIDIA's flagship framework for benchmarking, experimentation, and analysis of LLMs, Generative AI, and deep learning inference workloads. · +Lead the design, development, and roadmap of AI ...

  • Only for registered members Santa Clara Full time $108,000 - $178,250 (USD)

    +Job summary · We are now looking for a Deep Learning Software Engineer. NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It's a unique legacy of innovation that's fueled by great technology—and amazing people. Today, we ...

  • Only for registered members Santa Clara $224,000 - $425,500 (USD)

    NVIDIA is seeking an exceptional Manager, Deep Learning Inference Software, to lead a world-class engineering team advancing the state of AI model deployment. · ...