Software Engineer, ML Inference Performance - Palo Alto

Only for registered members Palo Alto, United States

2 days ago

Default job background
Full time
The era of pervasive AI has arrived. In this era, organizations will use generative AI to unlock hidden value in their data, accelerate processes, reduce costs, drive efficiency and innovation to fundamentally transform their businesses and operations at scale. · SambaNova Suite ...
Lorem ipsum dolor sit amet
, consectetur adipiscing elit. Nullam tempor vestibulum ex, eget consequat quam pellentesque vel. Etiam congue sed elit nec elementum. Morbi diam metus, rutrum id eleifend ac, porta in lectus. Sed scelerisque a augue et ornare.

Donec lacinia nisi nec odio ultricies imperdiet.
Morbi a dolor dignissim, tristique enim et, semper lacus. Morbi laoreet sollicitudin justo eget eleifend. Donec felis augue, accumsan in dapibus a, mattis sed ligula.

Vestibulum at aliquet erat. Curabitur rhoncus urna vitae quam suscipit
, at pulvinar turpis lacinia. Mauris magna sem, dignissim finibus fermentum ac, placerat at ex. Pellentesque aliquet, lorem pulvinar mollis ornare, orci turpis fermentum urna, non ullamcorper ligula enim a ante. Duis dolor est, consectetur ut sapien lacinia, tempor condimentum purus.
Get full access

Access all high-level positions and get the job of your dreams.



Similar jobs

  • Work in company

    Technical Program Manager, Inference Performance

    Only for registered members

    · ...

    San Francisco, CA

    3 weeks ago

  • We are now looking for a Senior Deep Learning Inference Performance Architect · NVIDIA is seeking a Senior Performance Architect - a creative engineer who loves to squeeze out every cycle of performance from deep learning software. The Inference Architecture team does groundbreak ...

    Santa Clara $184,000 - $356,500 (USD)

    1 week ago

  • Work in company

    Software Engineer, ML Inference Performance

    Only for registered members

    The era of pervasive AI has arrived. In this era, organizations will use generative AI to unlock hidden value in their data, accelerate processes, reduce costs, drive efficiency and innovation to fundamentally transform their businesses and operations at scale. · SambaNova Suite ...

    San Mateo, CA

    2 days ago

  • Work in company

    Senior Compiler Engineer, AI Inference Performance

    Only for registered members

    NVIDIA's invention of the GPU 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing — with the GPU acting as the brain of computers ...

    Santa Clara, CA

    4 days ago

  • Work in company

    Senior Deep Learning Inference Performance Architect

    Only for registered members

    We are now looking for a Senior Deep Learning Inference Performance Architect · NVIDIA is seeking a Senior Performance Architect - a creative engineer who loves to squeeze out every cycle of performance from deep learning software. The Inference Architecture team does groundbreak ...

    US, CA, Santa Clara $184,000 - $356,500 (USD) per year

    1 week ago

  • Work in company

    Senior Deep Learning Inference Performance Architect

    Only for registered members

    We are now looking for a Senior Deep Learning Inference Performance ArchitectNVIDIA is seeking a creative engineer who loves to squeeze out every cycle of performance from deep learning software. · The Inference Architecture team does groundbreaking hardware-software co-design wo ...

    Santa Clara $224,000 - $356,500 (USD)

    1 month ago

  • Work in company

    Senior Deep Learning Inference Performance Architect

    Only for registered members

    We are now looking for a Senior · Deep Learning Inference Performance Architect. · NVIDIA is seeking a Senior Performance Architect - · a creative engineer who loves to squeeze out every cycle of performance · from deep learning software. ...

    Santa Clara, CA

    1 month ago

  • Work in company

    High-performance AI inference solutions Engineer

    Only for registered members

    WHAT YOU DO AT AMD CHANGES EVERYTHING · At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progr ...

    San Jose $163,360 - $245,040 (USD)

    1 week ago

  • Work in company

    High-performance AI inference solutions Engineer

    Only for registered members

    WHAT YOU DO AT AMD CHANGES EVERYTHING · At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progr ...

    San Jose, CA

    1 week ago

  • Work in company

    Senior Compiler Engineer, AI Inference Performance

    Only for registered members

    NVIDIA's invention of the GPU 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing — with the GPU acting as the brain of computers ...

    US, CA, Santa Clara

    4 days ago

  • +Job summary · NVIDIA is at the forefront of the generative AI revolution. We are looking for a Software Engineer to join our performance engineering team. · +QualificationsMaster's or PhD in Computer Science, Computer Engineering, or a related field · Strong hands-on programming ...

    Santa Clara $124,000 - $218,500 (USD)

    1 month ago

  • +NVIDIA is at the forefront of the generative AI revolution. · +Analyze the performance of LLMs running on NVIDIA Compute Platforms using profiling, benchmarking, and performance analysis tools. · Understand and find opportunities for compiler optimization pipelines... · ...

    Santa Clara $124,000 - $218,500 (USD) Full time

    1 month ago

  • We are looking for a Software Engineer, Performance Analysis, and Optimization for LLM Inference to join our performance engineering team. · In this role, you will focus on improving the efficiency and scalability of large language model (LLM) inference on NVIDIA Computing Platfo ...

    Santa Clara, CA

    1 month ago

  • Work in company

    LLM Inference Engineer

    Only for registered members

    We're seeking an experienced LLM Inference Engineer to optimize our large language model (LLM) serving infrastructure.The ideal candidate has: · Extensive hands-on experience with state-of-the-art inference optimization techniques · A track record of deploying efficient, scalable ...

    Palo Alto, CA

    3 weeks ago

  • Work in company

    Member of Technical Staff, Applied Inference

    Only for registered members

    · About xAI · xAI's mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challengi ...

    Palo Alto, CA; San Francisco, CA

    2 days ago

  • Work in company

    Senior Software Engineer, Inference Platform

    Only for registered members

    We're looking for a Senior Engineer to help build the next-generation inference platform that supports embedding models used for semantic search, retrieval, and AI-native experiences in MongoDB Atlas. · Design and build components of a multi-tenant inference platform integrated d ...

    Palo Alto, CA

    1 week ago

  • Work in company

    Senior Software Engineer, Inference Platform

    Only for registered members

    We're looking for a Senior Engineer to help build the next-generation inference platform that supports embedding models used for semantic search retrieval and AI-native experiences in MongoDB Atlas. · We'll join the broader Search and AI Platform organization collaborate with ML ...

    Palo Alto, CA

    1 month ago

  • Work in company

    ML Performance Engineer

    Only for registered members

    ML Performance Engineer · Location: · Palo Alto (Onsite) · We're hiring an · ML Performance Engineer · to join a · Series B AI lab valued at $1.4B · that is building · best-in-class image models and products · (not a wrapper). · They have developed · in-house foundation models at ...

    Palo Alto, CA

    5 days ago

  • Work in company

    Member of Technical Staff, Inference

    Only for registered members

    About xAI's mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated and focused on engineering excellence. · This organization is for individuals who appreciate challenging thems ...

    Palo Alto, CA

    3 weeks ago

  • Work in company

    Staff Software Engineer, Ads ML Inference Infrastructure

    Only for registered members

    We're on a mission to bring everyone the inspiration to create a life they love, and that starts with the people behind the product. Discover a career where you ignite innovation for millions, transform passion into growth opportunities. · Lead and drive efforts to build next-gen ...

    Palo Alto, CA

    2 weeks ago