Staff Software Engineer, Inference - United States

United States

1 day ago

What You'll Do: Build low-latency inference pipelines for on-device deployment, enabling real-time next-token and diffusion-based control loops in robotics · Design and optimize distributed inference systems on GPU clusters, pushing throughput with large-batch serving and efficient ...



Similar jobs

  • Work in company

    LLM Inference Engineer

    About Periodic Labs · We are an AI + physical sciences lab building state-of-the-art models to make novel scientific discoveries. We are well funded and growing rapidly. Team members are owners who identify and solve problems without boundaries or bureaucracy. We eagerly learn ne ...

    United States

    1 day ago

  • Work in company

    Inference Optimization Engineer

    About BentoML · BentoML is a leading inference platform provider that helps AI teams run large language models and other generative AI workloads at scale. With support from investors such as DCM, enterprises around the world rely on us for consistent scalability and performance i ...

    North America

    1 week ago

  • Work in company

    Generative AI Inference Engineer

    About the role: We are seeking passionate Machine Learning Engineers to join our Inference team, focusing on the creative applications of generative AI models. The ideal candidate will have substantial experience developing and running inf ...

    United States

    1 week ago

  • Work in company

    Senior Engineer

    We are looking for an experienced Field Applications Engineer to help deploy a new generation of code translation tools enabled by AI and modern verification techniques. · Deploy and manage containerized services using Docker. · Deploy and run Python-based GenAI pipelines interac ...

    United States Full time

    1 month ago

  • Work in company

    ML Engineer

    Elevating the quality of human life through every conversation · ML Engineer - Inference · Location: United States · Experience: 5 years · About the Team: At , our team is dedicated to revolutionizing the field of Conversation AI. We are a collaborative and innovative group ...

    United States $110,000 - $190,000 (USD) per year

    1 week ago

  • Work in company Remote job

    Member of Technical Staff, Exceptional Generalist

    Inferact's mission is to grow vLLM as the world's AI inference engine and accelerate AI progress by making inference cheaper and faster. · This is a globally remote opportunity for exceptional generalist engineers who can work across the entire vLLM stack: from low-level GPU ke ...

    United States

    1 month ago

  • Work in company

    Senior II Software Engineer Lead

    Description · Do you thrive on technical leadership and building cutting-edge AI systems? · Are you ready to drive innovation at the intersection of AI and edge computing? · Join the Akamai Inference Cloud Team · The Akamai Inference Cloud team is part of Akamai's Cloud Technolog ...

    United States

    1 week ago

  • Work in company

    Member of Technical Staff

    About the Role · We are looking for a Member of Technical Staff (MTS) to play a key technical leadership role in designing and advancing Wind River's next‑generation intelligent systems platform. This position is ideal for an engineer who thrives at the intersection of cloud‑nati ...

    United States Full time

    1 day ago

  • Work in company

    Machine Learning Infrastructure Engineer

    Build a Safer World. · TRM Labs provides blockchain analytics and AI solutions to help law enforcement and national security agencies, financial institutions, and cryptocurrency businesses detect, investigate, and disrupt crypto-related fraud and financial crime. TRM's blockchai ...

    United States $115,000 - $195,000 (USD) per year Full time

    1 week ago

  • Work in company Remote job

    Lead Tech Recruiter

    Nebius is leading a new era in cloud computing to serve the global AI economy. · Deep Technical Sourcing (LLM, Inference, Systems, GPU): Proactively identify and engage senior-level engineers and researchers across a wide range of AI/ML and systems domains. Use advanced sourcing ...

    United States

    3 weeks ago

  • Work in company

    Lead Tech Recruiter

    Why work at Nebius · Nebius is leading a new era in cloud computing to serve the global AI economy. We create the tools and resources our customers need to solve real-world challenges and transform industries, without massive infrastructure costs or the need to build large in- ...

    United States

    1 week ago

  • Work in company

    Senior Software Engineer

    NVIDIA is the platform upon which every new AI-powered application is built. We are seeking a Senior Software Engineer focused on container and cloud infrastructure. You will help design and implement our core container strategy for NVIDIA Inference Microservices (NIMs) and our h ...

    United States $120,000 - $190,000 (USD) per year

    1 day ago

  • Work in company

    Senior Engineering Manager

    Description · Do you thrive on building the future of AI infrastructure? · Are you ready to lead a world-class team at the intersection of AI and edge computing? · Join the Akamai Inference Cloud Team · The Akamai Inference Cloud team is part of Akamai's Cloud Technology Group. W ...

    United States $190,000 - $280,000 (USD) per year

    1 day ago

  • Work in company

    Forward Deployed Engineer

    About BentoML · BentoML is a leading inference platform provider that helps AI teams run large language models and other generative AI workloads at scale. With support from investors such as DCM, enterprises around the world rely on us for consistent scalability and performance i ...

    North America $115,000 - $210,000 (USD) per year

    1 week ago

  • Work in company

    Principal Software Engineer

    Description · Do you thrive on solving complex technical challenges in AI infrastructure? · Are you ready to architect the future of AI at the edge? · Join the Akamai Inference Cloud Team · The Akamai Inference Cloud team is part of Akamai's Cloud Technology Group. We design, imp ...

    United States $190,000 - $320,000 (USD) per year

    1 week ago

  • Work in company

    Statistics Specialist

    Are you a statistics expert eager to shape the future of AI? Large‑scale language models are evolving from clever chatbots into powerful engines of scientific discovery. With high‑quality training data, tomorrow's AI can democratize world‑class education, keep pace with cuttin ...

    United States of America

    5 days ago

  • Work in company

    AI Principal Software Developer

    We are seeking experienced AI Developers to help us shape the future of OCI Networking with AI. · This position offers an opportunity to work on cutting-edge AI applications and includes a collaborative work environment, competitive benefits, and the chance to contribute to tran ...

    United States

    1 month ago

  • Work in company

    Developer Advocate

    Why work at Nebius · Nebius is leading a new era in cloud computing to serve the global AI economy. We create the tools and resources our customers need to solve real-world challenges and transform industries, without massive infrastructure costs or the need to build large in- ...

    United States $90,000 - $170,000 (USD) per year

    1 week ago

  • Work in company

    Developer Advocate

    We're looking for a scrappy, resourceful Developer Advocate to help grow Token Factory, Nebius' high-performance inference platform built for teams running real production AI workloads at scale. · Help developers know, adopt and use Token Factory for inference use cases. · Build ...

    United States

    3 weeks ago

  • Work in company

    AI Engineer

    We're looking for an AI Engineer to design, implement, and optimize advanced AI systems that balance quality, performance, and cost. You'll work on inference pipelines, retrieval-augmented generation (RAG), and multi-agent patterns while building evaluation harnesses and simul ...

    United States $90,000 - $170,000 (USD) per year

    1 week ago