Staff Software Engineer, Inference - United States

Only for registered members United States

1 day ago

What You'll DoBuild low-latency inference pipelines for on-device deployment, enabling real-time next-token and diffusion-based control loops in robotics · Design and optimize distributed inference systems on GPU clusters, pushing throughput with large-batch serving and efficient ...

Job description

Lorem ipsum dolor sit amet
, consectetur adipiscing elit. Nullam tempor vestibulum ex, eget consequat quam pellentesque vel. Etiam congue sed elit nec elementum. Morbi diam metus, rutrum id eleifend ac, porta in lectus. Sed scelerisque a augue et ornare.

Donec lacinia nisi nec odio ultricies imperdiet.
Morbi a dolor dignissim, tristique enim et, semper lacus. Morbi laoreet sollicitudin justo eget eleifend. Donec felis augue, accumsan in dapibus a, mattis sed ligula.

Vestibulum at aliquet erat. Curabitur rhoncus urna vitae quam suscipit
, at pulvinar turpis lacinia. Mauris magna sem, dignissim finibus fermentum ac, placerat at ex. Pellentesque aliquet, lorem pulvinar mollis ornare, orci turpis fermentum urna, non ullamcorper ligula enim a ante. Duis dolor est, consectetur ut sapien lacinia, tempor condimentum purus.

Get full access

Access all high-level positions and get the job of your dreams.

Similar jobs

Work in company

LLM Inference Engineer

Only for registered members

About Periodic Labs · We are an AI + physical sciences lab building state of the art models to make novel scientific discoveries. We are well funded and growing rapidly. Team members are owners who identity and solve problems without boundaries or bureaucracy. We eagerly learn ne ...

United States

1 day ago

Work in company

Inference Optimization Engineer

Only for registered members

About BentoML · BentoML is a leading inference platform provider that helps AI teams run large language models and other generative AI workloads at scale. With support from investors such as DCM, enterprises around the world rely on us for consistent scalability and performance i ...

North America

1 week ago

Work in company

Generative AI Inference Engineer

Only for registered members

· Generative AI Inference Engineer · About the role: · We are seeking passionate Machine Learning Engineers to join our Inference team, focusing on the creative applications of generative AI models. The ideal candidate will have substantial experience developing and running inf ...

United States

1 week ago

Work in company

Senior Engineer

Only for registered members

We are looking for an experienced Field-Applications Engineer to help deploy a new generation of code translation tools enabled by AI and modern verification techniques. · Deploy and manage containerized services using Docker. · Deploy and run Python based GenAI pipelines interac ...

United States Full time

1 month ago

Work in company

ML Engineer

Only for registered members

· Elevating the quality of human life through every conversation · ML Engineer - Inference · Location: United States · Experience: 5 years · About the Team: · At , our team is dedicated to revolutionizing the field of Conversation AI. We are a collaborative and innovative group ...

United States $110,000 - $190,000 (USD) per year

1 week ago

Work in company Remote job

Member of Technical Staff, Exceptional Generalist

Only for registered members

+Inferact's mission is to grow vLLM as the world's AI inference engine and accelerate AI progress by making inference cheaper and faster. · +This is a globally remote opportunity for exceptional generalist engineers who can work across the entire vLLM stack: from low-level GPU ke ...

United States

1 month ago

Work in company

Senior II Software Engineer Lead

Only for registered members

Description · Do you thrive on technical leadership and building cutting-edge AI systems? · Are you ready to drive innovation at the intersection of AI and edge computing? · Join the Akamai Inference Cloud Team · The Akamai Inference Cloud team is part of Akamai's Cloud Technolog ...

United States

1 week ago

Work in company

Member of Technical Staff

Only for registered members

About the Role · We are looking for a Member of Technical Staff (MTS) to play a key technical leadership role in designing and advancing Wind River's next‑generation intelligent systems platform. This position is ideal for an engineer who thrives at the intersection of cloud‑nati ...

United States Full time

1 day ago

Work in company

Machine Learning Infrastructure Engineer

Only for registered members

Build a Safer World. · TRM Labs provides blockchain analytics and AI solutions to help law enforcement and national security agencies, financial institutions, and cryptocurrency businesses detect, investigate, and disrupt crypto-related fraud and financial crime. TRM's blockchai ...

United States $115,000 - $195,000 (USD) per year Full time

1 week ago

Work in company Remote job

Lead Tech Recruiter

Only for registered members

+Nebius is leading a new era in cloud computing to serve the global AI economy. · +Deep Technical Sourcing (LLM, Inference, Systems, GPU): Proactively identify and engage senior-level engineers and researchers across a wide range of AI/ML and systems domains.Use advanced sourcing ...

United States

3 weeks ago

Work in company

Lead Tech Recruiter

Only for registered members

· Why work at Nebius · Nebius is leading a new era in cloud computing to serve the global AI economy. We create the tools and resources our customers need to solve real-world challenges and transform industries, without massive infrastructure costs or the need to build large in- ...

United States

1 week ago

Work in company

Senior Software Engineer

Only for registered members

NVIDIA is the platform upon which every new AI-powered application is built. We are seeking a Senior Software Engineer focused on container and cloud infrastructure. You will help design and implement our core container strategy for NVIDIA Inference Microservices (NIMs) and our h ...

United States $120,000 - $190,000 (USD) per year

1 day ago

Work in company

Senior Engineering Manager

Only for registered members

Description · Do you thrive on building the future of AI infrastructure? · Are you ready to lead a world-class team at the intersection of AI and edge computing? · Join the Akamai Inference Cloud Team · The Akamai Inference Cloud team is part of Akamai's Cloud Technology Group. W ...

United States $190,000 - $280,000 (USD) per year

1 day ago

Work in company

Forward Deployed Engineer

Only for registered members

North America $115,000 - $210,000 (USD) per year

1 week ago

Work in company

Principal Software Engineer

Only for registered members

Description · Do you thrive on solving complex technical challenges in AI infrastructure? · Are you ready to architect the future of AI at the edge? · Join the Akamai Inference Cloud Team · The Akamai Inference Cloud team is part of Akamai's Cloud Technology Group. We design, imp ...

United States $190,000 - $320,000 (USD) per year

1 week ago

Work in company

Statistics Specialist

Only for registered members

· Are you a statistics expert eager to shape the future of AI? Large‑scale language models are evolving from clever chatbots into powerful engines of scientific discovery. With high‑quality training data, tomorrow's AI can democratize world‑class education, keep pace with cuttin ...

United States of America

5 days ago

Work in company

AI Principal Software Developer

Only for registered members

We are seeking experienced AI Developers to help us shape the future of OCI Networking with AI. · This position offers an opportunity to work on cutting-edge AI applications and includes a collaborative work environment, · competitive benefits,and the chance to contribute to tran ...

United States

1 month ago

Work in company

Developer Advocate

Only for registered members

United States $90,000 - $170,000 (USD) per year

1 week ago

Work in company

Developer Advocate

Only for registered members

We're looking for a scrappy, resourceful Developer Advocate to help grow Token Factory, Nebius' high-performance inference platform built for teams running real production AI workloads at scale. · Help developers know, adopt and use Token Factory for inference use cases. · Build ...

United States

3 weeks ago

Work in company

AI Engineer

Only for registered members

· We're looking for an AI Engineer to design, implement, and optimize advanced AI systems that balance quality, performance, and cost. You'll work on inference pipelines, retrieval-augmented generation (RAG), and multi-agent patterns while building evaluation harnesses and simul ...

United States $90,000 - $170,000 (USD) per year

1 week ago

Staff Software Engineer, Inference - United States

Job description

Similar jobs

LLM Inference Engineer

Inference Optimization Engineer

Generative AI Inference Engineer

Senior Engineer

ML Engineer

Member of Technical Staff, Exceptional Generalist

Senior II Software Engineer Lead

Member of Technical Staff

Machine Learning Infrastructure Engineer

Lead Tech Recruiter

Lead Tech Recruiter

Senior Software Engineer

Senior Engineering Manager

Forward Deployed Engineer

Principal Software Engineer

Statistics Specialist

AI Principal Software Developer

Developer Advocate

Developer Advocate

AI Engineer

Directory

for Recruiters

Information