Staff Software Engineer, Inference - United States
1 day ago

Job description
, consectetur adipiscing elit. Nullam tempor vestibulum ex, eget consequat quam pellentesque vel. Etiam congue sed elit nec elementum. Morbi diam metus, rutrum id eleifend ac, porta in lectus. Sed scelerisque a augue et ornare.
Donec lacinia nisi nec odio ultricies imperdiet.
Morbi a dolor dignissim, tristique enim et, semper lacus. Morbi laoreet sollicitudin justo eget eleifend. Donec felis augue, accumsan in dapibus a, mattis sed ligula.
Vestibulum at aliquet erat. Curabitur rhoncus urna vitae quam suscipit
, at pulvinar turpis lacinia. Mauris magna sem, dignissim finibus fermentum ac, placerat at ex. Pellentesque aliquet, lorem pulvinar mollis ornare, orci turpis fermentum urna, non ullamcorper ligula enim a ante. Duis dolor est, consectetur ut sapien lacinia, tempor condimentum purus.
Access all high-level positions and get the job of your dreams.
Similar jobs
About Periodic Labs · We are an AI + physical sciences lab building state of the art models to make novel scientific discoveries. We are well funded and growing rapidly. Team members are owners who identity and solve problems without boundaries or bureaucracy. We eagerly learn ne ...
1 day ago
About BentoML · BentoML is a leading inference platform provider that helps AI teams run large language models and other generative AI workloads at scale. With support from investors such as DCM, enterprises around the world rely on us for consistent scalability and performance i ...
1 week ago
· Generative AI Inference Engineer · About the role: · We are seeking passionate Machine Learning Engineers to join our Inference team, focusing on the creative applications of generative AI models. The ideal candidate will have substantial experience developing and running inf ...
1 week ago
We are looking for an experienced Field-Applications Engineer to help deploy a new generation of code translation tools enabled by AI and modern verification techniques. · Deploy and manage containerized services using Docker. · Deploy and run Python based GenAI pipelines interac ...
1 month ago
· Elevating the quality of human life through every conversation · ML Engineer - Inference · Location: United States · Experience: 5 years · About the Team: · At , our team is dedicated to revolutionizing the field of Conversation AI. We are a collaborative and innovative group ...
1 week ago
Member of Technical Staff, Exceptional Generalist
Only for registered members
+Inferact's mission is to grow vLLM as the world's AI inference engine and accelerate AI progress by making inference cheaper and faster. · +This is a globally remote opportunity for exceptional generalist engineers who can work across the entire vLLM stack: from low-level GPU ke ...
1 month ago
Description · Do you thrive on technical leadership and building cutting-edge AI systems? · Are you ready to drive innovation at the intersection of AI and edge computing? · Join the Akamai Inference Cloud Team · The Akamai Inference Cloud team is part of Akamai's Cloud Technolog ...
1 week ago
About the Role · We are looking for a Member of Technical Staff (MTS) to play a key technical leadership role in designing and advancing Wind River's next‑generation intelligent systems platform. This position is ideal for an engineer who thrives at the intersection of cloud‑nati ...
1 day ago
Build a Safer World. · TRM Labs provides blockchain analytics and AI solutions to help law enforcement and national security agencies, financial institutions, and cryptocurrency businesses detect, investigate, and disrupt crypto-related fraud and financial crime. TRM's blockchai ...
1 week ago
+Nebius is leading a new era in cloud computing to serve the global AI economy. · +Deep Technical Sourcing (LLM, Inference, Systems, GPU): Proactively identify and engage senior-level engineers and researchers across a wide range of AI/ML and systems domains.Use advanced sourcing ...
3 weeks ago
· Why work at Nebius · Nebius is leading a new era in cloud computing to serve the global AI economy. We create the tools and resources our customers need to solve real-world challenges and transform industries, without massive infrastructure costs or the need to build large in- ...
1 week ago
NVIDIA is the platform upon which every new AI-powered application is built. We are seeking a Senior Software Engineer focused on container and cloud infrastructure. You will help design and implement our core container strategy for NVIDIA Inference Microservices (NIMs) and our h ...
1 day ago
Description · Do you thrive on building the future of AI infrastructure? · Are you ready to lead a world-class team at the intersection of AI and edge computing? · Join the Akamai Inference Cloud Team · The Akamai Inference Cloud team is part of Akamai's Cloud Technology Group. W ...
1 day ago
About BentoML · BentoML is a leading inference platform provider that helps AI teams run large language models and other generative AI workloads at scale. With support from investors such as DCM, enterprises around the world rely on us for consistent scalability and performance i ...
1 week ago
Description · Do you thrive on solving complex technical challenges in AI infrastructure? · Are you ready to architect the future of AI at the edge? · Join the Akamai Inference Cloud Team · The Akamai Inference Cloud team is part of Akamai's Cloud Technology Group. We design, imp ...
1 week ago
· Are you a statistics expert eager to shape the future of AI? Large‑scale language models are evolving from clever chatbots into powerful engines of scientific discovery. With high‑quality training data, tomorrow's AI can democratize world‑class education, keep pace with cuttin ...
5 days ago
We are seeking experienced AI Developers to help us shape the future of OCI Networking with AI. · This position offers an opportunity to work on cutting-edge AI applications and includes a collaborative work environment, · competitive benefits,and the chance to contribute to tran ...
1 month ago
· Why work at Nebius · Nebius is leading a new era in cloud computing to serve the global AI economy. We create the tools and resources our customers need to solve real-world challenges and transform industries, without massive infrastructure costs or the need to build large in- ...
1 week ago
We're looking for a scrappy, resourceful Developer Advocate to help grow Token Factory, Nebius' high-performance inference platform built for teams running real production AI workloads at scale. · Help developers know, adopt and use Token Factory for inference use cases. · Build ...
3 weeks ago
· We're looking for an AI Engineer to design, implement, and optimize advanced AI systems that balance quality, performance, and cost. You'll work on inference pipelines, retrieval-augmented generation (RAG), and multi-agent patterns while building evaluation harnesses and simul ...
1 week ago