Staff + Senior Software Engineer, Cloud Inference - Seattle
1 week ago

Job summary
The Cloud Inference team scales and optimizes Claude to serve the massive audiences of developers and enterprise companies across AWS, GCP, Azure, and future cloud service providers (CSPs). We own the end-to-end product of Claude on each cloud platform, from API integration and intelligent request routing to inference execution, capacity management, ...
Responsibilities
- Design and build infrastructure that serves Claude across multiple CSPs, accounting for differences in compute hardware, networking APIs, ...
- Collaborate with CSP partner engineering teams to resolve operational issues, influence provider roadmaps, and stand up end-to-end serving on new cloud platforms
Similar jobs
Engineering Manager, Cloud Inference Azure
1 month ago
We are seeking an experienced Engineering Manager to lead the Cloud Inference team for Azure. You will lead your team to scale and optimize Claude to serve the massive audiences of developers and enterprise companies using Azure. · ...
The Cloud Inference team scales and optimizes Claude to serve the massive audiences of developers and enterprise companies across AWS, GCP, Azure, and future cloud service providers (CSPs). We own the end-to-end product of Claude on each cloud platform, from API integration · ...
Software Development Manager
1 month ago
We're working with Annapurna Labs (Part of Amazon) on this exciting opportunity. Join a pioneering team at Annapurna Labs, an AWS company, dedicated to optimizing cutting-edge AI inference technology for cloud-scale machine learning accelerators like Trainium and Inferentia. · ...
Software Development Manager
1 month ago
We're working with Annapurna Labs (U.S.) Inc. on this exciting opportunity. · Join Annapurna Labs to lead a team at the forefront of AI inference technology, optimizing large language models for AWS's custom machine learning accelerators. · This role offers the chance to drive innov ...
Sr. SDM, AI Inference Technology, Neuron SDK
1 month ago
As the Sr. SDM for the Inference Technology Team, you will lead a strong team of managers and engineers to build fundamental inference technology building blocks and libraries that enable AI developers to optimize models for inference on Trainium and Inferentia devices. · ...
Lead Engineer, Inference Platform
2 weeks ago
We're looking for a Lead Engineer, Inference Platform to join our team building an inference platform for embedding models that power semantic search, retrieval, and AI-native features across MongoDB Atlas. · This role is part of the broader Search AI Platform team and involves close collabora ...
Software Development Manager
1 month ago
We're working with Annapurna Labs (Part of Amazon) on this exciting opportunity. · Join a pioneering team at Annapurna Labs, an AWS company, dedicated to optimizing cutting-edge AI inference technology for cloud-scale machine learning accelerators like Trainium and Inferentia. · Th ...
Software Engineer 3
1 week ago
Build core systems and services for model inference at scale in a hybrid working model based in Seattle · 2+ years of experience building backend or infrastructure systems at scale · Strong software engineering skills in languages such as Go, Rust, Python, or C++ · We're looking for a ...
We are looking for talented individuals to join us for an internship in 2026. PhD Internships at ByteDance aim to provide students with the opportunity to actively contribute to our products and research, and to the organization's future plans and emerging technologies. · We are ...
Principal Software Engineer
1 day ago
The Principal AI/ML Software Engineer is responsible for evaluating, integrating, and optimizing cutting-edge technologies for AI/ML infrastructure. · ...
Sr Principal AI Software Engineer
5 days ago
The Senior Principal AI/ML Software Engineer is responsible for evaluating, integrating and optimizing cutting-edge technologies for AI/ML infrastructure. This role guides key strategic decisions related to Oracle Cloud's AI infrastructure offerings and spearheads the design and ...
Senior Software Engineer
1 day ago
The Senior AI/ML Software Engineer is responsible for evaluating, integrating, and optimizing cutting-edge technologies for AI/ML infrastructure, focusing on achieving low latency, high throughput, and efficient resource utilization for both model training and inference at scale. · ...
Product Manager II, GKE AI and Data Storage
1 month ago
At Google, we put our users first. The world is always changing, so we need Product Managers who are continuously adapting and excited to work on products that affect millions of people every day. · ...
We are looking for talented individuals to join our team in 2026. As a graduate, you will get opportunities to pursue bold ideas, tackle complex challenges, and unlock limitless growth. · Design and build large-scale, container-based cluster management and orchestration systems w ...
The Inference Infrastructure team at ByteDance is looking for talented individuals to join them for an internship in 2026. · We aim to provide students with hands-on experience in developing fundamental skills and exploring potential career paths. · Candidates can apply to a maximum ...
The Inference Infrastructure team is the creator and open-source maintainer of AIBrix, a Kubernetes-native control plane for large-scale LLM inference. · We are part of ByteDance's Core Compute Infrastructure organization, responsible for designing and operating the platforms tha ...
Tech Lead Software Engineer
1 month ago
The Inference Infrastructure team is the creator and open-source maintainer of AIBrix, a Kubernetes-native control plane for large-scale LLM inference. · We are expanding our focus on LLM inference infrastructure to support new AI workloads, · We are looking for engineers passion ...
Product Manager II, GKE AI and Data Storage
1 month ago
The world is always changing, so we need Product Managers who are continuously adapting and excited to work on products that affect millions of people every day. · In this role, you will work cross-functionally to guide products from conception to launch by connecting the technic ...
As the SDM for the Neuron Inference Technology building blocks team, you will guide your expert AI engineers to build fundamental inference technology building blocks and libraries that enable AI developers to optimize models for inference on Trainium and Inferentia devices. · Manage ...
Senior Software Engineer
1 month ago
Imagine what we could do together. At Apple, new ideas have a way of becoming extraordinary products, services and customer experiences very quickly. · Bachelor's in CS or related field and/or equivalent experience · 4+ years in software engineering · ...