Staff + Senior Software Engineer, Cloud Inference - Seattle
1 month ago

Job description
Similar jobs
We are seeking an experienced Engineering Manager to lead the Cloud Inference team for Azure. You will lead your team to scale and optimize Claude to serve the massive audiences of developers and enterprise companies using Azure. · ...
1 month ago
The Cloud Inference team scales and optimizes Claude to serve the massive audiences of developers and enterprise companies across AWS, GCP, Azure, and future cloud service providers (CSPs). We own the end-to-end product of Claude on each cloud platform, from API integration · ...
1 month ago
We are seeking a visionary and hands-on Hardware Design Engineer to contribute to the design, definition, and implementation of our core AI inference engine. · ...
2 weeks ago
Imagine what you can do here. Apple is a place where extraordinary people gather to do their lives' best work. Together we create products and experiences people once couldn't have imagined, and now, can't imagine living without. It's the diversity of those people and their ideas ...
1 week ago
We're looking for a Lead Engineer, Inference Platform, to join our team building an inference platform for embedding models that power semantic search, retrieval, and AI-native features across MongoDB Atlas. · This role is part of the broader Search AI Platform team and involves close collabora ...
1 month ago
AWS Utility Computing (UC) provides product innovations — from foundational services such as Amazon Elastic Compute Cloud (EC2), to new product innovations that continue to set AWS's services and features apart in the industry. · Come develop inference acceleration for AWS Neur ...
1 day ago
Build core systems and services for model inference at scale in a hybrid working model based in Seattle · 2+ years of experience building backend or infrastructure systems at scale · Strong software engineering skills in languages such as Go, Rust, Python, or C++ · We're looking for a ...
1 month ago
Summary · Imagine what you can do here. Apple is a place where extraordinary people gather to do their lives' best work. Together we create products and experiences people once couldn't have imagined, and now, can't imagine living without. It's the diversity of those people and th ...
1 week ago
We are looking for a talented Machine Learning Engineer to play a key role in building our core AI inference platform. You will be responsible for designing and developing critical components, including ML model deployment, innovative model optimization pipelines, and perfor ...
2 weeks ago
Software Engineer Intern (Inference Infrastructure) - 2026 Start (PHD)
Responsibilities · About the Team · The Inference Infrastructure team is the creator and open-source maintainer of AIBrix, a Kubernetes-native control plane for large-scale LLM inference. We are part of ByteDance's Core Compute Infrastructure organization, responsible for designi ...
2 days ago
The Principal AI/ML Software Engineer is responsible for evaluating, integrating, and optimizing cutting-edge technologies for AI/ML infrastructure. · ...
3 weeks ago
The Senior Principal AI/ML Software Engineer is responsible for evaluating, integrating and optimizing cutting-edge technologies for AI/ML infrastructure. This role guides key strategic decisions related to Oracle Cloud's AI infrastructure offerings and spearheads the design and ...
3 weeks ago
The Senior AI/ML Software Engineer is responsible for evaluating, integrating, and optimizing cutting-edge technologies for AI/ML infrastructure, focusing on achieving low latency, high throughput, and efficient resource utilization for both model training and inference at scale. · ...
3 weeks ago
Software Engineer Graduate (Inference Infrastructure) - 2026 Start (PHD)
We are looking for talented individuals to join our team in 2026. As a graduate, you will get opportunities to pursue bold ideas, tackle complex challenges, and unlock limitless growth. · Design and build large-scale, container-based cluster management and orchestration systems w ...
1 month ago
The Senior Principal AI/ML Software Engineer is responsible for evaluating, integrating, and optimizing cutting-edge technologies for AI/ML infrastructure, focusing on achieving low latency, high throughput, and efficient resource utilization for both model training and inference ...
1 week ago
Software Engineer Intern (Inference Infrastructure) - 2026 Start (PHD)
The Inference Infrastructure team is the creator and open-source maintainer of AIBrix, a Kubernetes-native control plane for large-scale LLM inference. · We are part of ByteDance's Core Compute Infrastructure organization, responsible for designing and operating the platforms tha ...
1 month ago
We are looking for a talented Machine Learning Engineer to play a key role in building our core AI inference platform. · ...
2 weeks ago
The Inference Infrastructure team is the creator and open-source maintainer of AIBrix, a Kubernetes-native control plane for large-scale LLM inference. · We are expanding our focus on LLM inference infrastructure to support new AI workloads, · We are looking for engineers passion ...
1 month ago
Software Development Manager, AI Inference Technology, Neuron SDK
As the SDM for the Neuron Inference Technology building blocks team, you will guide your expert AI engineers to build fundamental inference technology building blocks and libraries that enable AI developers to optimize models for inference on Trainium and Inferentia devices. · Manage ...
1 month ago
Imagine what we could do together. At Apple, new ideas have a way of becoming extraordinary products, services and customer experiences very quickly. · Bachelor's in CS or related field and/or equivalent experience · 4+ years in software engineering · ...
1 month ago