Lead Engineer, Inference Platform - Seattle
2 weeks ago

Job summary
We're looking for a Lead Engineer, Inference Platform to join our team building the inference platform for embedding models that power semantic search, retrieval, and AI-native features across MongoDB Atlas. This role is part of the broader Search AI Platform team and involves close collaboration with AI engineers and researchers from our acquisition who are developing industry-leading embedding models.
As Lead Engineer, Inference Platform, you'll be hands-on with design and implementation while working with engineers across experience levels to build a robust, scalable system.
- 8+ years of engineering experience in backend systems, ML infrastructure, and scalable platform development, with the ability to provide technical leadership and guidance to a team of engineers
Similar jobs
Software Engineer 3
1 week ago
Build core systems and services for model inference at scale in a hybrid working model based in Seattle · 2+ years of experience building backend or infrastructure systems at scale · Strong software engineering skills in languages such as Go, Rust, Python, or C++ · We're looking for a ...
AI Inference Engineer
1 month ago
We are looking for an AI Inference Engineer with a solid background in speech recognition and model inference to develop a state-of-the-art automatic speech recognition system and ship it to various Zoom products. · ...
AI Inference Engineer
1 week ago
We are looking for an AI Inference Engineer with a solid background in speech recognition and model inference. This role will include collaboration with cross-functional teams to deliver high-impact projects from the ground up. · About The Team · Zoom's AI Speech Team is developing sp ...
NVIDIA is at the forefront of the generative AI revolution. The Algorithmic Model Optimization Team specifically focuses on optimizing generative AI models such as large language models (LLMs) and diffusion models for maximal inference efficiency using techniques ranging from quant ...
We are looking for talented individuals to join us for an internship in 2026. PhD Internships at ByteDance aim to provide students with the opportunity to actively contribute to our products and research, and to the organization's future plans and emerging technologies. · We are ...
The Cloud Inference team scales and optimizes Claude to serve the massive audiences of developers and enterprise companies across AWS, GCP, Azure, and future cloud service providers (CSPs). We own the end-to-end product of Claude on each cloud platform—from API integration and in ...
Software Engineer
3 weeks ago
Job summary · The mission of our AML team is to push the next-generation AI infrastructure and recommendation platform for the ads ranking, search ranking, live & ecom ranking in our company. · Responsibilities · Responsible for the design and implementation of distributed inferen ...
Software Engineer
3 weeks ago
The mission of our AML team is to push the next-generation AI infrastructure and recommendation platform for the ads ranking, search ranking, and live & ecom ranking in our company. Currently, we are looking for a Machine Learning Engineer, Model Serving Infrastructure, to join our team to s ...
We are looking for talented individuals to join our team in 2026. As a graduate, you will get opportunities to pursue bold ideas, tackle complex challenges, and unlock limitless growth. · Design and build large-scale, container-based cluster management and orchestration systems w ...
The Inference Infrastructure team at ByteDance is looking for talented individuals to join them for an internship in 2026. We aim to provide students with hands-on experience in developing fundamental skills and exploring potential career paths. · Candidates can apply to a maximum ...
We're looking for an Engineering Manager to build and lead a new team focused on developer productivity within Inference: making every engineer in the org dramatically more effective at building, testing, and shipping inference software. · This is a leadership role at the inter ...
The Inference Infrastructure team is the creator and open-source maintainer of AIBrix, a Kubernetes-native control plane for large-scale LLM inference. · We are part of ByteDance's Core Compute Infrastructure organization, responsible for designing and operating the platforms tha ...
Staff GenAI Engineer
2 weeks ago
The Apple Data Platform team powers data analytics, exploration, and feature engineering that fuel Siri, Search, Music, Maps, iCloud and many other beloved products in the Apple ecosystem. As a Staff GenAI Engineer on the Apple Data Platform group's GenAI Platform team you'll pla ...
Software Dev Engineer, EC2 Nitro
1 day ago
We're seeking an exceptional Software Development Engineer to build and optimize the performance measurement infrastructure for some of the most computationally intensive AI/ML workloads on AWS. · This position offers the unique opportunity to shape the future of machine learning ...
The Cloud Inference team scales and optimizes Claude to serve the massive audiences of developers and enterprise companies across AWS, GCP, Azure, and future cloud service providers (CSPs). We own the end-to-end product of Claude on each cloud platform, from API integration ...
Tech Lead Software Engineer
1 month ago
The Inference Infrastructure team is the creator and open-source maintainer of AIBrix, a Kubernetes-native control plane for large-scale LLM inference. · We are expanding our focus on LLM inference infrastructure to support new AI workloads. · We are looking for engineers passion ...
Software Dev Engineer, EC2 Nitro
1 day ago
We're seeking an exceptional Software Development Engineer to build and optimize the performance measurement infrastructure for some of the most computationally intensive AI/ML workloads on AWS. · Design and build foundational infrastructure for ML performance measurement that sc ...
Software Dev Engineer, EC2 Nitro
1 day ago
We're seeking an exceptional Software Development Engineer to build and optimize the performance measurement infrastructure for some of the most computationally intensive AI/ML workloads on AWS. · ...
Software Dev Engineer, EC2 Nitro
21 hours ago
This job is with Amazon to revolutionize accelerated computing in the cloud. Join the EC2 Nitro Machine Learning Systems team to build and optimize performance measurement infrastructure for AI/ML workloads on AWS. · We're seeking a Software Development Engineer to establish EC2 ...
The Inference Infrastructure team is responsible for designing and operating the platforms that power microservices, big data, distributed storage, machine learning training and inference, and edge computing across multi-cloud and global datacenters. · We are looking for engineers passionat ...