Lead Engineer, Inference Platform - Seattle
1 month ago

Job description
, consectetur adipiscing elit. Nullam tempor vestibulum ex, eget consequat quam pellentesque vel. Etiam congue sed elit nec elementum. Morbi diam metus, rutrum id eleifend ac, porta in lectus. Sed scelerisque a augue et ornare.
Donec lacinia nisi nec odio ultricies imperdiet.
Morbi a dolor dignissim, tristique enim et, semper lacus. Morbi laoreet sollicitudin justo eget eleifend. Donec felis augue, accumsan in dapibus a, mattis sed ligula.
Vestibulum at aliquet erat. Curabitur rhoncus urna vitae quam suscipit
, at pulvinar turpis lacinia. Mauris magna sem, dignissim finibus fermentum ac, placerat at ex. Pellentesque aliquet, lorem pulvinar mollis ornare, orci turpis fermentum urna, non ullamcorper ligula enim a ante. Duis dolor est, consectetur ut sapien lacinia, tempor condimentum purus.
Access all high-level positions and get the job of your dreams.
Similar jobs
Dice is the leading career destination for tech experts at every stage of their careers. Our client eTek IT Services Inc is seeking the following Apply via Dice today Setup and operation of AI inference service monitoring for performance and availability Experience deploying and ...
2 weeks ago
We're looking for a systems-minded AI Software Engineer to join our core inference platform team. · Architect, extend, and optimize core components of our AI serving platform for throughput, latency, and scalability. · Customize open-source serving frameworks (e.g., vLLM) for pro ...
2 weeks ago
+Build core systems and services for model inference at scale in a hybrid working model based in Seattle+2+ years of experience building backend or infrastructure systems at scale+Strong software engineering skills in languages such as Go, Rust, Python, or C++We're looking for a ...
1 month ago
We are looking for a talented Machine Learning Engineer to play a key role in building our core AI inference platform.You will be responsible for designing and developing critical components, · including ML model deployment, · innovative model optimization pipelines, · and perfor ...
2 weeks ago
Senior GenAI Algorithms Engineer — Model Optimizations for Inference
Only for registered members
NVIDIA is at the forefront of the generative AI revolution The Algorithmic Model Optimization Team specifically focuses on optimizing generative AI models such as large language models (LLM) and diffusion models for maximal inference efficiency using techniques ranging from quant ...
1 month ago
We are looking for an AI Inference Engineer with a solid background in speech recognition and model inference to develop state-of-the-art automatic speech recognition system and ship it to various Zoom products. · ...
1 month ago
Software Engineer Intern (Inference Infrastructure) - 2026 Start (PHD)
Only for registered members
Responsibilities · About the Team · The Inference Infrastructure team is the creator and open-source maintainer of AIBrix, a Kubernetes-native control plane for large-scale LLM inference. We are part of ByteDance's Core Compute Infrastructure organization, responsible for designi ...
2 days ago
We are looking for a talented Machine Learning Engineer to play a key role in building our core AI inference platform. · ...
2 weeks ago
We are looking for an AI Inference Engineer with a solid background in speech recognition and model inference. This role will include collaboration with cross-functional teams to deliver high-impact projects from the ground up.About The Team Zoom's AI Speech Team is developing sp ...
1 month ago
What You Can Expect · We are looking for an AI Inference Engineer with a solid background in speech recognition and model inference. In this role, you will develop state-of-the-art automatic speech recognition system and ship it to various Zoom products. You will work on the most ...
1 week ago
We are looking for a talented Machine Learning Research Scientist to play a key role in designing our core AI inference platform.You will be actively conducting high impact research on novel ML solutions towards the next generation of efficient intelligence platforms. · ...
2 weeks ago
The Cloud Inference team scales and optimizes Claude to serve the massive audiences of developers and enterprise companies across AWS, GCP, Azure, and future cloud service providers (CSPs). We own the end-to-end product of Claude on each cloud platform—from API integration and in ...
1 month ago
We're looking for a systems-minded AI Software Engineer to join our core inference platform team. · Architect, extend, and optimize core components of our AI serving platform for throughput, latency, and scalability. · Customize open-source serving frameworks (e.g., vLLM) for pro ...
2 weeks ago
We are currently seeking a senior-level Engineer with distinguished expertise to join the Dynamo engineering team. As a Distinguished Engineer, you will be a key technical leader, helping drive technology and product direction, provide deep technical expertise and guidance to the ...
2 weeks ago
Software Engineer Graduate (Inference Infrastructure) - 2026 Start (PHD)
Only for registered members
We are looking for talented individuals to join our team in 2026. As a graduate, you will get opportunities to pursue bold ideas, tackle complex challenges, and unlock limitless growth. · Design and build large-scale, container-based cluster management and orchestration systems w ...
1 month ago
Anthropic's mission is to create reliable interpretable and steerable AI systems. · ...
2 weeks ago
+Job summary · We are building the next generation of ML infrastructure that powers AI capabilities across Apple's products and services. · +ResponsibilitiesYou'll work with cutting-edge technologies including vLLM, Ray, TensorRT-LLM... · ...
2 weeks ago
We're looking for an Engineering Manager to build and lead a new team focused on developer productivity within Inference: · making every engineer in the org dramatically more effective at building, testing, and shipping inference software. · This is a leadership role at the inter ...
1 month ago
The mission of our AML team is to push the next-generation AI infrastructure and recommendation platform for the ads ranking search ranking live & ecom ranking in our company currently we are looking for machine learning engineer model serving infrastructure to join our team to s ...
1 month ago
+Job summary · The mission of our AML team is to push the next-generation AI infrastructure and recommendation platform for the ads ranking, search ranking, live & ecom ranking in our company. · +ResponsibilitiesResponsible for the design and implementation of distributed inferen ...
1 month ago