Senior Software Development Engineer, AI/ML, AWS Neuron, Model Inference - Seattle, WA
3 weeks ago

Job summary
The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium.The team works across multiple technology layers - from frameworks and kernels collaborating with compiler runtime collectives optimizing current performance contributing future architecture designs working closely with customers enable their models ensure optimal performance.
This role offers unique opportunity work intersection machine learning high-performance computing distributed architectures shape future AI acceleration technology.
Job description
, consectetur adipiscing elit. Nullam tempor vestibulum ex, eget consequat quam pellentesque vel. Etiam congue sed elit nec elementum. Morbi diam metus, rutrum id eleifend ac, porta in lectus. Sed scelerisque a augue et ornare.
Donec lacinia nisi nec odio ultricies imperdiet.
Morbi a dolor dignissim, tristique enim et, semper lacus. Morbi laoreet sollicitudin justo eget eleifend. Donec felis augue, accumsan in dapibus a, mattis sed ligula.
Vestibulum at aliquet erat. Curabitur rhoncus urna vitae quam suscipit
, at pulvinar turpis lacinia. Mauris magna sem, dignissim finibus fermentum ac, placerat at ex. Pellentesque aliquet, lorem pulvinar mollis ornare, orci turpis fermentum urna, non ullamcorper ligula enim a ante. Duis dolor est, consectetur ut sapien lacinia, tempor condimentum purus.
Access all high-level positions and get the job of your dreams.
Similar jobs
NVIDIA is at the forefront of the generative AI revolution The Algorithmic Model Optimization Team specifically focuses on optimizing generative AI models such as large language models (LLM) and diffusion models for maximal inference efficiency using techniques ranging from quant ...
About TikTok · TikTok is a leading destination for short-form mobile video where users can make and share creative content in an easier way.We are seeking an experienced Multimodal Model Training Engineer to optimize AI model training pipelines. · ...
+NVIDIA is at the forefront of the generative AI revolution The Algorithmic Model Optimization Team specifically focuses on optimizing generative AI models such as large language models (LLM) and diffusion models for maximal inference efficiency using techniques ranging from quan ...
Software Development Manager
1 month ago
We're working with Annapurna Labs (Part of Amazon) on this exciting opportunity. Join a pioneering team at Annapurna Labs, an AWS company, dedicated to optimizing cutting-edge AI inference technology for cloud-scale machine learning accelerators like Trainium and Inferentia. · ...
Software Development Manager
1 month ago
We're working with Annapurna Labs (U.S.) Inc. on this exciting opportunity.Join Annapurna Labs to lead a team at the forefront of AI inference technology, optimizing large language models for AWS's custom machine learning accelerators. · This role offers the chance to drive innov ...
AI Inference Engineer
1 month ago
We are looking for an AI Inference Engineer with a solid background in speech recognition and model inference to develop state-of-the-art automatic speech recognition system and ship it to various Zoom products. · ...
AI Inference Engineer
1 week ago
We are looking for an AI Inference Engineer with a solid background in speech recognition and model inference. This role will include collaboration with cross-functional teams to deliver high-impact projects from the ground up.About The Team Zoom's AI Speech Team is developing sp ...
Software Development Manager
1 month ago
We're working with Annapurna Labs (Part of Amazon) on this exciting opportunity. · Join a pioneering team at Annapurna Labs an AWS company dedicated to optimizing cutting-edge AI inference technology for cloud-scale machine learning accelerators like Trainium and Inferentia. · Th ...
Backend Engineer – Inference Optimization
4 days ago
We're looking for a Backend Engineer who thrives on solving some of the hardest systems problems in AI. · Owning the design and optimization of inference pipelines for large-scale models. · ...
Lead Engineer, Inference Platform
2 weeks ago
We're looking for a Lead Engineer Inference Platform to join our team building inference platform for embedding models that power semantic search retrieval and AI native features across MongoDB Atlas. · This role is part of broader Search AI Platform team involves close collabora ...
Principal Software Engineer
1 day ago
The Principal AI/ML Software Engineer is responsible for evaluating, integrating, and optimizing cutting-edge technologies for AI/ML infrastructure. · ...
AI Statistics Analyst
3 weeks ago
We are seeking a highly analytical and methodologically rigorous AI Statistical Analyst.This individual will play a critical role in applying statistical inference tools to evaluate, validate, and improve the outputs of machine learning and generative AI models. · ...
Applied AI Statistics Analyst
1 month ago
We are seeking a highly analytical and methodologically rigorous AI Statistical Analyst to apply statistical inference tools to evaluate, validate, and improve the outputs of machine learning and generative AI models. · Develop methodologies for validating probabilistic outputs f ...
Sr Principal AI Software Engineer
5 days ago
The Senior Principal AI/ML Software Engineer is responsible for evaluating, integrating and optimizing cutting-edge technologies for AI/ML infrastructure. This role guides key strategic decisions related to Oracle Cloud's AI infrastructure offerings and spearheads the design and ...
AI Statistical Analyst
1 month ago
We are seeking a highly analytical and methodologically rigorous AI Statistical Analyst.This individual will play a critical role in applying statistical inference tools to evaluate, validate, and improve the outputs of machine learning and generative AI models. · ...
Senior Software Engineer
1 day ago
The Senior AI/ML Software Engineer is responsible for evaluating integrating and optimizing cutting-edge technologies for AI/ML infrastructure focusing on achieving low latency high throughput and efficient resource utilization for both model training and inference at scale. · ...
Software Development Engineer
1 month ago
We combine deep hardware knowledge with ML expertise to push the boundaries of what's possible in AI acceleration.The team works closely with customers on their model enablement, providing direct support and optimization expertise to ensure their machine learning workloads achiev ...
Software Development Engineer
1 month ago
The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium. · The Inference Enablement and Acceleration team fo ...
On-Device ML Infrastructure Engineer
4 weeks ago
+ On-Device Machine Learning team at Apple responsible for enabling Research to Production lifecycle of innovative machine learning models. Critical infrastructure built for onboarding latest machine learning architectures to embedded devices.+ Explore new trends in ML architectu ...
Senior Software Development Engineer
1 month ago
The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators. · This comprehensive toolkit includes an ML compiler, runtime, and applicati ...