Software Engineer-AI/ML, AWS Neuron Inference - Seattle
1 week ago

Job description
, consectetur adipiscing elit. Nullam tempor vestibulum ex, eget consequat quam pellentesque vel. Etiam congue sed elit nec elementum. Morbi diam metus, rutrum id eleifend ac, porta in lectus. Sed scelerisque a augue et ornare.
Donec lacinia nisi nec odio ultricies imperdiet.
Morbi a dolor dignissim, tristique enim et, semper lacus. Morbi laoreet sollicitudin justo eget eleifend. Donec felis augue, accumsan in dapibus a, mattis sed ligula.
Vestibulum at aliquet erat. Curabitur rhoncus urna vitae quam suscipit
, at pulvinar turpis lacinia. Mauris magna sem, dignissim finibus fermentum ac, placerat at ex. Pellentesque aliquet, lorem pulvinar mollis ornare, orci turpis fermentum urna, non ullamcorper ligula enim a ante. Duis dolor est, consectetur ut sapien lacinia, tempor condimentum purus.
Access all high-level positions and get the job of your dreams.
Similar jobs
Machine Learning Engineer, AWS Neuron Inference, AWS Neuron
Only for registered members
This role is responsible for development, enablement and performance tuning of a wide variety of ML model families. The ML Apps team works side by side with compiler engineers and runtime engineers to create, build and tune distributed inference solutions with Trn1.This role will ...
1 month ago
This role is for a senior software engineer in the Machine Learning Inference Applications team. The team works side by side with chip architects, compiler engineers and runtime engineers to deliver performance and accuracy on Neuron devices across a range of models such as Llama ...
1 month ago
This role is for a senior software engineer in the Machine Learning Inference Applications team.Adapting latest research in LLM optimization to Neuron chips to extract best performance from both open source as well as internally developed models. · ...
1 month ago
This role is for a senior software engineer in the Machine Learning Inference Applications team. · The team works side by side with chip architects, compiler engineers and runtime engineers to deliver performance and accuracy on Neuron devices across a range of models such as Lla ...
1 week ago
This role is responsible for development and performance optimization of core building blocks of LLM Inference - Attention, MLP, Quantization, Speculative Decoding, · Mixture of Experts.The team works side by side with chip architects, · compiler engineers and runtime engineers t ...
1 week ago
+AWS Neuron is the complete software stack for the AWS Inferentia and Trainium cloud-scale machine learning accelerators. This role is responsible for development and performance optimization of core building blocks of LLM Inference - Attention, MLP, Quantization, Speculative Dec ...
1 week ago
Software Development Manager, AI Inference Technology, Neuron SDK
Only for registered members
As the SDM for the Neuron Inference Technology building blocks team you will guide your expert AI engineers to build fundamental inference technology building blocks and libraries to enable AI developers to optimize model for inference on Trainium and Inferentia devices. · Manage ...
1 month ago
Software Development Manager, AI Inference Technology, Neuron SDK
Only for registered members
We develop AWS Neuron,the complete software stack for Trainium,Aamazon's custom cloudscale machine learning accelerators.Come optimize LLMs such as Llama and GPT OSS to run really fast on Trainium. · As the SDM for the Neuron Inference Technology building blocks team,you will gui ...
1 week ago
Senior Software Development Engineer, AI/ML, AWS Neuron, Model Inference
Only for registered members
The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium. · The team works across multiple technology layers ...
1 month ago
We're working with Annapurna Labs (U.S.) Inc. on this exciting opportunity.Join Annapurna Labs to lead a team at the forefront of AI inference technology, optimizing large language models for AWS's custom machine learning accelerators. · This role offers the chance to drive innov ...
1 month ago
The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators. · This comprehensive toolkit includes an ML compiler, runtime, and applicati ...
1 month ago
The Inference Enablement and Acceleration team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators. · ...
1 month ago
The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium. · The Inference Enablement and Acceleration team fo ...
1 month ago
We combine deep hardware knowledge with ML expertise to push the boundaries of what's possible in AI acceleration.The team works closely with customers on their model enablement, providing direct support and optimization expertise to ensure their machine learning workloads achiev ...
1 month ago
AWS Neuron is the complete software stack for the AWS Inferentia and Trainium cloud-scale machine · learning accelerators and the Trn2 and future Trn3 servers that use them. This role is for a software engineer in the Machine Learning Applications (ML Apps) team for AWS Neuron. · ...
1 day ago
The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron software development kit used to accelerate deep learning. · Design develop and optimize machine learning models frameworks for deployment on custom ML hardware accelerators. · ...
3 weeks ago
The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators. · Design, develop, and optimize machine learning models and frameworks for d ...
3 weeks ago
The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators. · This role offers a unique opportunity to work at the intersection of machi ...
1 week ago
AWS Neuron is hiring a Senior Technical Writer to drive the documentation strategy and AI-based content contribution and automation initiatives for our AWS Neuron SDK. · ResponsibilitiesWork with multiple Neuron component engineering teams to plan content for Neuron releases and ...
1 month ago
The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium. · The Inference Enablement and Acceleration team fo ...
4 weeks ago