Senior Software Development Engineer, AI/ML, AWS Neuron, Model Inference - Seattle, WA

Only for registered members Seattle, WA, United States

3 weeks ago

Default job background

Job summary

The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium.
The team works across multiple technology layers - from frameworks and kernels collaborating with compiler runtime collectives optimizing current performance contributing future architecture designs working closely with customers enable their models ensure optimal performance.
This role offers unique opportunity work intersection machine learning high-performance computing distributed architectures shape future AI acceleration technology.
Lorem ipsum dolor sit amet
, consectetur adipiscing elit. Nullam tempor vestibulum ex, eget consequat quam pellentesque vel. Etiam congue sed elit nec elementum. Morbi diam metus, rutrum id eleifend ac, porta in lectus. Sed scelerisque a augue et ornare.

Donec lacinia nisi nec odio ultricies imperdiet.
Morbi a dolor dignissim, tristique enim et, semper lacus. Morbi laoreet sollicitudin justo eget eleifend. Donec felis augue, accumsan in dapibus a, mattis sed ligula.

Vestibulum at aliquet erat. Curabitur rhoncus urna vitae quam suscipit
, at pulvinar turpis lacinia. Mauris magna sem, dignissim finibus fermentum ac, placerat at ex. Pellentesque aliquet, lorem pulvinar mollis ornare, orci turpis fermentum urna, non ullamcorper ligula enim a ante. Duis dolor est, consectetur ut sapien lacinia, tempor condimentum purus.
Get full access

Access all high-level positions and get the job of your dreams.



Similar jobs

  • Only for registered members Seattle $152,000 - $287,500 (USD)

    NVIDIA is at the forefront of the generative AI revolution The Algorithmic Model Optimization Team specifically focuses on optimizing generative AI models such as large language models (LLM) and diffusion models for maximal inference efficiency using techniques ranging from quant ...

  • Only for registered members Seattle $198,360 - $416,100 (USD)

    About TikTok · TikTok is a leading destination for short-form mobile video where users can make and share creative content in an easier way.We are seeking an experienced Multimodal Model Training Engineer to optimize AI model training pipelines. · ...

  • Only for registered members Redmond Full time $148,000 - $287,500 (USD)

    +NVIDIA is at the forefront of the generative AI revolution The Algorithmic Model Optimization Team specifically focuses on optimizing generative AI models such as large language models (LLM) and diffusion models for maximal inference efficiency using techniques ranging from quan ...

  • Only for registered members Seattle

    We're working with Annapurna Labs (Part of Amazon) on this exciting opportunity. Join a pioneering team at Annapurna Labs, an AWS company, dedicated to optimizing cutting-edge AI inference technology for cloud-scale machine learning accelerators like Trainium and Inferentia. · ...

  • Only for registered members Seattle

    We're working with Annapurna Labs (U.S.) Inc. on this exciting opportunity.Join Annapurna Labs to lead a team at the forefront of AI inference technology, optimizing large language models for AWS's custom machine learning accelerators. · This role offers the chance to drive innov ...

  • Only for registered members Seattle $151,800 - $332,200 (USD)

    We are looking for an AI Inference Engineer with a solid background in speech recognition and model inference to develop state-of-the-art automatic speech recognition system and ship it to various Zoom products. · ...

  • Only for registered members Seattle $151,800 - $332,200 (USD)

    We are looking for an AI Inference Engineer with a solid background in speech recognition and model inference. This role will include collaboration with cross-functional teams to deliver high-impact projects from the ground up.About The Team Zoom's AI Speech Team is developing sp ...

  • Only for registered members Seattle, WA Remote job

    We're working with Annapurna Labs (Part of Amazon) on this exciting opportunity. · Join a pioneering team at Annapurna Labs an AWS company dedicated to optimizing cutting-edge AI inference technology for cloud-scale machine learning accelerators like Trainium and Inferentia. · Th ...

  • Only for registered members Seattle

    We're looking for a Backend Engineer who thrives on solving some of the hardest systems problems in AI. · Owning the design and optimization of inference pipelines for large-scale models. · ...

  • Only for registered members Seattle $137,000 - $270,000 (USD)

    We're looking for a Lead Engineer Inference Platform to join our team building inference platform for embedding models that power semantic search retrieval and AI native features across MongoDB Atlas. · This role is part of broader Search AI Platform team involves close collabora ...

  • Only for registered members Seattle Full time $96,800 - $223,400 (USD)

    The Principal AI/ML Software Engineer is responsible for evaluating, integrating, and optimizing cutting-edge technologies for AI/ML infrastructure. · ...

  • Only for registered members Seattle

    We are seeking a highly analytical and methodologically rigorous AI Statistical Analyst.This individual will play a critical role in applying statistical inference tools to evaluate, validate, and improve the outputs of machine learning and generative AI models. · ...

  • Only for registered members Seattle

    We are seeking a highly analytical and methodologically rigorous AI Statistical Analyst to apply statistical inference tools to evaluate, validate, and improve the outputs of machine learning and generative AI models. · Develop methodologies for validating probabilistic outputs f ...

  • Only for registered members Seattle Full time

    The Senior Principal AI/ML Software Engineer is responsible for evaluating, integrating and optimizing cutting-edge technologies for AI/ML infrastructure. This role guides key strategic decisions related to Oracle Cloud's AI infrastructure offerings and spearheads the design and ...

  • Only for registered members Seattle

    We are seeking a highly analytical and methodologically rigorous AI Statistical Analyst.This individual will play a critical role in applying statistical inference tools to evaluate, validate, and improve the outputs of machine learning and generative AI models. · ...

  • Only for registered members Seattle Full time $79,200 - $178,100 (USD)

    The Senior AI/ML Software Engineer is responsible for evaluating integrating and optimizing cutting-edge technologies for AI/ML infrastructure focusing on achieving low latency high throughput and efficient resource utilization for both model training and inference at scale. · ...

  • Only for registered members Seattle Full time $129,300 - $223,600 (USD)

    We combine deep hardware knowledge with ML expertise to push the boundaries of what's possible in AI acceleration.The team works closely with customers on their model enablement, providing direct support and optimization expertise to ensure their machine learning workloads achiev ...

  • Only for registered members Seattle $129,300 - $223,600 (USD)

    The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium. · The Inference Enablement and Acceleration team fo ...

  • Only for registered members Seattle $139,500 - $258,100 (USD)

    + On-Device Machine Learning team at Apple responsible for enabling Research to Production lifecycle of innovative machine learning models. Critical infrastructure built for onboarding latest machine learning architectures to embedded devices.+ Explore new trends in ML architectu ...

  • Only for registered members Seattle $151,300 - $261,500 (USD)

    The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators. · This comprehensive toolkit includes an ML compiler, runtime, and applicati ...