
    Sr. Machine Learning Engineer, Annapurna ML - Cupertino, United States - Annapurna Labs (U.S.) Inc.

    Full time
    Description

    The Annapurna ML pathfinding team is a new function within the Annapurna ML go-to-market org that helps customers accelerate their adoption of Annapurna ML products, including AWS Trainium and AWS Inferentia.

    The team offers hands-on data science and coding services to our most strategic customers, helping them launch their training and inference workloads on AWS purpose-built ML silicon offerings.

    Key job responsibilities

    In this customer-facing role, you will be responsible for helping our most strategic customers port their models to the AWS Trainium & Inferentia platforms by delivering high-quality code and customizations to make the models functional and performant.

    You will use the various Neuron SDK libraries, provide feedback on them, and help prototype and develop new features based on the latest research findings and customer requests.

    A day in the life
    You will be required to assist our most strategic customers in porting their models to AWS Trainium and Inferentia.


    You will work directly with customer data scientists and ML engineering teams and write code to make their models performant on AWS purpose-built silicon solutions.

    This may require low-level coding in C++ and writing custom kernels to get the best performance possible.

    You will also be responsible for porting the latest open-source models to AWS Trainium/Inferentia, and you will contribute to popular open-source projects to help add support for these platforms.

    The role requires close collaboration with the Neuron engineering team to help drive the Neuron product roadmap and to give feedback on improving product quality.

    About the team

    Our team's mission is to provide the fastest, most cost-effective, and most user-friendly place to train and deploy Generative AI workloads in the cloud.

    The team provides white-glove service to our most strategic customers to implement their models for both training and inference using the Neuron SDK and its associated libraries and APIs.

    We are open to hiring candidates to work out of one of the following locations:

    Cupertino, CA, USA

    BASIC QUALIFICATIONS

    • 5+ years of non-internship professional software development experience
    • 5+ years of programming using a modern programming language such as Java, C++, or C#, including object-oriented design experience
    • 5+ years of experience leading design or architecture (design patterns, reliability, and scaling) of new and existing systems
    • 5+ years of experience with the full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations
    • Experience as a mentor, tech lead or leading an engineering team
    • 3+ years of experience writing code to train and/or deploy deep learning models in PyTorch

    PREFERRED QUALIFICATIONS

    • Bachelor's degree in computer science or equivalent
    • Experience deploying Generative AI applications with large language or vision models into production.

