Staff + Senior Software Engineer, Cloud Inference - Seattle

Only for registered members Seattle, United States

1 week ago

Default job background
Full time $300,000 - $485,000 (USD)

Job summary

The Cloud Inference team scales and optimizes Claude to serve the massive audiences of developers and enterprise companies across AWS, GCP, Azure, and future cloud service providers (CSPs). We own the end-to-end product of Claude on each cloud platform—from API integration and intelligent request routing to inference execution, capacity management,

Responsibilities

  • Design and build infrastructure that serves Claude across multiple CSPs
    , accounting for differences in compute hardware, networking APIs
    ,
  • Collaborate with CSP partner engineering teams to resolve operational issues influence provider roadmaps stand up end-to-end serving on new cloud platforms

      Lorem ipsum dolor sit amet
      , consectetur adipiscing elit. Nullam tempor vestibulum ex, eget consequat quam pellentesque vel. Etiam congue sed elit nec elementum. Morbi diam metus, rutrum id eleifend ac, porta in lectus. Sed scelerisque a augue et ornare.

      Donec lacinia nisi nec odio ultricies imperdiet.
      Morbi a dolor dignissim, tristique enim et, semper lacus. Morbi laoreet sollicitudin justo eget eleifend. Donec felis augue, accumsan in dapibus a, mattis sed ligula.

      Vestibulum at aliquet erat. Curabitur rhoncus urna vitae quam suscipit
      , at pulvinar turpis lacinia. Mauris magna sem, dignissim finibus fermentum ac, placerat at ex. Pellentesque aliquet, lorem pulvinar mollis ornare, orci turpis fermentum urna, non ullamcorper ligula enim a ante. Duis dolor est, consectetur ut sapien lacinia, tempor condimentum purus.
      Get full access

      Access all high-level positions and get the job of your dreams.



Similar jobs

  • Only for registered members Seattle Full time $405,000 - $485,000 (USD)

    We are seeking an experienced Engineering Manager to lead the Cloud Inference team for Azure. You will lead your team to scale and optimize Claude to serve the massive audiences of developers and enterprise companies using Azure. · ...

  • Only for registered members Seattle, WA

    The Cloud Inference team scales and optimizes Claude to serve the massive audiences of developers and enterprise companies across AWS, GCP, · Azure, · and future cloud service providers (CSPs). We own the end-to-end product of Claude on each cloud platform—from API integration · ...

  • Only for registered members Seattle

    We're working with Annapurna Labs (Part of Amazon) on this exciting opportunity. Join a pioneering team at Annapurna Labs, an AWS company, dedicated to optimizing cutting-edge AI inference technology for cloud-scale machine learning accelerators like Trainium and Inferentia. · ...

  • Only for registered members Seattle

    We're working with Annapurna Labs (U.S.) Inc. on this exciting opportunity.Join Annapurna Labs to lead a team at the forefront of AI inference technology, optimizing large language models for AWS's custom machine learning accelerators. · This role offers the chance to drive innov ...

  • Only for registered members Seattle $198,100 - $342,300 (USD)

    As the Sr. SDM for the Inference Technology Team, you will lead a strong team of managers and engineers to build fundamental inference technology building blocks and libraries to enable AI developers to optimize model for inference on Trainium and Inferentia devices. · ...

  • Only for registered members Seattle $137,000 - $270,000 (USD)

    We're looking for a Lead Engineer Inference Platform to join our team building inference platform for embedding models that power semantic search retrieval and AI native features across MongoDB Atlas. · This role is part of broader Search AI Platform team involves close collabora ...

  • Only for registered members Seattle, WA Remote job

    We're working with Annapurna Labs (Part of Amazon) on this exciting opportunity. · Join a pioneering team at Annapurna Labs an AWS company dedicated to optimizing cutting-edge AI inference technology for cloud-scale machine learning accelerators like Trainium and Inferentia. · Th ...

  • Only for registered members Seattle Full time $101,000 - $198,000 (USD)

    +Build core systems and services for model inference at scale in a hybrid working model based in Seattle+2+ years of experience building backend or infrastructure systems at scale+Strong software engineering skills in languages such as Go, Rust, Python, or C++We're looking for a ...

  • Only for registered members Seattle

    We are looking for talented individuals to join us for an internship in 2026. PhD Internships at ByteDance aim to provide students with the opportunity to actively contribute to our products and research, and to the organization's future plans and emerging technologies. · We are ...

  • Only for registered members Seattle Full time $96,800 - $223,400 (USD)

    The Principal AI/ML Software Engineer is responsible for evaluating, integrating, and optimizing cutting-edge technologies for AI/ML infrastructure. · ...

  • Only for registered members Seattle Full time

    The Senior Principal AI/ML Software Engineer is responsible for evaluating, integrating and optimizing cutting-edge technologies for AI/ML infrastructure. This role guides key strategic decisions related to Oracle Cloud's AI infrastructure offerings and spearheads the design and ...

  • Only for registered members Seattle Full time $79,200 - $178,100 (USD)

    The Senior AI/ML Software Engineer is responsible for evaluating integrating and optimizing cutting-edge technologies for AI/ML infrastructure focusing on achieving low latency high throughput and efficient resource utilization for both model training and inference at scale. · ...

  • Only for registered members Seattle Full time $156,000 - $229,000 (USD)

    At Google, we put our users first. The world is always changing, so we need Product Managers who are continuously adapting and excited to work on products that affect millions of people every day. · ...

  • Only for registered members Seattle $129,960 - $246,240 (USD)

    We are looking for talented individuals to join our team in 2026. As a graduate, you will get opportunities to pursue bold ideas, tackle complex challenges, and unlock limitless growth. · Design and build large-scale, container-based cluster management and orchestration systems w ...

  • Only for registered members Seattle

    The Inference Infrastructure team at ByteDance is looking for talented individuals to join them for an internship in 2026.We aim to provide students with hands-on experience in developing fundamental skills and exploring potential career paths. · Candidates can apply to a maximum ...

  • Only for registered members Seattle, WA

    The Inference Infrastructure team is the creator and open-source maintainer of AIBrix, a Kubernetes-native control plane for large-scale LLM inference. · We are part of ByteDance's Core Compute Infrastructure organization, responsible for designing and operating the platforms tha ...

  • Only for registered members Seattle, WA

    The Inference Infrastructure team is the creator and open-source maintainer of AIBrix, a Kubernetes-native control plane for large-scale LLM inference. · We are expanding our focus on LLM inference infrastructure to support new AI workloads, · We are looking for engineers passion ...

  • Only for registered members Seattle $156,000 - $229,000 (USD)

    The world is always changing, so we need Product Managers who are continuously adapting and excited to work on products that affect millions of people every day. · In this role, you will work cross-functionally to guide products from conception to launch by connecting the technic ...

  • Only for registered members Seattle $166,400 - $287,700 (USD)

    As the SDM for the Neuron Inference Technology building blocks team you will guide your expert AI engineers to build fundamental inference technology building blocks and libraries to enable AI developers to optimize model for inference on Trainium and Inferentia devices. · Manage ...

  • Only for registered members Seattle $171,600 - $258,100 (USD)

    Imagine what we could do together. At Apple, new ideas have a way of becoming extraordinary products, services and customer experiences very quickly. · Bachelor's in CS or related field and/or equivalent experience · n4+ years in software engineering · ...