Staff + Senior Software Engineer, Inference - New York, NY

Only for registered members New York, NY, United States

1 week ago

Default job background
+Job summary

About Anthropic
Anthropics mission is to create reliable interpretable and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole.

+

Responsibilities

  • Designing intelligent routing algorithms that optimize request distribution across thousands of accelerators
  • Autoscaling our compute fleet to dynamically match supply with demand across production research and experimental workloads
  • Building production-grade deployment pipelines for releasing new models to millions of users
+

Benefits


    We believe that the highest-impact AI research will be big science At Anthropic we work as a single cohesive team on just a few large-scale research efforts And we value impact advancing our long-term goals of steerable trustworthy AI rather than work on smaller more specific puzzles.
    We are an extremely collaborative group And we host frequent research discussions To ensure that we are pursuing the highest-impact work at any given time As such We greatly value communication skills.


    Lorem ipsum dolor sit amet
    , consectetur adipiscing elit. Nullam tempor vestibulum ex, eget consequat quam pellentesque vel. Etiam congue sed elit nec elementum. Morbi diam metus, rutrum id eleifend ac, porta in lectus. Sed scelerisque a augue et ornare.

    Donec lacinia nisi nec odio ultricies imperdiet.
    Morbi a dolor dignissim, tristique enim et, semper lacus. Morbi laoreet sollicitudin justo eget eleifend. Donec felis augue, accumsan in dapibus a, mattis sed ligula.

    Vestibulum at aliquet erat. Curabitur rhoncus urna vitae quam suscipit
    , at pulvinar turpis lacinia. Mauris magna sem, dignissim finibus fermentum ac, placerat at ex. Pellentesque aliquet, lorem pulvinar mollis ornare, orci turpis fermentum urna, non ullamcorper ligula enim a ante. Duis dolor est, consectetur ut sapien lacinia, tempor condimentum purus.
    Get full access

    Access all high-level positions and get the job of your dreams.



Similar jobs

  • Work in company

    Applied AI Inference Engineer

    Only for registered members

    We're growing quickly and recently raised our $150M Series D, backed by investors including BOND, IVP, Spark Capital, Greylock, · and Conviction. · ...

    New York

    1 month ago

  • Work in company

    Applied AI Inference Engineer

    Only for registered members

    We enable companies operating at the frontier of AI to bring cutting-edge models into production. · ...

    New York, NY

    5 days ago

  • Work in company

    Engineering Manager, Inference Developer Productivity

    Only for registered members

    Anthropic's Inference organization is responsible for serving Claude to millions of users and enterprise customers with the speed, reliability, and efficiency that frontier AI demands. · We offer competitive compensation and benefits, · optional equity donation matching, · genero ...

    New York, NY

    1 week ago

  • Work in company

    Audio Inference Engineer, Model Efficiency

    Only for registered members

    We are training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. · ...

    New York

    6 days ago

  • Work in company Remote job

    Audio Inference Engineer, Model Efficiency

    Only for registered members

    Join us on our mission and shape the future as an Audio Inference Engineer for Model Efficiency. · <li>Work on advancing core audio model serving metrics, including latency, throughput, and quality by diving deep into our systems,<br/></li><li>Collaborate closely with both the tr ...

    New York, NY

    2 weeks ago

  • Work in company Remote job

    Sr. ML Inference Engineer

    Only for registered members

    We are seeking an experienced Senior ML Inference Engineer to join our team focusing on optimizing and deploying our production virtual staining models at scale.We will work on critical challenges such as reducing inference latency for whole slide imaging (WSI) from tens of minut ...

    New York, NY

    1 week ago

  • Work in company

    Staff + Senior Software Engineer, Inference

    Only for registered members

    We want AI to be safe and beneficial for our users and for society as a whole.About The RoleOur Inference team is responsible for building and maintaining the critical systems that serve Claude to millions of users worldwide.You May Be a Good Fit If You · Pick up slack, even if i ...

    New York $300,000 - $485,000 (USD)

    1 week ago

  • Work in company Remote job

    Forward Deployed Engineer, AI Inference

    Only for registered members

    The vLLM and LLM-D Engineering team at Red Hat is looking for a customer obsessed developer to join our team as a Forward Deployed Engineer. · Orchestrate Distributed Inference: Deploy and configure LLM-D and vLLM on Kubernetes clusters. · Optimize for Production: Go beyond stand ...

    New York

    2 days ago

  • Work in company

    Site Reliability Engineer, Inference Infrastructure

    Only for registered members

    We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. · Our team is responsible for developing, deploying, and operating the AI platform deliv ...

    New York

    1 month ago

  • Work in company

    Senior/Staff Software Engineer, Inference

    Only for registered members

    We want AI to be safe and beneficial for our users and for society as a whole. · About The RoleThe team has a dual mandate: maximizing compute efficiency to serve our explosive customer growth, · while enabling breakthrough research by giving our scientists the high-performance i ...

    New York $300,000 - $485,000 (USD)

    1 month ago

  • Work in company

    Staff Software Engineer, Inference Infrastructure

    Only for registered members

    Our mission is to scale intelligence to serve humanity. We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. · 5+ years of engineering experi ...

    New York, NY

    1 month ago

  • Work in company

    Site Reliability Engineer, Inference Infrastructure

    Only for registered members

    We obsess over what we build. Each one of us is responsible for contributing to increasing the capabilities of our models and the value they drive for our customers. · ...

    New York, NY

    1 month ago

  • Work in company

    Full-Stack Software Engineer, Inference

    Only for registered members

    We are training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. · Improve the platform's auth, billing, and payment systems · Add new features to the i ...

    New York, NY

    1 month ago

  • Work in company

    Forward Deployed Engineer, AI Inference

    Only for registered members

    The vLLM and LLM-D Engineering team at Red Hat is looking for a customer obsessed developer to join our team as a Forward Deployed Engineer. · In this role, you will not just build software; you will be the bridge between our cutting-edge inference platform (LLM-D, and vLLM) and ...

    New York $193,390 - $318,980 (USD)

    3 weeks ago

  • Work in company

    Software Engineer

    Only for registered members

    About us FriendliAI is building an AI inference platform · The RoleWe seek Inference Engine Engineer to optimize our core inference engine. · ...

    New York

    3 weeks ago

  • Work in company

    Forward Deployed Engineer

    Only for registered members

    ++About the Job FriendliAI is seeking a Forward Deployed Engineer (FDE) to assist enterprises in deploying scaling and operating generative and agentic AI workloads on FriendliAI infrastructure. · ...

    New York

    3 weeks ago

  • Work in company

    Product Manager

    Only for registered members

    We power mission-critical inference for the world's most dynamic AI companies. · Baseten recently raised $150M Series D backed by investors including BOND and IVP. · Join us to build the platform engineers turn to ship AI products. As Infrastructure Product Manager at Baseten you ...

    New York

    1 month ago

  • Work in company

    Senior Software Engineer

    Only for registered members

    We're growing quickly and recently raised our $150M Series D, · Design and architect scalable infrastructure systems for our ML inference platform · Lead optimization of Kubernetes deployments for efficient, cost-effective model serving · ...

    New York

    1 month ago

  • Work in company

    Senior AI Site Reliability Engineer

    Only for registered members

    Blackdog is hiring a Senior AI Site Reliability Engineer to lead the reliability, scalability, and performance of our production AI/ML platform. This role is deeply technical and hands‑on, owning end‑to‑end stability for mission‑critical model serving, data pipelines, and GPU‑int ...

    New York

    2 weeks ago

  • Work in company

    Software Engineer, AI/ML Systems

    Only for registered members

    We are looking for a well-rounded AI/ML Systems Engineer to join our team and build LM Studio with us. · Build and maintain world-class on-device inference engines for LLMs and other models. · ...

    New York Full time

    1 month ago