Staff + Senior Software Engineer, Inference - New York, NY
1 week ago

About Anthropic
Anthropics mission is to create reliable interpretable and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole.
Responsibilities
- Designing intelligent routing algorithms that optimize request distribution across thousands of accelerators
- Autoscaling our compute fleet to dynamically match supply with demand across production research and experimental workloads
- Building production-grade deployment pipelines for releasing new models to millions of users
Benefits
We believe that the highest-impact AI research will be big science At Anthropic we work as a single cohesive team on just a few large-scale research efforts And we value impact advancing our long-term goals of steerable trustworthy AI rather than work on smaller more specific puzzles.
We are an extremely collaborative group And we host frequent research discussions To ensure that we are pursuing the highest-impact work at any given time As such We greatly value communication skills.
Job description
, consectetur adipiscing elit. Nullam tempor vestibulum ex, eget consequat quam pellentesque vel. Etiam congue sed elit nec elementum. Morbi diam metus, rutrum id eleifend ac, porta in lectus. Sed scelerisque a augue et ornare.
Donec lacinia nisi nec odio ultricies imperdiet.
Morbi a dolor dignissim, tristique enim et, semper lacus. Morbi laoreet sollicitudin justo eget eleifend. Donec felis augue, accumsan in dapibus a, mattis sed ligula.
Vestibulum at aliquet erat. Curabitur rhoncus urna vitae quam suscipit
, at pulvinar turpis lacinia. Mauris magna sem, dignissim finibus fermentum ac, placerat at ex. Pellentesque aliquet, lorem pulvinar mollis ornare, orci turpis fermentum urna, non ullamcorper ligula enim a ante. Duis dolor est, consectetur ut sapien lacinia, tempor condimentum purus.
Access all high-level positions and get the job of your dreams.
Similar jobs
We're growing quickly and recently raised our $150M Series D, backed by investors including BOND, IVP, Spark Capital, Greylock, · and Conviction. · ...
1 month ago
We enable companies operating at the frontier of AI to bring cutting-edge models into production. · ...
5 days ago
Anthropic's Inference organization is responsible for serving Claude to millions of users and enterprise customers with the speed, reliability, and efficiency that frontier AI demands. · We offer competitive compensation and benefits, · optional equity donation matching, · genero ...
1 week ago
We are training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. · ...
6 days ago
Join us on our mission and shape the future as an Audio Inference Engineer for Model Efficiency. · <li>Work on advancing core audio model serving metrics, including latency, throughput, and quality by diving deep into our systems,<br/></li><li>Collaborate closely with both the tr ...
2 weeks ago
We are seeking an experienced Senior ML Inference Engineer to join our team focusing on optimizing and deploying our production virtual staining models at scale.We will work on critical challenges such as reducing inference latency for whole slide imaging (WSI) from tens of minut ...
1 week ago
We want AI to be safe and beneficial for our users and for society as a whole.About The RoleOur Inference team is responsible for building and maintaining the critical systems that serve Claude to millions of users worldwide.You May Be a Good Fit If You · Pick up slack, even if i ...
1 week ago
The vLLM and LLM-D Engineering team at Red Hat is looking for a customer obsessed developer to join our team as a Forward Deployed Engineer. · Orchestrate Distributed Inference: Deploy and configure LLM-D and vLLM on Kubernetes clusters. · Optimize for Production: Go beyond stand ...
2 days ago
We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. · Our team is responsible for developing, deploying, and operating the AI platform deliv ...
1 month ago
We want AI to be safe and beneficial for our users and for society as a whole. · About The RoleThe team has a dual mandate: maximizing compute efficiency to serve our explosive customer growth, · while enabling breakthrough research by giving our scientists the high-performance i ...
1 month ago
Our mission is to scale intelligence to serve humanity. We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. · 5+ years of engineering experi ...
1 month ago
We obsess over what we build. Each one of us is responsible for contributing to increasing the capabilities of our models and the value they drive for our customers. · ...
1 month ago
We are training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. · Improve the platform's auth, billing, and payment systems · Add new features to the i ...
1 month ago
The vLLM and LLM-D Engineering team at Red Hat is looking for a customer obsessed developer to join our team as a Forward Deployed Engineer. · In this role, you will not just build software; you will be the bridge between our cutting-edge inference platform (LLM-D, and vLLM) and ...
3 weeks ago
About us FriendliAI is building an AI inference platform · The RoleWe seek Inference Engine Engineer to optimize our core inference engine. · ...
3 weeks ago
++About the Job FriendliAI is seeking a Forward Deployed Engineer (FDE) to assist enterprises in deploying scaling and operating generative and agentic AI workloads on FriendliAI infrastructure. · ...
3 weeks ago
We power mission-critical inference for the world's most dynamic AI companies. · Baseten recently raised $150M Series D backed by investors including BOND and IVP. · Join us to build the platform engineers turn to ship AI products. As Infrastructure Product Manager at Baseten you ...
1 month ago
We're growing quickly and recently raised our $150M Series D, · Design and architect scalable infrastructure systems for our ML inference platform · Lead optimization of Kubernetes deployments for efficient, cost-effective model serving · ...
1 month ago
Blackdog is hiring a Senior AI Site Reliability Engineer to lead the reliability, scalability, and performance of our production AI/ML platform. This role is deeply technical and hands‑on, owning end‑to‑end stability for mission‑critical model serving, data pipelines, and GPU‑int ...
2 weeks ago
We are looking for a well-rounded AI/ML Systems Engineer to join our team and build LM Studio with us. · Build and maintain world-class on-device inference engines for LLMs and other models. · ...
1 month ago