Staff + Senior Software Engineer, Inference - New York, NY

New York, NY, United States

1 week ago

Job summary

About Anthropic
Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole.

Responsibilities

Designing intelligent routing algorithms that optimize request distribution across thousands of accelerators
Autoscaling our compute fleet to dynamically match supply with demand across production, research, and experimental workloads
Building production-grade deployment pipelines for releasing new models to millions of users

Similar jobs

Work in company

Applied AI Inference Engineer

We're growing quickly and recently raised our $150M Series D, backed by investors including BOND, IVP, Spark Capital, Greylock, and Conviction. · ...

New York

1 month ago

Work in company

Applied AI Inference Engineer

We enable companies operating at the frontier of AI to bring cutting-edge models into production. · ...

New York, NY

5 days ago

Work in company

Engineering Manager, Inference Developer Productivity

Anthropic's Inference organization is responsible for serving Claude to millions of users and enterprise customers with the speed, reliability, and efficiency that frontier AI demands. · We offer competitive compensation and benefits, optional equity donation matching, genero ...

New York, NY

1 week ago

Work in company

Audio Inference Engineer, Model Efficiency

We are training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. · ...

New York

6 days ago

Work in company Remote job

Audio Inference Engineer, Model Efficiency

Join us on our mission and shape the future as an Audio Inference Engineer for Model Efficiency. · Work on advancing core audio model serving metrics, including latency, throughput, and quality by diving deep into our systems, · Collaborate closely with both the tr ...

New York, NY

2 weeks ago

Work in company Remote job

Sr. ML Inference Engineer

We are seeking an experienced Senior ML Inference Engineer to join our team focusing on optimizing and deploying our production virtual staining models at scale. We will work on critical challenges such as reducing inference latency for whole slide imaging (WSI) from tens of minut ...

New York, NY

1 week ago

Work in company

Staff + Senior Software Engineer, Inference

We want AI to be safe and beneficial for our users and for society as a whole. · About The Role · Our Inference team is responsible for building and maintaining the critical systems that serve Claude to millions of users worldwide. · You May Be a Good Fit If You · Pick up slack, even if i ...

New York $300,000 - $485,000 (USD)

1 week ago

Work in company Remote job

Forward Deployed Engineer, AI Inference

The vLLM and LLM-D Engineering team at Red Hat is looking for a customer obsessed developer to join our team as a Forward Deployed Engineer. · Orchestrate Distributed Inference: Deploy and configure LLM-D and vLLM on Kubernetes clusters. · Optimize for Production: Go beyond stand ...

New York

2 days ago

Work in company

Site Reliability Engineer, Inference Infrastructure

We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. · Our team is responsible for developing, deploying, and operating the AI platform deliv ...

New York

1 month ago

Work in company

Senior/Staff Software Engineer, Inference

We want AI to be safe and beneficial for our users and for society as a whole. · About The Role · The team has a dual mandate: maximizing compute efficiency to serve our explosive customer growth, while enabling breakthrough research by giving our scientists the high-performance i ...

New York $300,000 - $485,000 (USD)

1 month ago

Work in company

Staff Software Engineer, Inference Infrastructure

Our mission is to scale intelligence to serve humanity. We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. · 5+ years of engineering experi ...

New York, NY

1 month ago

Work in company

Site Reliability Engineer, Inference Infrastructure

We obsess over what we build. Each one of us is responsible for contributing to increasing the capabilities of our models and the value they drive for our customers. · ...

New York, NY

1 month ago

Work in company

Full-Stack Software Engineer, Inference

We are training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. · Improve the platform's auth, billing, and payment systems · Add new features to the i ...

New York, NY

1 month ago

Work in company

Forward Deployed Engineer, AI Inference

The vLLM and LLM-D Engineering team at Red Hat is looking for a customer obsessed developer to join our team as a Forward Deployed Engineer. · In this role, you will not just build software; you will be the bridge between our cutting-edge inference platform (LLM-D, and vLLM) and ...

New York $193,390 - $318,980 (USD)

3 weeks ago

Work in company

Software Engineer

About us · FriendliAI is building an AI inference platform. · The Role · We seek an Inference Engine Engineer to optimize our core inference engine. · ...

New York

3 weeks ago

Work in company

Forward Deployed Engineer

About the Job · FriendliAI is seeking a Forward Deployed Engineer (FDE) to assist enterprises in deploying, scaling, and operating generative and agentic AI workloads on FriendliAI infrastructure. · ...

New York

3 weeks ago

Work in company

Product Manager

We power mission-critical inference for the world's most dynamic AI companies. · Baseten recently raised $150M Series D backed by investors including BOND and IVP. · Join us to build the platform engineers turn to ship AI products. As Infrastructure Product Manager at Baseten you ...

New York

1 month ago

Work in company

Senior Software Engineer

We're growing quickly and recently raised our $150M Series D, · Design and architect scalable infrastructure systems for our ML inference platform · Lead optimization of Kubernetes deployments for efficient, cost-effective model serving · ...

New York

1 month ago

Work in company

Senior AI Site Reliability Engineer

Blackdog is hiring a Senior AI Site Reliability Engineer to lead the reliability, scalability, and performance of our production AI/ML platform. This role is deeply technical and hands‑on, owning end‑to‑end stability for mission‑critical model serving, data pipelines, and GPU‑int ...

New York

2 weeks ago

Work in company

Software Engineer, AI/ML Systems

We are looking for a well-rounded AI/ML Systems Engineer to join our team and build LM Studio with us. · Build and maintain world-class on-device inference engines for LLMs and other models. · ...

New York Full time

1 month ago
