Site Reliability Engineer, Inference Infrastructure - New York
1 month ago

Job summary
We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents.Our team is responsible for developing, deploying, and operating the AI platform delivering Cohere's large language models through easy to use API endpoints.
Job description
, consectetur adipiscing elit. Nullam tempor vestibulum ex, eget consequat quam pellentesque vel. Etiam congue sed elit nec elementum. Morbi diam metus, rutrum id eleifend ac, porta in lectus. Sed scelerisque a augue et ornare.
Donec lacinia nisi nec odio ultricies imperdiet.
Morbi a dolor dignissim, tristique enim et, semper lacus. Morbi laoreet sollicitudin justo eget eleifend. Donec felis augue, accumsan in dapibus a, mattis sed ligula.
Vestibulum at aliquet erat. Curabitur rhoncus urna vitae quam suscipit
, at pulvinar turpis lacinia. Mauris magna sem, dignissim finibus fermentum ac, placerat at ex. Pellentesque aliquet, lorem pulvinar mollis ornare, orci turpis fermentum urna, non ullamcorper ligula enim a ante. Duis dolor est, consectetur ut sapien lacinia, tempor condimentum purus.
Access all high-level positions and get the job of your dreams.
Similar jobs
Our mission is to scale intelligence to serve humanity. We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. · 5+ years of engineering experi ...
1 month ago
We obsess over what we build. Each one of us is responsible for contributing to increasing the capabilities of our models and the value they drive for our customers. · ...
1 month ago
++About the Job FriendliAI is seeking a Forward Deployed Engineer (FDE) to assist enterprises in deploying scaling and operating generative and agentic AI workloads on FriendliAI infrastructure. · ...
3 weeks ago
We're growing quickly and recently raised our $150M Series D, · Design and architect scalable infrastructure systems for our ML inference platform · Lead optimization of Kubernetes deployments for efficient, cost-effective model serving · ...
1 month ago
About us FriendliAI is building an AI inference platform · The RoleWe seek Inference Engine Engineer to optimize our core inference engine. · ...
3 weeks ago
We power mission-critical inference for the world's most dynamic AI companies. · Baseten recently raised $150M Series D backed by investors including BOND and IVP. · Join us to build the platform engineers turn to ship AI products. As Infrastructure Product Manager at Baseten you ...
1 month ago
About BasetenBaseten powers mission-critical inference for the world's most dynamic AI companies. We're growing quickly and recently raised our $300M Series E.The ROLEYou'll architect and lead development of our ML inference platform that powers production AI applications. ...
2 days ago
Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, · Clay and Writer. By uniting applied AI research, flexible infrastructure, · and seamless developer tooling, · ...
5 days ago
We are seeking an experienced AI Site Reliability Engineer (AI SRE) to ensure the reliability, scalability, and performance of the Blackdog AI platform. · Own availability, latency, performance, and cost SLAs/SLOs for Blackdog AI services · Design and maintain highly available, f ...
1 week ago
We are seeking an experienced AI Site Reliability Engineer (AI SRE) to ensure the reliability, scalability, · performance of the Blackdog AI platform. · This role sits at the intersection of machine learning, cloud infrastructure, · and reliability engineering, owning productio ...
1 week ago
We are looking for a Head of Platform Engineering to lead our infrastructure and enable the next generation of AI-driven drug discovery.This role sits at the intersection of cutting-edge AI, large-scale systems, and real-world scientific impact. · ...
1 month ago
We are looking for strong backend engineers who love building developer tools used by the largest AI companies in the world. · ...
1 month ago
Blackdog is hiring a Senior AI Site Reliability Engineer to lead the reliability, scalability, and performance of our production AI/ML platform. This role is deeply technical and hands‑on, owning end‑to‑end stability for mission‑critical model serving, data pipelines, and GPU‑int ...
2 weeks ago
We think that's backwards. Our mandate is to build efficient intelligence that evolves in real-time. · ...
1 month ago
We are seeking a Front Deployed Engineer specializing in API development. · ...
3 weeks ago
Anthropic's Inference organization is responsible for serving Claude to millions of users and enterprise customers with the speed, reliability, and efficiency that frontier AI demands. · We offer competitive compensation and benefits, · optional equity donation matching, · genero ...
1 week ago
We're hiring a Member of Technical Staff (Back-End / Systems) to join a small team building a next-generation platform for AI and LLM systems. · This role is ideal for an engineer who enjoys working close to the metal and wants to build infrastructure that other engineers rely on ...
1 month ago
We are growing quickly and recently raised our $300M Series E, · Baseten powers mission-critical inference for the world's most dynamic AI companies, · Join us and help build the platform engineers turn to ship AI products. · We're looking for a Senior Enterprise Platform Enginee ...
3 days ago
Design build and optimize high-performance training and inference pipelines for deep learning workloads. · ...
1 month ago
Job Summary · Join a next-generation investment and technology team in New York City as a Machine Learning Engineer.This firm is building a proprietary AI and data platform that powers an end-to-end investment lifecycle—integrating structured and unstructured data, advanced analy ...
4 weeks ago