Site Reliability Engineer, Inference Infrastructure - New York

Only for registered members New York, United States

1 month ago

Default job background

Job summary

We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents.

Our team is responsible for developing, deploying, and operating the AI platform delivering Cohere's large language models through easy to use API endpoints.


Lorem ipsum dolor sit amet
, consectetur adipiscing elit. Nullam tempor vestibulum ex, eget consequat quam pellentesque vel. Etiam congue sed elit nec elementum. Morbi diam metus, rutrum id eleifend ac, porta in lectus. Sed scelerisque a augue et ornare.

Donec lacinia nisi nec odio ultricies imperdiet.
Morbi a dolor dignissim, tristique enim et, semper lacus. Morbi laoreet sollicitudin justo eget eleifend. Donec felis augue, accumsan in dapibus a, mattis sed ligula.

Vestibulum at aliquet erat. Curabitur rhoncus urna vitae quam suscipit
, at pulvinar turpis lacinia. Mauris magna sem, dignissim finibus fermentum ac, placerat at ex. Pellentesque aliquet, lorem pulvinar mollis ornare, orci turpis fermentum urna, non ullamcorper ligula enim a ante. Duis dolor est, consectetur ut sapien lacinia, tempor condimentum purus.
Get full access

Access all high-level positions and get the job of your dreams.



Similar jobs

  • Work in company

    Staff Software Engineer, Inference Infrastructure

    Only for registered members

    Our mission is to scale intelligence to serve humanity. We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. · 5+ years of engineering experi ...

    New York, NY

    1 month ago

  • Work in company

    Site Reliability Engineer, Inference Infrastructure

    Only for registered members

    We obsess over what we build. Each one of us is responsible for contributing to increasing the capabilities of our models and the value they drive for our customers. · ...

    New York, NY

    1 month ago

  • Work in company

    Forward Deployed Engineer

    Only for registered members

    ++About the Job FriendliAI is seeking a Forward Deployed Engineer (FDE) to assist enterprises in deploying scaling and operating generative and agentic AI workloads on FriendliAI infrastructure. · ...

    New York

    3 weeks ago

  • Work in company

    Senior Software Engineer

    Only for registered members

    We're growing quickly and recently raised our $150M Series D, · Design and architect scalable infrastructure systems for our ML inference platform · Lead optimization of Kubernetes deployments for efficient, cost-effective model serving · ...

    New York

    1 month ago

  • Work in company

    Software Engineer

    Only for registered members

    About us FriendliAI is building an AI inference platform · The RoleWe seek Inference Engine Engineer to optimize our core inference engine. · ...

    New York

    3 weeks ago

  • Work in company

    Product Manager

    Only for registered members

    We power mission-critical inference for the world's most dynamic AI companies. · Baseten recently raised $150M Series D backed by investors including BOND and IVP. · Join us to build the platform engineers turn to ship AI products. As Infrastructure Product Manager at Baseten you ...

    New York

    1 month ago

  • Work in company

    Senior Software Engineer

    Only for registered members

    About BasetenBaseten powers mission-critical inference for the world's most dynamic AI companies. We're growing quickly and recently raised our $300M Series E.The ROLEYou'll architect and lead development of our ML inference platform that powers production AI applications. ...

    New York, NY

    2 days ago

  • Work in company

    Product Manager

    Only for registered members

    Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, · Clay and Writer. By uniting applied AI research, flexible infrastructure, · and seamless developer tooling, · ...

    New York, NY

    5 days ago

  • Work in company

    AI Site Reliability Engineer

    Only for registered members

    We are seeking an experienced AI Site Reliability Engineer (AI SRE) to ensure the reliability, scalability, and performance of the Blackdog AI platform. · Own availability, latency, performance, and cost SLAs/SLOs for Blackdog AI services · Design and maintain highly available, f ...

    New York

    1 week ago

  • Work in company

    AI SRE with Blackdog Platform

    Only for registered members

    We are seeking an experienced AI Site Reliability Engineer (AI SRE) to ensure the reliability, scalability, · performance of the Blackdog AI platform. · This role sits at the intersection of machine learning, cloud infrastructure, · and reliability engineering, owning productio ...

    New York

    1 week ago

  • Work in company

    Head of Platform Engineering

    Only for registered members

    We are looking for a Head of Platform Engineering to lead our infrastructure and enable the next generation of AI-driven drug discovery.This role sits at the intersection of cutting-edge AI, large-scale systems, and real-world scientific impact. · ...

    New York

    1 month ago

  • Work in company

    Member of Technical Staff

    Only for registered members

    We are looking for strong backend engineers who love building developer tools used by the largest AI companies in the world. · ...

    New York Full time

    1 month ago

  • Work in company

    Senior AI Site Reliability Engineer

    Only for registered members

    Blackdog is hiring a Senior AI Site Reliability Engineer to lead the reliability, scalability, and performance of our production AI/ML platform. This role is deeply technical and hands‑on, owning end‑to‑end stability for mission‑critical model serving, data pipelines, and GPU‑int ...

    New York

    2 weeks ago

  • Work in company

    AI Systems

    Only for registered members

    We think that's backwards. Our mandate is to build efficient intelligence that evolves in real-time. · ...

    New York, NY

    1 month ago

  • Work in company

    Forward Deployed Engineer

    Only for registered members

    We are seeking a Front Deployed Engineer specializing in API development. · ...

    New York

    3 weeks ago

  • Work in company

    Engineering Manager, Inference Developer Productivity

    Only for registered members

    Anthropic's Inference organization is responsible for serving Claude to millions of users and enterprise customers with the speed, reliability, and efficiency that frontier AI demands. · We offer competitive compensation and benefits, · optional equity donation matching, · genero ...

    New York, NY

    1 week ago

  • Work in company

    Back-End Engineering

    Only for registered members

    We're hiring a Member of Technical Staff (Back-End / Systems) to join a small team building a next-generation platform for AI and LLM systems. · This role is ideal for an engineer who enjoys working close to the metal and wants to build infrastructure that other engineers rely on ...

    New York

    1 month ago

  • Work in company

    Senior Software Engineer

    Only for registered members

    We are growing quickly and recently raised our $300M Series E, · Baseten powers mission-critical inference for the world's most dynamic AI companies, · Join us and help build the platform engineers turn to ship AI products. · We're looking for a Senior Enterprise Platform Enginee ...

    New York $200,000 - $270,000 (USD)

    3 days ago

  • Work in company

    Machine Learning Performance Engineer

    Only for registered members

    Design build and optimize high-performance training and inference pipelines for deep learning workloads. · ...

    New York

    1 month ago

  • Work in company

    Machine Learning Engineer

    Only for registered members

    Job Summary · Join a next-generation investment and technology team in New York City as a Machine Learning Engineer.This firm is building a proprietary AI and data platform that powers an end-to-end investment lifecycle—integrating structured and unstructured data, advanced analy ...

    New York

    4 weeks ago