Research Engineer, Infrastructure, Inference - San Francisco

Only for registered members San Francisco, United States

4 days ago

Default job background
Thinking Machines Lab's mission is to empower humanity through advancing collaborative general intelligence. We're building a future where everyone has access to the knowledge and tools to make AI work for their unique needs and goals.  · We are scientists, engineers, and builder ...
Lorem ipsum dolor sit amet
, consectetur adipiscing elit. Nullam tempor vestibulum ex, eget consequat quam pellentesque vel. Etiam congue sed elit nec elementum. Morbi diam metus, rutrum id eleifend ac, porta in lectus. Sed scelerisque a augue et ornare.

Donec lacinia nisi nec odio ultricies imperdiet.
Morbi a dolor dignissim, tristique enim et, semper lacus. Morbi laoreet sollicitudin justo eget eleifend. Donec felis augue, accumsan in dapibus a, mattis sed ligula.

Vestibulum at aliquet erat. Curabitur rhoncus urna vitae quam suscipit
, at pulvinar turpis lacinia. Mauris magna sem, dignissim finibus fermentum ac, placerat at ex. Pellentesque aliquet, lorem pulvinar mollis ornare, orci turpis fermentum urna, non ullamcorper ligula enim a ante. Duis dolor est, consectetur ut sapien lacinia, tempor condimentum purus.
Get full access

Access all high-level positions and get the job of your dreams.



Similar jobs

  • Work in company

    Member of Technical Staff, Inference Infrastructure

    Only for registered members

    Inferact is looking for an infrastructure engineer to build the distributed systems that power inference at global scale. · ...

    San Francisco, CA

    1 month ago

  • Work in company

    Site Reliability Engineer, Inference Infrastructure

    Only for registered members

    We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. We believe that our work is instrumental to the widespread adoption of AI. · We obsess o ...

    San Francisco, CA

    1 month ago

  • Work in company

    Staff Software Engineer, Inference Infrastructure

    Only for registered members

    +We are looking for Members of Technical Staff to join the Model Serving team at Cohere. · +5+ years of engineering experience running production infrastructure at a large scale · Experience designing large, highly available distributed systems with Kubernetes, and GPU workloads ...

    San Francisco Full time

    1 month ago

  • Work in company

    Staff Software Engineer, Inference Infrastructure

    Only for registered members

    +We are looking for Members of Technical Staff to join the Model Serving team at Cohere. The team is responsible for developing, deploying, and operating the AI platform delivering Cohere's large language models through easy to use API endpoints. · +Work closely with many teams t ...

    San Francisco

    1 month ago

  • Work in company

    Staff Software Engineer, Inference Infrastructure

    Only for registered members

    +We obsess over what we build. Each one of us is responsible for contributing to increasing the capabilities of our models and the value they drive for our customers. · ...

    San Francisco, CA

    1 month ago

  • Work in company

    Machine Learning Infrastructure Engineer- Model Inference

    Only for registered members

    About Abridge · Abridge was founded in 2018 with the mission of powering deeper understanding in healthcare. Our AI-powered platform was purpose-built for medical conversations, improving clinical documentation efficiencies while enabling clinicians to focus on what matters most— ...

    San Francisco, CA

    7 minutes ago

  • Work in company

    Machine Learning Infrastructure Engineer- Model Inference

    Only for registered members

    About Abridge · Abridge was founded in 2018 with the mission of powering deeper understanding in healthcare. Our AI-powered platform was purpose-built for medical conversations, improving clinical documentation efficiencies while enabling clinicians to focus on what matters most— ...

    San Francisco $221,000 - $260,000 (USD)

    1 day ago

  • Work in company

    Staff Software Engineer, Ads ML Inference Infrastructure

    Only for registered members

    The Ads ML Inference Infra team owns the online inference and feature serving systems that power real-time model scoring and delivery for all Ads models at Pinterest. The team is looking for a staff engineer with strong hands-on experience in large-scale ML inference systems. · L ...

    Palo Alto $208,454 - $364,795 (USD)

    2 weeks ago

  • Work in company

    Staff Software Engineer, Ads ML Inference Infrastructure

    Only for registered members

    The Ads ML Inference Infra team owns the online inference and feature serving systems that power real-time model scoring and delivery for all Ads models at Pinterest. · Lead efforts to build next-generation model inference and feature serving systems. · Design low-latency inferen ...

    Palo Alto $208,454 - $364,795 (USD) Full time

    2 weeks ago

  • Work in company

    Staff Software Engineer, Ads ML Inference Infrastructure

    Only for registered members

    We're on a mission to bring everyone the inspiration to create a life they love, and that starts with the people behind the product. Discover a career where you ignite innovation for millions, transform passion into growth opportunities. · Lead and drive efforts to build next-gen ...

    Palo Alto, CA

    2 weeks ago

  • This position is expected to start May 2026 and continue through summer term (ending approximately August 2026 or later). As a member of the Foundation Inference Infrastructure team you will design & implement backend services and tools that power autonomy software and hardware d ...

    Palo Alto $100,000 - $150,000 (USD) InternshipSHIP

    1 month ago

  • This position is expected to start May 2026 and continue through summer term (ending approximately August 2026 or later, if available). We ask for a minimum of 12 weeks, full-time (40 hours/week) and on-site. · We are looking for a strong candidate who can contribute to our syste ...

    Palo Alto, CA InternshipSHIP

    1 month ago

  • Work in company

    Platform / Inference Optimization Engineer

    Only for registered members

    We build and operate large-scale LLM inference and training infrastructure serving millions of users. · ...

    San Francisco $200,000 - $500,000 (USD)

    2 weeks ago

  • Work in company

    Inference Engineering Manager

    Only for registered members

    We are looking for an Inference Engineering Manager to lead our AI Inference team. · This is a unique opportunity to build and scale the infrastructure that powers Perplexity's products and APIs, · serving millions of users with state-of-the-art AI capabilities. · ...

    San Francisco $300,000 - $385,000 (USD)

    1 month ago

  • Work in company

    Inference Engineering Manager

    Only for registered members

    We are looking for an Inference Engineering Manager to lead our AI Inference team. This is a unique opportunity to build and scale the infrastructure that powers Perplexity's products and APIs, · serving millions of users with state-of-the-art AI capabilities.Lead and grow a high ...

    San Francisco, CA

    1 month ago

  • Work in company

    Engineering Manager

    Only for registered members

    We are looking for an Inference Engineering Manager to lead our AI Inference team.This is a unique opportunity to build and scale the infrastructure that powers Perplexity's products and APIs. · , · You will own the technical direction and execution of our inference systems while ...

    San Francisco $300,000 - $385,000 (USD)

    2 weeks ago

  • Work in company

    Inference Engineering Manager

    Only for registered members

    We are looking for an Inference Engineering Manager to lead our AI Inference team. This is a unique opportunity to build and scale the infrastructure that powers Perplexity's products and APIs serving millions of users with state-of-the-art AI capabilities. · ...

    San Francisco

    1 month ago

  • Work in company

    Inference Engineer

    Only for registered members

    About Us Most AI is frozen in place - it doesn't adapt to the world. We think that's backwards. · ...

    San Francisco, CA

    1 month ago

  • Work in company

    Engineering Manager

    Only for registered members

    About the Role · We are looking for an Inference Engineering Manager to lead our AI Inference team. This is a unique opportunity to build and scale the infrastructure that powers Perplexity's products and APIs, serving millions of users with state-of-the-art AI capabilities. · Yo ...

    San Francisco $140,000 - $230,000 (USD) per year

    1 week ago

  • Work in company

    Inference Engineering Manager

    Only for registered members

    We are looking for an Inference Engineering Manager to lead our AI Inference team. This is a unique opportunity to build and scale the infrastructure that powers Perplexity's products and APIs, serving millions of users with state-of-the-art AI capabilities. · ...

    San Francisco Full time

    1 month ago