Senior Software Engineer, AI Inference Platform - Sunnyvale

Sunnyvale, United States

2 days ago

Job summary

We are seeking a talented Platform Software Engineer to join the team building the Cerebras Inference Platform.

You will be instrumental in designing, developing, and operating the core backend services and APIs that power the inference platform.

  • The core APIs for the inference platform, handling model catalog management, deployment of ML workloads, scaling, and status monitoring.
  • Self-service access to inference model serving.


Similar jobs

  • Engineering Manager, Inference Platform

    We're looking for a deeply technical, hands-on engineering leader for our Inference Service Platform. · You will lead a high performing team to tackle a critical challenge: scaling LLM inference on Cerebras' advanced compute clusters and delivering a world-class, on-prem solution ...

    Sunnyvale

    1 week ago

  • Staff ML Engineer, Inference Platform

    This role is categorized as hybrid. This means the successful candidate is expected to report to the Sunnyvale Technical Center, CA at least three times per week at minimum, or at another frequency dictated by the business. · About GM · Our vision is a world with Zero Crashes, Zero Emis ...

    Sunnyvale, CA

    3 weeks ago

  • Platform Engineer, AI Inference

    We're on a mission to revolutionize AI infrastructure by systematically rebuilding the AI software stack from the ground up. We are looking for a Platform Engineer to own and improve the systems that support our Cloud Inference products. · The estimated base salary range for this ...

    Los Altos $198,000 - $286,000 (USD)

    1 month ago

  • Platform Engineer, AI Inference

    We are looking for a Platform Engineer to own and improve the systems that support our Cloud Inference products. · 5+ years of relevant experience · Experience maintaining Kubernetes clusters and applications · ...

    Los Altos, CA

    1 month ago

  • Product Manager – AI Inference Platform

    We are looking for a strategic Product Manager to lead the product planning, competitive benchmarking, and PLG (Product-Led Growth) conversion for our AI Inference Engine. · ...

    Mountain View, CA

    1 month ago

  • Lead Engineer, Inference Platform

    We're looking for a Lead Engineer, Inference Platform to join our team building the inference platform for embedding models that power semantic search retrieval and AI-native features across MongoDB Atlas. · This role is part of the broader Search and AI Platform team and involve ...

    Palo Alto $137,000 - $270,000 (USD)

    1 month ago

  • Product Manager – AI Inference Platform

    We are looking for a strategic Product Manager to lead the product planning and growth for our AI Inference Engine. You will directly influence the platform's product direction and commercial success. · Conduct end-to-end commercial and market research. · Define and iterate the pl ...

    Mountain View

    1 month ago

  • Director Product Manager – Inference Platforms

    We're seeking a Director Product Manager to manage and mentor a team of product managers responsible for the strategy, roadmap, and execution of AI and ML developer tools that power inference at scale on custom GPUs and heterogeneous hardware platforms. This is a high-impact role ...

    Santa Clara, CA

    1 month ago

  • Director Product Manager – Inference Platforms

    We're seeking a Director Product Manager to manage and mentor a team of product managers responsible for the strategy, roadmap, and execution of AI and ML developer tools that power inference at scale on custom GPUs and heterogeneous hardware platforms. · The Role · This is a high-i ...

    Santa Clara

    1 month ago

  • Senior Software Engineer – Inference Platform Infrastructure

    NVIDIA is seeking a Sr. Software Engineer – Inference Platform Infrastructure to build and automate the foundations for NVIDIA's inference services. · ...

    Santa Clara $152,000 - $287,500 (USD) Full time

    15 hours ago

  • Senior Software Engineer, Inference Platform

    We're looking for a Senior Engineer to help build the next-generation inference platform that supports embedding models used for semantic search retrieval and AI-native experiences in MongoDB Atlas. · You'll join the broader Search and AI Platform organization, collaborate with ML ...

    Palo Alto, CA

    1 month ago

  • Senior / Principal Inference Engineer - ML Platform

    We're building the tools and platform that empower our community to bring any experience that they can imagine to life. We're on a mission to connect a billion people with optimism and civility. · Tens of millions of people come to Roblox daily. · Solving unique technical challeng ...

    San Mateo, CA

    1 month ago

  • Product Manager MBA Intern, AI Platform Inference - Summer 2026

    As NVIDIA Product Managers, our goal is to enable developers to be successful on the NVIDIA Platform and push the boundaries of what is possible with their AI deployments. For Inference, we are the champions inside NVIDIA for AI developers looking to accelerate their deployments on ...

    Santa Clara $27 - $82 (USD) Internship

    4 weeks ago

  • Senior / Principal Inference Engineer - ML Platform

    We're building the tools and platform that empower our community to bring any experience that they can imagine to life. Our vision is to reimagine the way people come together, from anywhere in the world, and on any device. · 4+ years of professional experience. · Bachelor's degr ...

    San Mateo $327,000 - $397,460 (USD) Full time

    1 month ago

  • Product Manager MBA Intern, AI Platform Inference - Summer 2026

    We are seeking an intern for our Product Management organization at NVIDIA. As a Product Manager MBA Intern for AI Platform Inference, you will be responsible for building tools, SDKs, and libraries that enable developers' inference deployments to thrive on NVIDIA GPUs. · ...

    Santa Clara $27 - $82 (USD)

    4 weeks ago

  • Product Marketing Manager, Inference

    Become part of a team solving some of the most exciting challenges in the industry. · ...

    Sunnyvale $143,000 - $210,000 (USD)

    4 days ago

  • Senior Technical Program Manager

    Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. We're looking for a Senior Technical Program Manager to lead complex, cross-functional programs across our AI training and inference platforms. · This role sits at the intersection of engineering, prod ...

    Sunnyvale

    2 weeks ago

  • Staff Product Manager, Managed Inference

    Join Crusoe's mission to accelerate energy abundance intelligence speed sustainability drive meaningful innovation make tangible impact join pace-setting team responsible transformative cloud infrastructure. ...

    Sunnyvale $204,000 - $247,000 (USD) Full time

    1 month ago

  • Staff Software Engineer, Observability

    Cerebras Systems is building the world's largest AI chip. We are seeking a talented Platform Software Engineer to join our team. · Set Technical Direction: Own the long-term architecture and roadmap for the observability platform across all pillars (metrics, logs, traces, and eve ...

    Sunnyvale Full time

    1 week ago

  • Deployment Engineer, AI Inference

    We are seeking a highly skilled Deployment Engineer to build and operate our cutting-edge inference clusters. · These clusters would provide the candidate an opportunity to work with the world's largest computer chip, the Wafer-Scale Engine (WSE), and the systems that harness its ...

    Sunnyvale

    1 month ago