Senior Software Engineer, AI Inference Platform - Sunnyvale
2 days ago

Job summary
We are seeking a talentedPlatform Software Engineer to join the team building the Cerebras Inference Platform.
You will be instrumental in designing,
developing and operating the core backend services and APIs that power
The inference platform.
- the core APIs for The Inference Platform handling model catalog management deployment of ML workloads scaling and status monitoring.
- and self-service access to inference models serving.
Job description
, consectetur adipiscing elit. Nullam tempor vestibulum ex, eget consequat quam pellentesque vel. Etiam congue sed elit nec elementum. Morbi diam metus, rutrum id eleifend ac, porta in lectus. Sed scelerisque a augue et ornare.
Donec lacinia nisi nec odio ultricies imperdiet.
Morbi a dolor dignissim, tristique enim et, semper lacus. Morbi laoreet sollicitudin justo eget eleifend. Donec felis augue, accumsan in dapibus a, mattis sed ligula.
Vestibulum at aliquet erat. Curabitur rhoncus urna vitae quam suscipit
, at pulvinar turpis lacinia. Mauris magna sem, dignissim finibus fermentum ac, placerat at ex. Pellentesque aliquet, lorem pulvinar mollis ornare, orci turpis fermentum urna, non ullamcorper ligula enim a ante. Duis dolor est, consectetur ut sapien lacinia, tempor condimentum purus.
Access all high-level positions and get the job of your dreams.
Similar jobs
We're looking for a deeply technical, hands-on engineering leader for our Inference Service Platform. · You will lead a high performing team to tackle a critical challenge: scaling LLM inference on Cerebras' advanced compute clusters and delivering a world-class, on-prem solution ...
1 week ago
This role is categorized as hybrid. This means the successful candidate is expected to report to the Sunnyvale Tecnical Center, CA at least three times per week, · at minimum or other frequency dictated by the business.About GM · Our vision is a world with Zero Crashes, Zero Emis ...
3 weeks ago
We're on a mission to revolutionize AI infrastructure by systematically rebuilding the AI software stack from the ground up.We are looking for a Platform Engineer to own and improve the systems that support our Cloud Inference products. · The estimated base salary range for this ...
1 month ago
We are looking for a Platform Engineer to own and improve the systems that support our Cloud Inference products. · 5+ years of relevant experience · Experience maintaining Kubernetes clusters and applications · ...
1 month ago
We are looking for a strategic Product Manager to lead the product planning, competitive benchmarking, · and PLG (Product-Led Growth) conversion for our AI Inference Engine. · ...
1 month ago
We're looking for a Lead Engineer, Inference Platform to join our team building the inference platform for embedding models that power semantic search retrieval and AI-native features across MongoDB Atlas. · This role is part of the broader Search and AI Platform team and involve ...
1 month ago
We are looking for a strategic Product Manager to lead the product planning and growth for our AI Inference Engine.You will directly influence the platform's product direction and commercial success. · Conduct end-to-end commercial and market research. · Define and iterate the pl ...
1 month ago
We're seeking a Director Product Manager to manage and mentor a team of product managers responsible for the strategy, roadmap, and execution of AI and ML developer tools that power inference at scale on custom GPUs and heterogeneous hardware platforms.This is a high-impact role ...
1 month ago
We're seeking a Director Product Manager to manage and mentor a team of product managers responsible for the strategy, roadmap, and execution of AI and ML developer tools that power inference at scale on custom GPUs and heterogeneous hardware platforms. · The RoleThis is a high-i ...
1 month ago
Senior Software Engineer – Inference Platform Infrastructure
Only for registered members
NVIDIA is seeking a Sr. Software Engineer – Inference Platform Infrastructure to build and automate the foundations for NVIDIA's inference services. · ...
15 hours ago
We're looking for a Senior Engineer to help build the next-generation inference platform that supports embedding models used for semantic search retrieval and AI-native experiences in MongoDB Atlas. · We'll join the broader Search and AI Platform organization collaborate with ML ...
1 month ago
We're building the tools and platform that empower our community to bring any experience that they can imagine to life.We're on a mission to connect a billion people with optimism and civility, · tens of millions of people come to Roblox daily. · solving unique technical challeng ...
1 month ago
Product Manager MBA Intern, AI Platform Inference - Summer 2026
Only for registered members
As NVIDIA Product Managers our goal is to enable developers to be successful on the NVIDIA Platform and push the boundaries of what is possible with their AI deployments For Inference we are the champions inside NVIDIA for AI developers looking to accelerate their deployments on ...
4 weeks ago
We're building the tools and platform that empower our community to bring any experience that they can imagine to life. Our vision is to reimagine the way people come together, from anywhere in the world, and on any device. · 4+ years of professional experience. · Bachelor's degr ...
1 month ago
Product Manager MBA Intern, AI Platform Inference - Summer 2026
Only for registered members
We are seeking an intern for our Product Management organization at NVIDIA. As a Product Manager MBA Intern for AI Platform Inference you will be responsible for building tools SDKs and libraries which enables developers' Inference deployments to thrive on NVIDIA GPUs. · ...
4 weeks ago
Become part of a team solving some of the most exciting challenges in the industry. · ...
4 days ago
Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs.We're looking for a Senior Technical Program Manager to lead complex, cross-functional programs across our AI training and inference platforms. · This role sits at the intersection of engineering, prod ...
2 weeks ago
Join Crusoe's mission to accelerate energy abundance intelligence speed sustainability drive meaningful innovation make tangible impact join pace-setting team responsible transformative cloud infrastructure. ...
1 month ago
Cerebras Systems is building the world's largest AI chip. We are seeking a talented Platform Software Engineer to join our team. · Set Technical Direction: Own the long-term architecture and roadmap for the observability platform across all pillars (metrics, logs, traces, and eve ...
1 week ago
We are seeking a highly skilled Deployment Engineer to build and operate our cutting-edge inference clusters. · These clusters would provide the candidate an opportunity to work with the world's largest computer chip, the Wafer-Scale Engine (WSE), and the systems that harness its ...
1 month ago