LLM Inference Engineer - Palo Alto, CA

Only for registered members Palo Alto, CA, United States

2 weeks ago

About Us · Hippocratic AI is the leading generative AI company in healthcare. We have the only system that can have safe, autonomous, clinical conversations with patients. We have trained our own LLMs as part of our Polaris constellation, resulting in a system with over 99.9% acc ...

Job description

Lorem ipsum dolor sit amet
, consectetur adipiscing elit. Nullam tempor vestibulum ex, eget consequat quam pellentesque vel. Etiam congue sed elit nec elementum. Morbi diam metus, rutrum id eleifend ac, porta in lectus. Sed scelerisque a augue et ornare.

Donec lacinia nisi nec odio ultricies imperdiet.
Morbi a dolor dignissim, tristique enim et, semper lacus. Morbi laoreet sollicitudin justo eget eleifend. Donec felis augue, accumsan in dapibus a, mattis sed ligula.

Vestibulum at aliquet erat. Curabitur rhoncus urna vitae quam suscipit
, at pulvinar turpis lacinia. Mauris magna sem, dignissim finibus fermentum ac, placerat at ex. Pellentesque aliquet, lorem pulvinar mollis ornare, orci turpis fermentum urna, non ullamcorper ligula enim a ante. Duis dolor est, consectetur ut sapien lacinia, tempor condimentum purus.

Get full access

Access all high-level positions and get the job of your dreams.

Similar jobs

Work in company

LLM Inference Engineer

Only for registered members

We're seeking an experienced LLM Inference Engineer to optimize our large language model (LLM) serving infrastructure. · The ideal candidate has: · Extensive hands-on experience with state-of-the-art inference optimization techniques · A track record of deploying efficient, scala ...

Palo Alto

2 weeks ago

Work in company

LLM Inference Engineer

Only for registered members

We're seeking an experienced LLM Inference Engineer to optimize our large language model serving infrastructure.The ideal candidate has extensive hands-on experience with state-of-the-art inference optimization techniques and a track record of deploying efficient scalable LLM sys ...

Palo Alto Full time

1 month ago

Work in company Remote job

Inference Engineer

Only for registered members

We're looking for engineers who can bridge the gap between ML research and high-performance inference. · JAX / Equinox / Pallas stack · Rust systems programming with a focus on developer experience · ...

San Francisco Bay Area

3 weeks ago

Work in company

Lead Engineer, Inference Platform

Only for registered members

We're looking for a Lead Engineer, Inference Platform to join our team building the inference platform for embedding models that power semantic search retrieval and AI-native features across MongoDB Atlas. · This role is part of the broader Search and AI Platform team and involve ...

Palo Alto $137,000 - $270,000 (USD)

1 month ago

Work in company

Founding ML inference Engineer

Only for registered members

We're a stealth-mode VC-backed AI startup developing the world's fastest most efficient AI models redefining how large language models LLMs are trained deployed and used. · BSMSPhD in Computer Science Engineering or equivalent experience. · ...

Palo Alto

1 month ago

Work in company

Inference Engineer

Only for registered members

Mirai builds on-device inference layer for AI. · We enable model makers to run AI models directly on edge devices,Our stack spans low-level GPU kernels to high-level model conversion tools. · We're looking for engineers who bridge ML research and high-performance inference,You'll ...

San Francisco

3 weeks ago

Work in company

Senior Software Engineer, Inference Platform

Only for registered members

We're looking for a Senior Engineer to help build the next-generation inference platform that supports embedding models used for semantic search retrieval and AI-native experiences in MongoDB Atlas. · We'll join the broader Search and AI Platform organization collaborate with ML ...

Palo Alto, CA

1 month ago

Work in company

Software Engineer, AI Inference CoDesign

Only for registered members

+Job summary · Build robust AI frameworks to lower neural networks to edge devices · +ResponsibilitiesBuild robust AI infrastructure to train and fine-tune networks for Autopilot and Optimus on large GPU clusters ...

Palo Alto $132,000 - $390,000 (USD)

1 week ago

Work in company

Senior Software Engineer, Inference Platform

Only for registered members

We're looking for a Senior Engineer to help build the next-generation inference platform that supports embedding models used for semantic search, retrieval, and AI-native experiences in MongoDB Atlas. · Design and build components of a multi-tenant inference platform integrated d ...

Palo Alto, CA

1 day ago

Work in company

Platform Engineer, AI Inference

Only for registered members

We're on a mission to revolutionize AI infrastructure by systematically rebuilding the AI software stack from the ground up.We are looking for a Platform Engineer to own and improve the systems that support our Cloud Inference products. · The estimated base salary range for this ...

Los Altos $198,000 - $286,000 (USD)

1 month ago

Work in company

Software Engineer, Generalist, AI Inference

Only for registered members

We train and deploy large neural networks for efficient inference on compute-constrained edge devices. · Build robust AI frameworks to lower neural networks to edge devices · Create robust AI infrastructure to train and fine-tune networks for Autopilot and Optimus · ...

Palo Alto, CA

1 month ago

Work in company

Software Engineer, AI Inference CoDesign

Only for registered members

What To Expect · Our team productionizes and experiments with novel ML models for our custom HW- we train and deploy large neural networks for efficient inference on compute-constrained edge devices (CPU / GPU / AI ASIC). · ...

Palo Alto, CA

1 week ago

Work in company

Platform Engineer, AI Inference

Only for registered members

We are looking for a Platform Engineer to own and improve the systems that support our Cloud Inference products. · 5+ years of relevant experience · Experience maintaining Kubernetes clusters and applications · ...

Los Altos, CA

1 month ago

Work in company

Inference Engineer

Only for registered members

About Us Most AI is frozen in place - it doesn't adapt to the world. We think that's backwards. · ...

San Francisco, CA

1 month ago

Work in company

Software Engineer, Generalist, AI Inference

Only for registered members

We train and deploy large neural networks for efficient inference on compute-constrained edge devices (CPU / GPU / AI ASIC). · ...

Palo Alto $118,000 - $390,000 (USD)

1 month ago

Work in company

Inference Engine Product Manager

Only for registered members

We are seeking a dynamic and experienced Product manager to lead our Inference Engine product strategy and execution for GMI Cloud.This role will be instrumental in shaping the future of our cloud offerings with a strong focus on the rapidly evolving AI infrastructure and Maas la ...

Mountain View

1 month ago

Work in company

Diffusion/Inference/RL system engineer

Only for registered members

We are looking for an RL Systems Engineer to build large-scale reinforcement learning systems for post-training, fine-tuning and alignment. · Build large-scale reinforcement learning systems for post-training, fine-tuning and alignmentScales RL workloads across multi-node cluster ...

Palo Alto Full time

3 weeks ago

Work in company

Inference Engine Product Manager

Only for registered members

We are seeking a dynamic and experienced Product manager to lead our Inference Engine product strategy and execution for GMI Cloud. · Bonus: Experience with Model as a service product · Bonus: Experience with serverless GPU product · ...

Mountain View, CA

1 month ago

Work in company

BD Manager, Inference Engine

Only for registered members

We're looking for a BD manager to drive the identification, execution, and commercialization of customer POC for our AI infrastructure platform.You'll work closely with customers and internal engineering teams to turn early technical evaluations into long-term commercial partners ...

Mountain View

1 month ago

Work in company

BD Manager, Inference Engine

Only for registered members

We're looking for a BD manager to drive the identification, execution and commercialization of customer POC for our AI infrastructure platform. · Own and Drive Customer POCs Identify, qualify and source high-potential customer POCs aligned with strategic priorities. · ...

Mountain View, CA

1 month ago

LLM Inference Engineer - Palo Alto, CA

Job description

Similar jobs

LLM Inference Engineer

LLM Inference Engineer

Inference Engineer

Lead Engineer, Inference Platform

Founding ML inference Engineer

Inference Engineer

Senior Software Engineer, Inference Platform

Software Engineer, AI Inference CoDesign

Senior Software Engineer, Inference Platform

Platform Engineer, AI Inference

Software Engineer, Generalist, AI Inference

Software Engineer, AI Inference CoDesign

Platform Engineer, AI Inference

Inference Engineer

Software Engineer, Generalist, AI Inference

Inference Engine Product Manager

Diffusion/Inference/RL system engineer

Inference Engine Product Manager

BD Manager, Inference Engine

BD Manager, Inference Engine

Directory

for Recruiters

Information