Senior Staff Tech Lead Inference - San Francisco - Fal

Fal San Francisco

3 days ago

Description

A cutting-edge technology company is seeking a Staff Technical Lead for Inference & ML Performance in San Francisco. This role involves setting technical direction for a team, enhancing inference performance, and mentoring engineers. Candidates should have deep experience in ML optimization and familiarity with advanced techniques in model inference. Join us to shape the future of generative media infrastructure with significant impact and high growth potential.

#J-18808-Ljbffr

Staff Technical Lead for Inference
3 days ago

Only for registered members San Francisco

fal is pioneering the next generation of generative-media infrastructure. We're pushing the boundaries of model inference performance to power seamless creative experiences at unprecedented scale. · We're looking for a Staff Technical Lead for Inference & ML Performance · ...
Inference Technical Lead, Sora
3 days ago

Only for registered members San Francisco

The Sora team is pioneering multimodal capabilities for OpenAI's foundation models. · ...
Lead Engineer, Inference Platform
1 month ago

Only for registered members Palo Alto $137,000 - $270,000 (USD)

We're looking for a Lead Engineer, Inference Platform to join our team building the inference platform for embedding models that power semantic search retrieval and AI-native features across MongoDB Atlas. · This role is part of the broader Search and AI Platform team and involve ...
Inference Engineering Manager
3 weeks ago

Only for registered members San Francisco, CA

We are looking for an Inference Engineering Manager to lead our AI Inference team. This is a unique opportunity to build and scale the infrastructure that powers Perplexity's products and APIs, · serving millions of users with state-of-the-art AI capabilities.Lead and grow a high ...
Inference Engineering Manager
3 weeks ago

Only for registered members San Francisco $300,000 - $385,000 (USD)

We are looking for an Inference Engineering Manager to lead our AI Inference team. · This is a unique opportunity to build and scale the infrastructure that powers Perplexity's products and APIs, · serving millions of users with state-of-the-art AI capabilities. · ...
Engineering Manager
5 days ago

Only for registered members San Francisco $300,000 - $385,000 (USD)

We are looking for an Inference Engineering Manager to lead our AI Inference team.This is a unique opportunity to build and scale the infrastructure that powers Perplexity's products and APIs. · , · You will own the technical direction and execution of our inference systems while ...
Inference Engineering Manager
3 weeks ago

Only for registered members San Francisco

We are looking for an Inference Engineering Manager to lead our AI Inference team. This is a unique opportunity to build and scale the infrastructure that powers Perplexity's products and APIs serving millions of users with state-of-the-art AI capabilities. · ...
Solutions Lead
1 month ago

Only for registered members San Francisco

We are hiring a Solutions & Customer Success Lead to deploy, optimize, and scale AI inference systems across real production hardware. · Our client is a high-revenue AI infrastructure startup redefining how agentic AI runs across CPUs, GPUs, and accelerators. · ...
Engineering Manager
1 week ago

Only for registered members San Francisco

We are looking for an Inference Engineering Manager to lead our AI Inference team. · This is a unique opportunity to build and scale the infrastructure that powers Perplexity's products and APIs, · serving millions of users with state-of-the-art AI capabilities.Lead and grow a hi ...
Inference Engineering Manager
3 weeks ago

Only for registered members San Francisco Full time

We are looking for an Inference Engineering Manager to lead our AI Inference team. This is a unique opportunity to build and scale the infrastructure that powers Perplexity's products and APIs, serving millions of users with state-of-the-art AI capabilities. · ...
Staff Software Engineer
3 days ago

Only for registered members San Francisco, California

As a staff software engineer for GenAI inference, you will lead the architecture, development, and optimization of the inference engine that powers Databricks Foundation Model API. · ...
AI Engineer
4 weeks ago

Only for registered members San Francisco

We're working with Annapurna Labs (part of Amazon) on this exciting opportunity.Join Annapurna Labs, an Amazon company, and lead a cutting-edge team optimizing Large Language Models (LLMs) like Llama and GPT-OSS for lightning-fast inference on AWS Trainium accelerators. · ...
Research Intern RL & Post-Training Systems, Turbo (Summer 2026)
1 week ago

Only for registered members San Francisco $58 - $63 (USD)

The Turbo Research team investigates how to make post-training and reinforcement learning for large language models efficient, scalable, and reliable. · As a research intern, you will study RL and post-training methods whose performance and scalability are tightly coupled to infe ...
Product Lead
2 weeks ago

Only for registered members San Francisco, CA

A well-funded and fast-growing AI company is building a next-generation platform for safe, performant, and cost-efficient AI agent deployment across enterprise environments. · Owning the roadmap and execution for the infrastructure powering model deployment, orchestration, and us ...
Research Intern RL & Post-Training Systems, Turbo (Summer 2026)
1 month ago

Only for registered members San Francisco, CA

About Together AI is a research-driven artificial intelligence company that aims to significantly lower the cost of modern AI systems by co-designing software, algorithms and models. · ...
Senior Staff Data Scientist
3 weeks ago

Only for registered members San Francisco, CA

How should Intuit be using causal inference methods to make decisions across marketing product and business strategy?We re building out a causal inference team and we are looking for a Senior Staff Data Scientist thats ready to lead the way. ...
Junior Technical Product Manager
3 weeks ago

Only for registered members San Francisco

This position involves managing AI inference platforms with expertise in technical product management. · Managing the product lifecycle. · Developing product strategies. · ...
Senior Software Engineer
4 hours ago

Only for registered members San Francisco $200,000 - $270,000 (USD)

Baseten powers mission-critical inference for the world's most dynamic AI companies. We enable companies operating at the frontier of AI to bring cutting-edge models into production. · ...
Junior Technical Product Manager
3 weeks ago

Only for registered members San Francisco Bay Area

This is a full-time position for a Junior Technical Product Manager - AI Inference based in the San Francisco Bay Area. · ...
Senior Software Engineer
1 month ago

Only for registered members San Francisco, CA

We are growing quickly and recently raised our $150M Series D backed by investors including BOND IVP Spark Capital Greylock and Conviction Join us and help build the platform engineers turn to to ship AI products. · Design and architect scalable infrastructure systems for our ML ...
Senior Software Engineer, Model Inference
4 weeks ago

Only for registered members San Francisco $181,100 - $318,400 (USD)

Join Apple Maps to help build the best map in the world.In this role on ML Platform, you will help bring advanced deep learning and large language models into high-volume, low-latency, highly available production serving, · improving search quality and powering experiences across ...

Staff Technical Lead for Inference
Only for registered members San Francisco
Inference Technical Lead, Sora
Only for registered members San Francisco
Lead Engineer, Inference Platform
Only for registered members Palo Alto
Inference Engineering Manager
Only for registered members San Francisco, CA
Inference Engineering Manager
Only for registered members San Francisco
Engineering Manager
Only for registered members San Francisco
Inference Engineering Manager
Only for registered members San Francisco
Solutions Lead
Only for registered members San Francisco
Engineering Manager
Only for registered members San Francisco
Inference Engineering Manager
Full time Only for registered members San Francisco
Staff Software Engineer
Only for registered members San Francisco, California
AI Engineer
Only for registered members San Francisco
Research Intern RL & Post-Training Systems, Turbo (Summer 2026)
Only for registered members San Francisco
Product Lead
Only for registered members San Francisco, CA
Research Intern RL & Post-Training Systems, Turbo (Summer 2026)
Only for registered members San Francisco, CA
Senior Staff Data Scientist
Only for registered members San Francisco, CA
Junior Technical Product Manager
Only for registered members San Francisco
Senior Software Engineer
Only for registered members San Francisco
Junior Technical Product Manager
Only for registered members San Francisco Bay Area
Senior Software Engineer
Only for registered members San Francisco, CA
Senior Software Engineer, Model Inference
Only for registered members San Francisco