Senior Staff Tech Lead Inference - San Francisco - Fal

    Fal
    Fal San Francisco

    3 days ago

    Description
    A cutting-edge technology company is seeking a Staff Technical Lead for Inference & ML Performance in San Francisco. This role involves setting technical direction for a team, enhancing inference performance, and mentoring engineers. Candidates should have deep experience in ML optimization and familiarity with advanced techniques in model inference. Join us to shape the future of generative media infrastructure with significant impact and high growth potential.

    #J-18808-Ljbffr

  • Only for registered members San Francisco

    fal is pioneering the next generation of generative-media infrastructure. We're pushing the boundaries of model inference performance to power seamless creative experiences at unprecedented scale. · We're looking for a Staff Technical Lead for Inference & ML Performance · ...

  • Only for registered members San Francisco

    The Sora team is pioneering multimodal capabilities for OpenAI's foundation models. · ...

  • Only for registered members Palo Alto $137,000 - $270,000 (USD)

    We're looking for a Lead Engineer, Inference Platform to join our team building the inference platform for embedding models that power semantic search retrieval and AI-native features across MongoDB Atlas. · This role is part of the broader Search and AI Platform team and involve ...

  • Only for registered members San Francisco, CA

    We are looking for an Inference Engineering Manager to lead our AI Inference team. This is a unique opportunity to build and scale the infrastructure that powers Perplexity's products and APIs, · serving millions of users with state-of-the-art AI capabilities.Lead and grow a high ...

  • Only for registered members San Francisco $300,000 - $385,000 (USD)

    We are looking for an Inference Engineering Manager to lead our AI Inference team. · This is a unique opportunity to build and scale the infrastructure that powers Perplexity's products and APIs, · serving millions of users with state-of-the-art AI capabilities. · ...

  • Only for registered members San Francisco $300,000 - $385,000 (USD)

    We are looking for an Inference Engineering Manager to lead our AI Inference team.This is a unique opportunity to build and scale the infrastructure that powers Perplexity's products and APIs. · , · You will own the technical direction and execution of our inference systems while ...

  • Only for registered members San Francisco

    We are looking for an Inference Engineering Manager to lead our AI Inference team. This is a unique opportunity to build and scale the infrastructure that powers Perplexity's products and APIs serving millions of users with state-of-the-art AI capabilities. · ...

  • Solutions Lead

    1 month ago

    Only for registered members San Francisco

    We are hiring a Solutions & Customer Success Lead to deploy, optimize, and scale AI inference systems across real production hardware. · Our client is a high-revenue AI infrastructure startup redefining how agentic AI runs across CPUs, GPUs, and accelerators. · ...

  • Only for registered members San Francisco

    We are looking for an Inference Engineering Manager to lead our AI Inference team. · This is a unique opportunity to build and scale the infrastructure that powers Perplexity's products and APIs, · serving millions of users with state-of-the-art AI capabilities.Lead and grow a hi ...

  • Only for registered members San Francisco Full time

    We are looking for an Inference Engineering Manager to lead our AI Inference team. This is a unique opportunity to build and scale the infrastructure that powers Perplexity's products and APIs, serving millions of users with state-of-the-art AI capabilities. · ...

  • Only for registered members San Francisco, California

    As a staff software engineer for GenAI inference, you will lead the architecture, development, and optimization of the inference engine that powers Databricks Foundation Model API. · ...

  • AI Engineer

    4 weeks ago

    Only for registered members San Francisco

    We're working with Annapurna Labs (part of Amazon) on this exciting opportunity.Join Annapurna Labs, an Amazon company, and lead a cutting-edge team optimizing Large Language Models (LLMs) like Llama and GPT-OSS for lightning-fast inference on AWS Trainium accelerators. · ...

  • Only for registered members San Francisco $58 - $63 (USD)

    The Turbo Research team investigates how to make post-training and reinforcement learning for large language models efficient, scalable, and reliable. · As a research intern, you will study RL and post-training methods whose performance and scalability are tightly coupled to infe ...

  • Product Lead

    2 weeks ago

    Only for registered members San Francisco, CA

    A well-funded and fast-growing AI company is building a next-generation platform for safe, performant, and cost-efficient AI agent deployment across enterprise environments. · Owning the roadmap and execution for the infrastructure powering model deployment, orchestration, and us ...

  • Only for registered members San Francisco, CA

    About Together AI is a research-driven artificial intelligence company that aims to significantly lower the cost of modern AI systems by co-designing software, algorithms and models. · ...

  • Only for registered members San Francisco, CA

    How should Intuit be using causal inference methods to make decisions across marketing product and business strategy?We re building out a causal inference team and we are looking for a Senior Staff Data Scientist thats ready to lead the way. ...

  • Only for registered members San Francisco

    This position involves managing AI inference platforms with expertise in technical product management. · Managing the product lifecycle. · Developing product strategies. · ...

  • Only for registered members San Francisco $200,000 - $270,000 (USD)

    Baseten powers mission-critical inference for the world's most dynamic AI companies. We enable companies operating at the frontier of AI to bring cutting-edge models into production. · ...

  • Only for registered members San Francisco Bay Area

    This is a full-time position for a Junior Technical Product Manager - AI Inference based in the San Francisco Bay Area. · ...

  • Only for registered members San Francisco, CA

    We are growing quickly and recently raised our $150M Series D backed by investors including BOND IVP Spark Capital Greylock and Conviction Join us and help build the platform engineers turn to to ship AI products. · Design and architect scalable infrastructure systems for our ML ...

  • Only for registered members San Francisco $181,100 - $318,400 (USD)

    Join Apple Maps to help build the best map in the world.In this role on ML Platform, you will help bring advanced deep learning and large language models into high-volume, low-latency, highly available production serving, · improving search quality and powering experiences across ...

Jobs
>
San Francisco