ML Infra Engineer - San Francisco - Physical Intelligence

    Physical Intelligence San Francisco

    1 week ago

    $1,500,000 - $2,000,000 (USD) per year
    Description

    As an ML Infra Engineer (Data Systems), you'll build and operate the data infrastructure that powers large-scale robot learning. Your systems will sit directly between raw data sources and training/evaluation, enabling us to move faster while maintaining performance, correctness, and reliability at scale.

    This is a systems role at the intersection of distributed systems, storage, and machine learning infrastructure.

    The Team

    The Infrastructure organization builds the foundations that make large-scale learning possible at PI. This includes training systems, data platforms, evaluation pipelines, and the tooling that allows researchers and roboticists to work with massive datasets safely and efficiently.

    In This Role You Will

    • Data Ingestion & Processing: Design and build high-throughput pipelines that validate, transform, and featurize raw multimodal data.
    • Batch & Streaming Systems: Operate large-scale batch and streaming workflows over massive datasets.
    • Storage Systems: Design object storage layouts, metadata systems, and efficient access patterns; choose file formats with performance and scalability in mind.
    • Data Lifecycle Management: Build systems for backfills, dataset rebuilds, garbage collection, and large-scale transformations.
    • Training-Time Performance: Optimize dataloaders, sharding, prefetching, caching, and throughput to reduce the time from data arrival to model training.
    • Metadata & Indexing: Build scalable metadata stores for datasets, annotations, and training artifacts.
    • Data Movement: Move hundreds of terabytes to petabytes efficiently across clusters and environments.
    • Operational Correctness: Implement observability, validation, and guardrails to prevent silent data regressions.
    • Cross-Functional Collaboration: Work closely with researchers, engineers, and roboticists to translate evolving data needs into robust systems.

    What We Hope You'll Bring

    • Strong software engineering fundamentals.
    • Experience building distributed systems or large-scale data pipelines.
    • Comfort reasoning about performance, memory, I/O, and storage efficiency.
    • Familiarity with batch and/or streaming processing systems.
    • Experience with object storage systems and data format tradeoffs.
    • Ownership mindset: design, build, operate, and iterate on systems end-to-end.
    • Enjoyment of working closely with researchers and unblocking fast-moving projects.

    Bonus Points If You Have

    • Experience with large ML training pipelines or dataloading systems.
    • Knowledge of columnar or custom data formats.
    • Experience with systems like ClickHouse, Ray, Flink, Spark, or similar.
    • Hands-on experience operating petabyte-scale datasets.
    • Experience debugging and fixing performance bottlenecks in data-heavy systems.


