ML/AI Research Engineer — Agentic AI Lab - San Francisco Bay

Only for registered members San Francisco Bay, United States

3 days ago

Default job background
ML/AI Research Engineer — Agentic AI Lab (Founding Team) · Location: San Francisco Bay Area · Type: Full-Time · Compensation: Competitive salary + meaningful equity (founding tier) · Backed by 8VC, we're building a world-class team to tackle one of the industry's most critical in ...
Job description

ML/AI Research Engineer — Agentic AI Lab (Founding Team)

Location: San Francisco Bay Area
Type: Full-Time
Compensation: Competitive salary + meaningful equity (founding tier)

Backed by 8VC, we're building a world-class team to tackle one of the industry's most critical infrastructure problems.

About the Role

We're designing the future of enterprise AI infrastructure — grounded in agents, retrieval-augmented generation (RAG), knowledge graphs, and multi-tenant governance.

We're looking for an ML/AI Research Engineer to join our AI Lab and lead the design, training, evaluation, and optimization of agent-native AI models. You'll work at the intersection of LLMs, vector search, graph reasoning, and reinforcement learning — building the intelligence layer that sits on top of our enterprise data fabric.

This isn't a prompt engineer role. It's full-cycle ML: from data curation and fine-tuning to evaluation, interpretability, and deployment — with cost-awareness, alignment, and agent coordination all in scope.

Core Responsibilities

  • Fine-tune and evaluate open-source LLMs (e.g. LLaMA 3, Mistral, Falcon, Mixtral) for enterprise use cases with both structured and unstructured data

  • Build and optimize RAG pipelines using LangChain, LangGraph, LlamaIndex, or Dust — integrated with our vector DBs and internal knowledge graph

  • Train agent architectures (ReAct, AutoGPT, BabyAGI, OpenAgents) using enterprise task data

  • Develop embedding-based memory and retrieval chains with token-efficient chunking strategies

  • Create reinforcement learning pipelines to optimize agent behaviors (e.g. RLHF, DPO, PPO)

  • Establish scalable evaluation harnesses for LLM and agent performance, including synthetic evals, trace capture, and explainability tools

  • Contribute to model observability, drift detection, error classification, and alignment

  • Optimize inference latency and GPU resource utilization across cloud and on-prem environments

Desired Experience

Model Training:

  • Deep experience fine-tuning open-source LLMs using HuggingFace Transformers, DeepSpeed, vLLM, FSDP, LoRA/QLoRA

  • Worked with both base and instruction-tuned models; familiar with SFT, RLHF, DPO pipelines

  • Comfortable building and maintaining custom training datasets, filters, and eval splits

  • Understand tradeoffs in batch size, token window, optimizer, precision (FP16, bfloat16), and quantization

RAG + Knowledge Graphs:

  • Experience building enterprise-grade RAG pipelines integrated with real-time or contextual data

  • Familiar with LangChain, LangGraph, LlamaIndex, and open-source vector DBs (Weaviate, Qdrant, FAISS)

  • Experience grounding models with structured data (SQL, graph, metadata) + unstructured sources

  • Bonus: Worked with Neo4j, Puppygraph, RDF, OWL, or other semantic modeling systems

Agent Intelligence:

  • Experience training or customizing agent frameworks with multi-step reasoning and memory

  • Understand common agent loop patterns (e.g. Plan→Act→Reflect), memory recall, and tools

  • Familiar with self-correction, multi-agent communication, and agent ops logging

Optimization:

  • Strong background in token cost optimization, chunking strategies, reranking (e.g. Cohere, Jina), compression, and retrieval latency tuning

  • Experience running models under quantized (int4/int8) or multi-GPU settings with inference tuning (vLLM, TGI)

Preferred Tech Stack

  • LLM Training & Inference: HuggingFace Transformers, DeepSpeed, vLLM, FlashAttention, FSDP, LoRA

  • Agent Orchestration: LangChain, LangGraph, ReAct, OpenAgents, LlamaIndex

  • Vector DBs: Weaviate, Qdrant, FAISS, Pinecone, Chroma

  • Graph Knowledge Systems: Neo4j, Puppygraph, RDF, Gremlin, JSON-LD

  • Storage & Access: Iceberg, DuckDB, Postgres, Parquet, Delta Lake

  • Evaluation: OpenLLM Evals, Trulens, Ragas, LangSmith, Weight & Biases

  • Compute: Ray, Kubernetes, TGI, Sagemaker, LambdaLabs, Modal

  • Languages: Python (core), optionally Rust (for inference layers) or JS (for UX experimentation)

Soft Skills & Mindset

  • Startup DNA: resourceful, fast-moving, and capable of working in ambiguity

  • Deep curiosity about agent-based architectures and real-world enterprise complexity

  • Comfortable owning model performance end-to-end: from dataset to deployment

  • Strong instincts around explainability, safety, and continuous improvement

  • Enjoy pair-designing with product and UX to shape capabilities, not just APIs

Why This Role Matters

This role is foundational to our thesis: that agents + enterprise data + knowledge modeling can create intelligent infrastructure for real-world, multi-billion-dollar workflows. Your work won't be buried in research reports — it will be productionized and activated by hundreds of users and hundreds of thousands of decisions. If this is your dream role - we would love to hear from you.



Similar jobs

  • Work in company

    ML Ops Engineer — Agentic AI Lab

    Only for registered members

    ML Ops Engineer — Agentic AI Lab (Founding Team) · Location: San Francisco Bay Area · Type: Full-Time · Compensation: Competitive salary + meaningful equity (founding tier) · Backed by 8VC, we're building a world-class team to tackle one of the industry's most critical infrastruc ...

    San Francisco Bay

    3 days ago

  • Work in company

    ML/AI Research Engineer — Agentic AI Lab

    Only for registered members

    We're designing the future of enterprise AI infrastructure — grounded in agents, retrieval-augmented generation (RAG), knowledge graphs, and multi-tenant governance. · This isn't a prompt engineer role · ...

    San Francisco

    1 month ago

  • This is a job that Jill, our AI Recruiter, is recruiting for on behalf of one of our customers. · She will pick the best candidates from Jack's network · The next step is to speak to Jack. · Senior Software Engineer (Agentic Systems) · Salary: $160K – $250K + Equity · Company Des ...

    San Francisco

    3 days ago

  • This is a job that involves leading the development of sophisticated agentic features like RAG and asynchronous task management within a high-performance engineering culture. · Join a well-funded Series B startup backed by top-tier global venture capital firms at the forefront of ...

    San Francisco $160,000 - $250,000 (USD) Full time

    3 weeks ago

  • Work in company

    Operator-In-Residence (OIR): Agents

    Only for registered members

    Primary Labs is looking for an Operator-in-Residence (OIR) to explore the frontier of local AI and agent frameworks and turn technical breakthroughs into company opportunities. · ...

    San Francisco, CA

    1 week ago

  • Work in company

    CTO / Co-Founder - Agentic AI Systems

    Only for registered members

    Stealth AI Startup | Silicon Valley Deep Tech ChipSage Labs is building next-generation agentic AI systems powered by large language models.Architect and lead core AI platform · ...

    San Francisco

    4 days ago

  • Work in company

    Operator-In-Residence (OIR): Agents

    Only for registered members

    We're looking for an Operator-in-Residence (OIR) to explore local AI and agent frameworks. · Design and build agent-based systems using emerging frameworks. · ...

    San Francisco $125,000 - $175,000 (USD) Full time

    1 week ago

  • Work in company

    Research Scientist

    Only for registered members

    You'll be joining a research-driven AI company building reinforcement learning simulation environments for agents and large language models, · with a focus on post-training, evaluation, and scalable supervision. · ...

    San Francisco

    1 week ago

  • Work in company

    Applied Research

    Only for registered members

    Be Your Own Lab · Prime Intellect builds the infrastructure that frontier AI labs build internally, and makes it available to everyone. Our platform, Lab, unifies environments, evaluations, sandboxes, and high-performance training into a single full-stack system for post-training ...

    San Francisco, CA $70,000 - $140,000 (USD) per year

    2 days ago

  • Work in company

    Applied Research

    Only for registered members

    · Be Your Own Lab · Prime Intellect builds the infrastructure that frontier AI labs build internally, and makes it available to everyone. Our platform, Lab, unifies environments, evaluations, sandboxes, and high-performance training into a single full-stack system for post-train ...

    San Francisco $70,000 - $140,000 (USD) per year

    1 day ago

  • Work in company

    Applied Research

    Only for registered members

    We're looking for a Forward-Deployed Research Engineer (FDRE) to serve as the primary technical interface between Prime Intellect and our most important customers: AI companies, research labs, and enterprises running post-training and agentic RL on our platform. · ...

    San Francisco $70,000 - $140,000 (USD) per year Full time

    3 days ago

  • Work in company

    Research Scientist

    Only for registered members

    You'll join a research-driven AI company building reinforcement learning simulation environments for agents and large language models. · Experience in applied research in reinforcement learning, LLM post-training, or agent-based systemsStrong understanding of transformer architec ...

    San Francisco

    1 month ago

  • Work in company

    Forward Deploy AI Engineer

    Only for registered members

    Judgment Labs builds infrastructure for Agent Behavior Monitoring (ABM). Hundreds of teams building autonomous agents rely on Judgment to understand how their systems are behaving post-deployment. · We've raised $30M+ across two rounds in the past five months. The Role: Forward D ...

    San Francisco

    1 week ago

  • Amazon has launched a new research lab in San Francisco to develop foundational capabilities for useful AI agents. We're enabling practical AI to make our customers more productive empowered and fulfilled. · ...

    San Francisco, CA

    5 days ago

  • Work in company

    Senior Machine Learning Engineer

    Only for registered members

    We are seeking a talented large language model application engineer to join our team.The ideal candidate should have a strong spirit of innovation, an outstanding technical background, and a passion for exploring large language models. · ...

    San Francisco

    1 month ago

  • Work in company

    Frontier Data Lead

    Only for registered members

    Turing builds large-scale datasets and reinforcement learning (RL) environments that power post-training for the world's leading AI labs and enterprises. · ...

    San Francisco, California, United States

    1 week ago

  • Work in company

    Senior Machine Learning Scientist, AI Agents

    Only for registered members

    We're seeking a Senior Machine Learning Scientist to accelerate and optimize our progress in agentic AI methods for biological and clinical data curation and analysis. · In this role, you will collaborate with biomedical research experts as well as other machine learning scientis ...

    San Francisco $251,700 - $330,000 (USD)

    3 days ago

  • Work in company

    Forward Deploy AI Engineer

    Only for registered members

    We're building the infrastructure that fixes this: the monitoring layer that makes agents self-improving. · ...

    San Francisco Full time

    2 weeks ago

  • A new research lab in San Francisco is developing foundational capabilities for useful AI agents. We're enabling practical AI to make our customers more productive, empowered, and fulfilled. · We're entering an exciting new era where agents can redefine what AI makes possible. To ...

    San Francisco

    5 days ago

  • Work in company

    Create Your Own Role

    Only for registered members

    · If you don't see a role currently listed that fits your background, · feel free to get on our radar by submitting your resume below. · ...

    San Francisco, California

    1 week ago