Applied Research - San Francisco
7 hours ago

Job description
Be Your Own Lab
Prime Intellect builds the infrastructure that frontier AI labs build internally, and makes it available to everyone. Our platform, Lab, unifies environments, evaluations, sandboxes, and high-performance training into a single full-stack system for post-training at frontier scale, from RL and SFT to tool use, agent workflows, and deployment. We validate everything by using it ourselves, training open state-of-the-art models on the same stack we put in your hands. We're looking for people who want to build at the intersection of frontier research and real infrastructure.
We recently raised $15mm in funding (total of $20mm raised) led by Founders Fund, with participation from Menlo Ventures and prominent angels including Andrej Karpathy (Eureka AI, Tesla, OpenAI), Tri Dao (Chief Scientific Officer of Together AI), Dylan Patel (SemiAnalysis), Clem Delangue (Huggingface), Emad Mostaque (Stability AI) and many others.
About the Role
We're looking for a Forward-Deployed Research Engineer (FDRE) to serve as the primary technical interface between Prime Intellect and our most important customers: AI companies, research labs, and enterprises running post-training and agentic RL on our platform.
This is not a traditional research role. You'll spend most of your time embedded with customers, understanding their models, workflows, and goals. Then, you'll translate those objectives into concrete training runs, environment designs, evaluation harnesses, and deployment recipes using the Lab stack. You are the person who makes the platform work in practice for real workloads.
You'll work closely with our research, product, and infrastructure teams to feed field insights back into the platform, shaping what we build next based on what customers actually need.
What You'll Do
Customer Engagement & Technical Delivery
-
Embed directly with strategic customers to understand their agent architectures, failure modes, and product goals
-
Design and build custom RL environments, evaluation harnesses, and verifiers that capture what "good" looks like for each customer's domain
-
Architect agent scaffolding — tool use, multi-step reasoning, memory, sandbox execution — tailored to customer workflows
-
Configure and launch training runs on Lab, iterating on reward functions, rollout strategies, and evaluation criteria
-
Serve as the technical lead for engagements end-to-end: from discovery through deployed, improved models
Platform Feedback & Ecosystem
-
Identify repeatable patterns from customer engagements and codify them into reference implementations, templates, and documentation
-
Serve as the voice of the customer internally, shaping the roadmap for Lab, verifiers, the Environments Hub, and training infrastructure
-
Build high-quality examples and "recipes" that make it easy for new customers and open-source contributors to extend the stack
-
Contribute to technical content (blog posts, tutorials, case studies) that demonstrates real-world platform usage
Applied Research & Experimentation
-
Develop novel evaluation methodologies for agentic behavior — multi-step reasoning, tool use correctness, recovery from failure, long-horizon task completion
-
Prototype and iterate on agent harnesses for real-world tasks: code generation, workflow automation, document processing, and more
-
Experiment with reward design, rubric construction, and environment shaping to improve training signal quality
-
Stay current on the frontier of agentic AI, evals, and post-training methods, and bring that knowledge directly into customer work
What We're Looking For
-
Deep hands-on experience building, evaluating, or deploying LLM-based agents in the past 1–2 years — you've seen what breaks in production and know what good evals look like
-
Strong intuition for evaluation design: you can look at a customer's agent and quickly identify what to measure, how to construct a rubric, and where the reward signal is weak
-
Working understanding of RL and post-training concepts (GRPO, RLHF, reward modeling, SFT) — you don't need to have written a trainer from scratch, but you should understand what the knobs do and why they matter
-
Strong Python skills and comfort with the modern AI stack (Hugging Face, inference engines, agent frameworks)
-
Experience in a customer-facing or consulting-adjacent technical role, or as a technical founder — you're comfortable in a room with a customer's engineering team figuring out what to build
-
Excellent written and verbal communication — you can write a clear environment spec, a compelling case study, and a useful Slack message to a frustrated customer
-
High agency and comfort with ambiguity. You don't wait for specs; you scope the problem, ship a solution, and iterate
Nice-to-Haves
-
Experience with agent frameworks and tooling (DSPy, LangGraph, MCP, Stagehand, browser automation)
-
Experience building or running LLM evaluation pipelines at scale (benchmarks, synthetic data generation, model grading)
-
Research experience — publications, open-source contributions, or benchmarks in ML/RL/agents
-
Familiarity with sandbox/code execution environments for agent evaluation
-
Web programming experience (React, TypeScript, ) for building demos and customer-facing tooling
What We Offer
-
Competitive Compensation equity incentives
-
Flexible Work (San Francisco or hybrid-remote)
-
Visa Sponsorship & relocation support
-
Professional Development budget
-
Team Off-sites & conference attendance
Growth Opportunity
You'll join a mission-driven team working at the frontier of open, superintelligence infra. In this role, you'll have the opportunity to:
-
Shape the evolution of agent-driven solutions—from research breakthroughs to production systems used by real customers.
-
Collaborate with leading researchers, engineers, and partners pushing the boundaries of RL and post-training.
-
Grow with a fast-moving organization where your contributions directly influence both the technical direction and the broader AI ecosystem.
If you're excited to move fast, build boldly, and help define how agentic AI is developed and deployed, we'd love to hear from you.
Ready to build the open superintelligence infrastructure of tomorrow?
Apply now to help us make powerful, open AGI accessible to everyone.
Similar jobs
Be Your Own Lab · Prime Intellect builds the infrastructure that frontier AI labs build internally, and makes it available to everyone. Our platform, Lab, unifies environments, evaluations, sandboxes, and high-performance training into a single full-stack system for post-training ...
14 hours ago
We're looking for a Forward-Deployed Research Engineer (FDRE) to serve as the primary technical interface between Prime Intellect and our most important customers: AI companies, research labs, and enterprises running post-training and agentic RL on our platform. · ...
1 day ago
We are building the open superintelligence stack - from frontier agentic models to the infra that enables anyone to create, train, and deploy them. We aggregate and orchestrate global compute into a single control plane and pair it with a frontier open post-training stack: enviro ...
1 week ago
We are building the open superintelligence stack - from frontier agentic models to the infra that enables anyone to create train and deploy them. · We aggregate and orchestrate global compute into a single control plane and pair it with a frontier open post-training stack: enviro ...
1 month ago
We are seeking an innovative and passionate Applied Research Engineer to join our dynamic team in San Francisco. In this cutting-edge role you will work at the intersection of engineering and applied research utilizing advanced methodologies to drive technological advancements. · ...
1 week ago
Lensa is a career site that helps job seekers find great jobs in the US. At Capital One we are creating trustworthy and reliable AI systems changing banking for good. · ...
1 month ago
At Capital One, we are creating trustworthy and reliable AI systems, changing banking for good. For years, Capital One has been leading the industry in using machine learning to create real-time, intelligent, automated customer experiences. · Partner with a cross-functional team ...
1 day ago
About Radiant Security: A Bay Area Cyber AI Startup that raised funding from top tier investors. Our vision is simple: enable all security teams to perform security operations with the efficiency needed to prevent breaches. · ...
1 month ago
We are creating trustworthy and reliable AI systems changing banking for good at Capital One. · ...
1 month ago
Applied Researcher IIOverview: · At Capital One, we are creating trustworthy and reliable AI systems, changing banking for good. For years, Capital One has been leading the industry in using machine learning to create real-time, intelligent, automated customer experiences. From i ...
1 day ago
We are creating trustworthy and reliable AI systems changing banking for good. · Partner with a cross-functional team of data scientists software engineers machine learning engineers and product managers to deliver AI-powered products that change how customers interact with their ...
1 month ago
We are creating trustworthy and reliable AI systems changing banking for good. · Leverage a broad stack of technologies — Pytorch, AWS Ultraclusters, Huggingface to reveal insights hidden within huge volumes of numeric and textual data. · ...
1 month ago
We are creating trustworthy and reliable AI systems, changing banking for good. · Partner with a cross-functional team of data scientists, software engineers, machine learning engineers and product managers to deliver AI-powered products that change how customers interact with th ...
1 week ago
We are creating trustworthy and reliable AI systems changing banking for good at Capital One. For years we have been leading the industry in using machine learning to create real-time intelligent automated customer experiences. · ...
1 week ago
Help bring the transformative power of emerging AI capabilities to reimagine how we serve our customers. · ...
1 month ago
At Capital One, we are creating trustworthy and reliable AI systems, changing banking for good. · ...
1 day ago
At Capital One we are creating trustworthy AI systems changing banking for good For years Capital One has been leading industry in using machine learning to create real time intelligent automated customer experiences We are committed to building world class applied science engine ...
2 weeks ago
We are representing an elite AI research lab that is exclusively focused on the final frontier of internet data: Video. · This is a rare chance to join a high-growth, high-revenue "soonicorn" where your work directly impacts the models used by the world's top AI labs. · ...
3 weeks ago
+We are creating trustworthy and reliable AI systems changing banking for good. · +-Currently has PhD in Electrical Engineering Computer Science Mathematics related fields excepted that required degree will be obtained on or before scheduled start date M.S in Electrical Engineeri ...
2 weeks ago
We are creating trustworthy and reliable AI systems at Capital One. · Partner with cross-functional teams to deliver AI-powered products. · Leverage technologies including Pytorch and AWS Ultraclusters. · ...
1 week ago