Remote Research Engineer: Agentic AI Evaluations - San Francisco, CA - HUD

    HUD San Francisco, CA

    1 week ago

    Job Description:

    A technology startup seeks a research engineer to build evaluation environments for AI systems.

    • Proficiency in Python, Docker, and Linux required

  • Only for registered members San Francisco, CA

    Roche's Research and Early Development organisations are seeking exceptional undergraduate student interns with a strong interest in applied AI engineering. The internship position is located in South San Francisco, CA. · Design and implement scalable evaluation pipelines for LLM ...

  • Only for registered members South San Francisco

    Genentech's Research and Early Development organisations at Genentech (gRED) and Pharma (pRED) have demonstrated how technologies accelerate R&D, · The new computational sciences Center of Excellence (CoE) is a strategic group whose goal is to harness this transformative power of ...

  • Only for registered members South San Francisco Full time

    Advances in AI, data, and computational sciences are transforming drug discovery and development. · Design and implement scalable evaluation pipelines for LLM-based agents across diverse use cases, · Build systems that go beyond PoCs by addressing edge cases, failure modes, versi ...

  • Only for registered members San Francisco, CA

    We're looking for our first AI Engineer focused on agents and evaluation—a foundational hire who will shape how we build, measure, and scale intelligent systems. · The Opportunity: Design the Playbook for High-Performance AI Agents · We're tackling one of the hardest—and most imp ...

  • Only for registered members San Francisco

    We are looking for an AI Engineer focused on agents and evaluation to shape how we build, measure, and scale intelligent systems. You will design the playbook for high-performance AI agents and create task evaluations that matter. · ...

  • Only for registered members San Francisco

    We're seeking a Director/Senior Director of UX Product Design to lead the vision, strategy, and execution for the Agentforce Development Platform. You'll define how builders and developers author and evaluate agents, shaping an ecosystem of tooling, frameworks, and experiences th ...

  • Only for registered members San Francisco

    You'll be joining a research-driven AI company building reinforcement learning simulation environments for agents and large language models, with a focus on post-training, evaluation, and scalable supervision. · ...

  • Only for registered members San Francisco, CA

    We're seeking a Director/Senior Director of UX Product Design to lead the vision, strategy, and execution for the Agentforce Development Platform. You'll define how builders and developers author and evaluate agents, shaping an ecosystem of tooling, frameworks, and experiences that make innovation bot ...

  • Only for registered members San Francisco

    You'll join a research-driven AI company building reinforcement learning simulation environments for agents and large language models. · Experience in applied research in reinforcement learning, LLM post-training, or agent-based systems · Strong understanding of transformer architec ...

  • Only for registered members San Francisco, CA

    We are hiring an Education Engineer to help developers and agent builders learn how to build, evaluate and continuously refine agents using LangSmith. You will turn cutting-edge applied AI techniques into accessible educational content across multiple formats - from online course ...

  • Only for registered members San Francisco, CA

    We are hiring an Education Engineer to help developers and agent builders learn how to build, evaluate and continuously refine agents using LangSmith. · ...

  • Only for registered members San Francisco $165,000 - $185,000 (USD)

    We build the foundation for agent engineering in the real world, helping developers move from prototypes to production-ready AI agents that teams can rely on. · LangChain is hiring an Education Engineer to help developers and agent builders learn how to build, evaluate and con ...

  • Only for registered members San Francisco, CA

    An opportunity to work from first principles on agentic architectures that power production systems used by professionals globally. · ...

  • Only for registered members San Francisco, California, United States

    Turing is looking for a Head of Data Quality, RL Environments to build and lead the quality function for all reinforcement learning (RL) environment and trajectory data used to train and evaluate models at frontier AI labs. · ...

  • Only for registered members San Francisco Full time $155,000 - $185,000 (USD)

    About the role: LangChain is hiring an Education Engineer to help developers and agent builders learn how to build · elegant educational content across multiple formats. You will collaborate with our Applied AI and engineering teams to create high-quality learning experiences th ...

  • Only for registered members San Francisco, CA

    LangChain is hiring an Education Engineer to help developers and agent builders learn how to build, evaluate, and continuously refine agents using LangSmith. · Collaborate with LangChain engineers to develop educational content that teaches agentic evaluation, monitoring, and refinem ...

  • Only for registered members San Francisco

    We're looking for a technical leader who understands that building an AI agent is only 10% of the work—the real engineering challenge is measuring it. We need a thought leader who can solve the "problem nobody talks about": evaluating non-deterministic agentic systems in producti ...

  • Only for registered members San Francisco

    We're hiring ML/AI engineers to turn that context into reliable, governed agent behavior used by enterprise customers. · This role focuses on model engineering, MLOps, and trust/visibility metrics as part of our core engine work. · ...

  • Only for registered members San Francisco

    You will work directly with the founders on advancing the product roadmap and AgentHub's core evaluation and simulation capabilities. · You'll have significant ownership and will keep up to date with the latest state-of-the-art methodologies and techniques across areas like agent ...

  • Senior AI

    2 weeks ago

    Only for registered members San Francisco

    We are looking for a Senior AI/ML Engineer (Applied) to join our on-site Research & Intelligence team in San Francisco. This role is for an experienced engineer who enjoys working close to research problems but is primarily motivated by building, shipping, and improving production s ...

  • Only for registered members San Francisco, California, United States

    We're partnered with a fast-growing applied AI company building production grade AI systems that are deployed directly into customer environments. · ...
