Software Engineer, Applied Evals - San Francisco

Only for registered members San Francisco, United States

1 week ago

Default job background
About the team · Applied Evals defines what good looks like for safe, advanced AI systems. We turn complex, high-value workflows into clear, reproducible signals that guide model training and product quality. Our work bridges frontier customers and models, ensuring improvements s ...
Lorem ipsum dolor sit amet
, consectetur adipiscing elit. Nullam tempor vestibulum ex, eget consequat quam pellentesque vel. Etiam congue sed elit nec elementum. Morbi diam metus, rutrum id eleifend ac, porta in lectus. Sed scelerisque a augue et ornare.

Donec lacinia nisi nec odio ultricies imperdiet.
Morbi a dolor dignissim, tristique enim et, semper lacus. Morbi laoreet sollicitudin justo eget eleifend. Donec felis augue, accumsan in dapibus a, mattis sed ligula.

Vestibulum at aliquet erat. Curabitur rhoncus urna vitae quam suscipit
, at pulvinar turpis lacinia. Mauris magna sem, dignissim finibus fermentum ac, placerat at ex. Pellentesque aliquet, lorem pulvinar mollis ornare, orci turpis fermentum urna, non ullamcorper ligula enim a ante. Duis dolor est, consectetur ut sapien lacinia, tempor condimentum purus.
Get full access

Access all high-level positions and get the job of your dreams.



Similar jobs

  • Work in company

    Software Engineer, Applied Evals

    Only for registered members

    We're hiring product-minded engineers to design and build evals and harnesses that capture real-world quality for advanced AI systems. ...

    San Francisco, CA

    1 month ago

  • Work in company

    Software Engineer, Applied Evals

    Only for registered members

    We're hiring product-minded engineers to design and build evals and harnesses that capture real-world quality for advanced AI systems. · ...

    San Francisco $255,000 - $325,000 (USD)

    1 month ago

  • Work in company

    Lead LLM Evals Engineer

    Only for registered members

    I'm hiring a Lead LLM Evals Engineer to join an early-stage physical AI startup building systems with general physical ability to experiment, · engineer,and manufacture anything. · → Build eval harnesses for agentic LLM systems in complex workflows · → · → Turn eval failures int ...

    San Francisco

    1 month ago

  • Work in company

    Lead LLM Evals Engineer

    Only for registered members

    I'm hiring a Lead LLM Evals Engineer to join an early-stage physical AI startup building systems with general physical ability to experiment, engineer, and manufacture anything. · ...

    San Francisco, CA

    1 month ago

  • Work in company Remote job

    Senior AI Scientist

    Only for registered members

    We are on a mission to create the most helpful search engine in the world—one that prioritizes transparency, privacy, and user control. · Define and own what means for search-augmented and agentic AI systems by designing evaluation frameworks that measure real-world quality, reli ...

    San Francisco, CA

    1 week ago

  • Work in company

    Senior AI Scientist

    Only for registered members

    We re building a team of innovators problem solvers and visionaries who are passionate about shaping the future of AI and technology. · , ...

    San Francisco $200,000 - $260,000 (USD)

    1 week ago

  • Work in company

    DevRel Engineer

    Only for registered members

    We re looking for a DevRel Engineer to help grow activate and inspire our developer community. · ...

    San Francisco, CA

    1 month ago

  • Work in company

    DevRel Engineer

    Only for registered members

    We're looking for a DevRel Engineer to help grow, activate, and inspire our developer community.As a DevRel Engineer, you'll be the public voice of Braintrust for developers—creating content, engaging with the community, and bringing insights back to our product and engineering t ...

    San Francisco

    1 month ago

  • Work in company

    Senior AI Scientist

    Only for registered members

    We're building a team of innovators who are passionate about shaping the future of AI and technology. · We combine advanced AI models with user-first principles. · ...

    San Francisco $160,000 - $200,000 (USD) Full time

    2 weeks ago

  • Work in company

    DevRel Engineer

    Only for registered members

    We're looking for a DevRel Engineer to help grow, activate, and inspire our developer community. · Create content engaging with the community. · Bring insights back to our product engineering teams. · ...

    San Francisco

    1 week ago

  • Work in company

    Open Role

    Only for registered members

    HUD (YC W25) está desarrollando evaluaciones agénticas para Agentes de Uso de Computadora (CUAs) que navegan por Internet. Nuestro marco evaluativo CUA es la primera herramienta completa de evaluación para CUAs. · Building new evaluations/eval environments for HUD's CUA evaluatio ...

    San Francisco

    1 week ago

  • Work in company

    Backend Software Engineer

    Only for registered members

    The Support Automation team at OpenAI scales the organization by applying cutting-edge AI models to real-world challenges. · We're looking for a Backend Software Engineer with experience working in ML/LLM-heavy domains to help design and build an evals infrastructure that measure ...

    San Francisco $255,000 - $405,000 (USD)

    1 month ago

  • Work in company

    Backend Software Engineer

    Only for registered members

    + Design eval pipelines that are reliable reproducible and extendable · + Build the infrastructure for continuous eval monitoring frameworks regression drift monitoring building robust golden datasets along with feedback loops that ultimately strengthen support automation. · + Co ...

    San Francisco $230,000 - $385,000 (USD)

    5 days ago

  • Work in company

    Research Engineer

    Only for registered members

    This role sits at the intersection of AI research and engineering. · You will anticipate upcoming data needs at leading AI labs, · translate research signals into executable data projects, · and drive early pilots with teams working at the frontier.Anticipate future human data re ...

    San Francisco

    1 week ago

  • Work in company

    Backend Software Engineer

    Only for registered members

    We're passionate about crafting products that serve those around us, blending rapid prototyping with a focus on long-term quality and reliability. · In this role, you will:• Design eval pipelines that are reliable, reproducible, and extendable · • Build robust systems and backend ...

    San Francisco

    1 week ago

  • Work in company

    Director, Agentforce Engineering

    Only for registered members

    We're looking for a technical leader who understands that building an AI agent is only 10% of the work—the real engineering challenge is measuring it. We need a thought leader who can solve the "problem nobody talks about": evaluating non-deterministic agentic systems in producti ...

    San Francisco

    1 week ago

  • Work in company

    ML Research Engineer Intern – Dynamo Guard

    Only for registered members

    At Dynamo AI we are looking for a ML Research Engineer Intern to work at the intersection of machine learning research and production systems. · ...

    San Francisco, CA

    2 days ago

  • Work in company

    ML Engineer

    Only for registered members

    Sesame believes in a future where computers are lifelike - with the ability to see, hear, and collaborate with us in ways that feel natural and human. · ...

    San Francisco

    1 month ago

  • Work in company

    Director, Agentforce Testing Center Engineering

    Only for registered members

    We're Salesforce, the Customer Company, inspiring the future of business with AI+ Data +CRM. · Leading with our core values, we help companies across every industry blaze new trails and connect with customers in a whole new way. And, we empower you to be a Trailblazer, too — driv ...

    San Francisco, CA

    3 weeks ago

  • Work in company

    Director, Agentforce Testing Center Engineering

    Only for registered members

    We are looking for a technical leader who understands that building an AI agent is only 10% of the work—the real engineering challenge is measuring it. · ...

    San Francisco

    3 weeks ago