Software Engineer, Applied Evals - San Francisco

Only for registered members San Francisco, United States

1 week ago

About the team · Applied Evals defines what good looks like for safe, advanced AI systems. We turn complex, high-value workflows into clear, reproducible signals that guide model training and product quality. Our work bridges frontier customers and models, ensuring improvements s ...

Job description

Lorem ipsum dolor sit amet
, consectetur adipiscing elit. Nullam tempor vestibulum ex, eget consequat quam pellentesque vel. Etiam congue sed elit nec elementum. Morbi diam metus, rutrum id eleifend ac, porta in lectus. Sed scelerisque a augue et ornare.

Donec lacinia nisi nec odio ultricies imperdiet.
Morbi a dolor dignissim, tristique enim et, semper lacus. Morbi laoreet sollicitudin justo eget eleifend. Donec felis augue, accumsan in dapibus a, mattis sed ligula.

Vestibulum at aliquet erat. Curabitur rhoncus urna vitae quam suscipit
, at pulvinar turpis lacinia. Mauris magna sem, dignissim finibus fermentum ac, placerat at ex. Pellentesque aliquet, lorem pulvinar mollis ornare, orci turpis fermentum urna, non ullamcorper ligula enim a ante. Duis dolor est, consectetur ut sapien lacinia, tempor condimentum purus.

Get full access

Access all high-level positions and get the job of your dreams.

Similar jobs

Work in company

Software Engineer, Applied Evals

Only for registered members

We're hiring product-minded engineers to design and build evals and harnesses that capture real-world quality for advanced AI systems. ...

San Francisco, CA

1 month ago

Work in company

Software Engineer, Applied Evals

Only for registered members

We're hiring product-minded engineers to design and build evals and harnesses that capture real-world quality for advanced AI systems. · ...

San Francisco $255,000 - $325,000 (USD)

1 month ago

Work in company

Lead LLM Evals Engineer

Only for registered members

I'm hiring a Lead LLM Evals Engineer to join an early-stage physical AI startup building systems with general physical ability to experiment, · engineer,and manufacture anything. · → Build eval harnesses for agentic LLM systems in complex workflows · → · → Turn eval failures int ...

San Francisco

1 month ago

Work in company

Lead LLM Evals Engineer

Only for registered members

I'm hiring a Lead LLM Evals Engineer to join an early-stage physical AI startup building systems with general physical ability to experiment, engineer, and manufacture anything. · ...

San Francisco, CA

1 month ago

Work in company Remote job

Senior AI Scientist

Only for registered members

We are on a mission to create the most helpful search engine in the world—one that prioritizes transparency, privacy, and user control. · Define and own what means for search-augmented and agentic AI systems by designing evaluation frameworks that measure real-world quality, reli ...

San Francisco, CA

1 week ago

Work in company

Senior AI Scientist

Only for registered members

We re building a team of innovators problem solvers and visionaries who are passionate about shaping the future of AI and technology. · , ...

San Francisco $200,000 - $260,000 (USD)

1 week ago

Work in company

DevRel Engineer

Only for registered members

We re looking for a DevRel Engineer to help grow activate and inspire our developer community. · ...

San Francisco, CA

1 month ago

Work in company

DevRel Engineer

Only for registered members

We're looking for a DevRel Engineer to help grow, activate, and inspire our developer community.As a DevRel Engineer, you'll be the public voice of Braintrust for developers—creating content, engaging with the community, and bringing insights back to our product and engineering t ...

San Francisco

1 month ago

Work in company

Senior AI Scientist

Only for registered members

We're building a team of innovators who are passionate about shaping the future of AI and technology. · We combine advanced AI models with user-first principles. · ...

San Francisco $160,000 - $200,000 (USD) Full time

2 weeks ago

Work in company

DevRel Engineer

Only for registered members

We're looking for a DevRel Engineer to help grow, activate, and inspire our developer community. · Create content engaging with the community. · Bring insights back to our product engineering teams. · ...

San Francisco

1 week ago

Work in company

Open Role

Only for registered members

HUD (YC W25) está desarrollando evaluaciones agénticas para Agentes de Uso de Computadora (CUAs) que navegan por Internet. Nuestro marco evaluativo CUA es la primera herramienta completa de evaluación para CUAs. · Building new evaluations/eval environments for HUD's CUA evaluatio ...

San Francisco

1 week ago

Work in company

Backend Software Engineer

Only for registered members

The Support Automation team at OpenAI scales the organization by applying cutting-edge AI models to real-world challenges. · We're looking for a Backend Software Engineer with experience working in ML/LLM-heavy domains to help design and build an evals infrastructure that measure ...

San Francisco $255,000 - $405,000 (USD)

1 month ago

Work in company

Backend Software Engineer

Only for registered members

+ Design eval pipelines that are reliable reproducible and extendable · + Build the infrastructure for continuous eval monitoring frameworks regression drift monitoring building robust golden datasets along with feedback loops that ultimately strengthen support automation. · + Co ...

San Francisco $230,000 - $385,000 (USD)

5 days ago

Work in company

Research Engineer

Only for registered members

This role sits at the intersection of AI research and engineering. · You will anticipate upcoming data needs at leading AI labs, · translate research signals into executable data projects, · and drive early pilots with teams working at the frontier.Anticipate future human data re ...

San Francisco

1 week ago

Work in company

Backend Software Engineer

Only for registered members

We're passionate about crafting products that serve those around us, blending rapid prototyping with a focus on long-term quality and reliability. · In this role, you will:• Design eval pipelines that are reliable, reproducible, and extendable · • Build robust systems and backend ...

San Francisco

1 week ago

Work in company

Director, Agentforce Engineering

Only for registered members

We're looking for a technical leader who understands that building an AI agent is only 10% of the work—the real engineering challenge is measuring it. We need a thought leader who can solve the "problem nobody talks about": evaluating non-deterministic agentic systems in producti ...

San Francisco

1 week ago

Work in company

ML Research Engineer Intern – Dynamo Guard

Only for registered members

At Dynamo AI we are looking for a ML Research Engineer Intern to work at the intersection of machine learning research and production systems. · ...

San Francisco, CA

2 days ago

Work in company

ML Engineer

Only for registered members

Sesame believes in a future where computers are lifelike - with the ability to see, hear, and collaborate with us in ways that feel natural and human. · ...

San Francisco

1 month ago

Work in company

Director, Agentforce Testing Center Engineering

Only for registered members

We're Salesforce, the Customer Company, inspiring the future of business with AI+ Data +CRM. · Leading with our core values, we help companies across every industry blaze new trails and connect with customers in a whole new way. And, we empower you to be a Trailblazer, too — driv ...

San Francisco, CA

3 weeks ago

Work in company

Director, Agentforce Testing Center Engineering

Only for registered members

We are looking for a technical leader who understands that building an AI agent is only 10% of the work—the real engineering challenge is measuring it. · ...

San Francisco

3 weeks ago

Software Engineer, Applied Evals - San Francisco

Job description

Similar jobs

Software Engineer, Applied Evals

Software Engineer, Applied Evals

Lead LLM Evals Engineer

Lead LLM Evals Engineer

Senior AI Scientist

Senior AI Scientist

DevRel Engineer

DevRel Engineer

Senior AI Scientist

DevRel Engineer

Open Role

Backend Software Engineer

Backend Software Engineer

Research Engineer

Backend Software Engineer

Director, Agentforce Engineering

ML Research Engineer Intern – Dynamo Guard

ML Engineer

Director, Agentforce Testing Center Engineering

Director, Agentforce Testing Center Engineering

Directory

for Recruiters

Information