AI Evaluation Engineer - New York, NY
2 weeks ago

Job description
, consectetur adipiscing elit. Nullam tempor vestibulum ex, eget consequat quam pellentesque vel. Etiam congue sed elit nec elementum. Morbi diam metus, rutrum id eleifend ac, porta in lectus. Sed scelerisque a augue et ornare.
Donec lacinia nisi nec odio ultricies imperdiet.
Morbi a dolor dignissim, tristique enim et, semper lacus. Morbi laoreet sollicitudin justo eget eleifend. Donec felis augue, accumsan in dapibus a, mattis sed ligula.
Vestibulum at aliquet erat. Curabitur rhoncus urna vitae quam suscipit
, at pulvinar turpis lacinia. Mauris magna sem, dignissim finibus fermentum ac, placerat at ex. Pellentesque aliquet, lorem pulvinar mollis ornare, orci turpis fermentum urna, non ullamcorper ligula enim a ante. Duis dolor est, consectetur ut sapien lacinia, tempor condimentum purus.
Access all high-level positions and get the job of your dreams.
Similar jobs
+Job summary · We're looking for a Platform Engineer to define and operationalize quality for agentic systems that power Antimetal's investigation and automation engine. · +QualificationsPrior experience designing evaluation systems where ground truth is noisy, high-volume, and h ...
1 month ago
This is a zero to one opportunity to build the evaluation stack: pipelines, metrics, tooling, and infrastructure from scratch, and shape how teams across platform, product, and research iterate confidently. · ...
1 month ago
We want AI to be safe and beneficial for our users and for society as a whole.Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. · Design novel evaluation methodologies to ...
1 month ago
We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, · semantic search,RAG,and agents. · ...
2 weeks ago
We are scaling intelligence to serve humanity. We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. · ...
3 weeks ago
We are looking for a Senior Infrastructure Engineer (5-10yrs experience) to bridge the gap between internal developer velocity and global product delivery by owning the entire Commit-to-Compute lifecycle.You will build the high-performance CI/CD pipelines and self-service tooling ...
1 month ago
We are looking for a Staff Machine Learning Engineer to join our Waymo AI Foundations team. · ...
1 month ago
Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building the Waymo Driver—The World's Most Experienced Driver—to improve access to mobili ...
1 week ago
Staff Developer Relations Engineer, Cloud Platform Evaluations Team
Only for registered members
In accordance with Washington state law, we are highlighting our comprehensive benefits package, which is available to all eligible US based employees. Benefits for this role include: · Health, dental, vision, life, disability insurance · Retirement Benefits: 401(k) with company ...
5 days ago
Staff Developer Relations Engineer, Cloud Platform Evaluations Team
Only for registered members
In accordance with Washington state law, we are highlighting our comprehensive benefits package, which is available to all eligible US based employees. Benefits for this role include: · Health, dental, vision, life, disability insurance · Retirement Benefits: 401(k) with company ...
4 days ago
We are looking for AI Evaluation & Data Engineering Specialists to design curate and operationalize datasets and evaluation frameworks for AI product performance assessment.This role involves working with large language models LLMs human raters and automation tools to measure mod ...
2 months ago
This Series A startup is pioneering a new category in AI-native infrastructure automation.Their platform doesn't just surface dashboards, · it actively investigates, · reasons over, · and automates responses across production systems, · giving engineers time back to build rather ...
2 weeks ago
Meta Superintelligence Labs is seeking a Research Scientist Manager to lead a high-impact team working on the next generation of AI Assistants powered by frontier-scale foundation models. · ...
1 month ago
We're looking for a Director of Engineering to lead our AI Agent Evaluation and Experimentation Platform team.You'll own the end-to-end evaluation and experimentation lifecycle for both agentic systems and traditional ML models. · ...
3 weeks ago
This role is for a Research Engineer focusing on Chemical, Biological, Radiological and Nuclear (CBRN) AI Responsibility at Google DeepMind. · We use our technologies for widespread public benefit and scientific discovery, and collaborate with others on critical challenges, ensur ...
1 month ago
We're Hiring: AI/ML Engineer (TAIRC | Remote) · About TAIRC · The AI Research Center (TAIRC) is a nonprofit organization dedicated to advancing trustworthy, safe, · and beneficial artificial intelligence.Through foundational and applied research, · TAGIC develops AI systems, gove ...
3 weeks ago
A respected, multidisciplinary engineering consultancy in New York City is seeking an experienced Senior Structural Engineer (PE Licensed in NY) to join their established Existing Buildings group. This role is ideal for an engineer who thrives on solving complex problems in the b ...
1 week ago
Job summary · In this role you will shape the technical foundations that dozens of product and research teams rely on every day, while building and growing a high-performing engineering team that operates at the intersection of infrastructure applied ML developer experience. · ...
3 weeks ago
ALTANOVA is now actively hiring a Senior Energy Project Engineer for a full-time position in our New York office. Alternatively located in our Paris, France office or remotely (GMT-5) for highly qualified individuals. · About Altanova · Altanova is a leading multidisciplinary con ...
1 week ago
This hourly remote contractor role involves reviewing AI-generated engineering solutions and generating expert Mechanical Engineering content. · Create detailed prompts in various topics to guide AI learning. · Evaluate and rank AI responses to enhance model accuracy. · ...
1 month ago