Manager, Agent Evaluation - Washington
3 weeks ago

Job description
, consectetur adipiscing elit. Nullam tempor vestibulum ex, eget consequat quam pellentesque vel. Etiam congue sed elit nec elementum. Morbi diam metus, rutrum id eleifend ac, porta in lectus. Sed scelerisque a augue et ornare.
Donec lacinia nisi nec odio ultricies imperdiet.
Morbi a dolor dignissim, tristique enim et, semper lacus. Morbi laoreet sollicitudin justo eget eleifend. Donec felis augue, accumsan in dapibus a, mattis sed ligula.
Vestibulum at aliquet erat. Curabitur rhoncus urna vitae quam suscipit
, at pulvinar turpis lacinia. Mauris magna sem, dignissim finibus fermentum ac, placerat at ex. Pellentesque aliquet, lorem pulvinar mollis ornare, orci turpis fermentum urna, non ullamcorper ligula enim a ante. Duis dolor est, consectetur ut sapien lacinia, tempor condimentum purus.
Access all high-level positions and get the job of your dreams.
Similar jobs
The Agent Evaluation team is responsible for testing whether AI agents return the correct and expected responses. · Lead and grow a team focused on agent and model evaluation · Define the strategy, roadmap, and standards for agent testing and validation · ...
3 weeks ago
The Agent Evaluation team is responsible for testing whether AI agents return the correct and expected responses. We build the framework, metrics, and test cases that validate agent behavior, · accuracy, · and reliability before release. Our goal is to ensure agents perform consi ...
3 weeks ago
The Agent Evaluation team is responsible for testing whether AI agents return the correct and expected responses. · Lead and grow a team focused on agent and model evaluation · Define the strategy, roadmap, and standards for agent testing and validation · ...
2 weeks ago
hackajob is collaborating with Comcast to connect them with exceptional tech professionals for this role. · Make your mark at Comcast -- a Fortune 30 global media and technology company. From the connectivity and platforms we provide, to the content and experiences we create, we ...
2 weeks ago
The Agent Evaluation team is responsible for testing whether AI agents return the correct and expected responses. We build the framework, metrics, and test cases that validate agent behavior, accuracy, and reliability before release. · ...
3 weeks ago
Make your mark at Comcast -- a Fortune 30 global media and technology company. From the connectivity and platforms we provide, to the content and experiences we create, we reach hundreds of millions of customers, viewers, and guests worldwide. Become part of our award-winning tec ...
1 week ago
Requirements: · BS/MS in Computer Science, Statistics, Electrical/Computer Engineering, Mathematics, or a related field. · 7+ years of professional work experience after BS/MS applying machine learning to real-world problems, and crafting scalable and effective ML/AI solutions. · ...
6 days ago
Design AI systems for the Company. · ...
1 month ago
Position: AI Engineer with Agent Building Team · Location: Remote · Duration: 1 Years · Python development with AI/ML frameworks (CrewAI, LangChain, or · equivalent) · Hands-on experience building production agents with Bedrock AgentCore SDK , Strands · SDK/Crew AI · Strong evalu ...
1 week ago
Senior Machine Learning Engineer to design and build AI systems for the Company. · ...
1 month ago
Seeking a Senior Agentic AI / LLM Platform Engineer to architect and deliver a multi-agent AI system on Google Cloud Platform.Architect the TruAgent shell runtime with multi-layer memory services (semantic, episodic, procedural) and knowledge graph integration · Build Model Build ...
2 weeks ago
About Us · At LangChain, our mission is to make intelligent agents ubiquitous. We build the foundation for agent engineering in the real world, helping developers move from prototypes to production-ready AI agents that teams can rely on. We began as widely adopted open-source too ...
4 days ago
Position Title* AI Developer Position Responsibilities · AI Developer · Onsite 1-2 times a month in Washington,D.C. · developing some AI security tools · Need developer with good experience with python and azure cloud environment · Good understanding on gen AI and open AI models ...
2 weeks ago
Our client is seeking a GenAI & MLOps Engineer / Architect (Azure) to lead the design and implementation of cutting-edge AI solutions. · ...
1 month ago
We are seeking a GenAI & MLOps Engineer / Architect (Azure) to lead the design and implementation of cutting-edge AI solutions. · This role focuses on building Azure-based GenAI applications, including RAG pipelines and sophisticated agentic AI systems. ...
1 month ago
This role involves designing RAG services using Azure AI/Search, · building MCP tools/servers for integration security scanners, · developing agentic AI solutions using AutoGen/CrewAI/Agno, · & creating production-grade chatbots with prompt management.We are looking · a developer ...
2 weeks ago
Dexian stands at the forefront of Talent + Technology solutions with a presence spanning more than 70 locations worldwide and a team exceeding 10, · 000 professionals. · ...
2 weeks ago
Requirements: · BS/MS in Computer Science, Statistics, Electrical/Computer Engineering, Mathematics, or a related field. · 4+ years of experience after BS/MS. · Strong software engineering fundamentals role involves research as well as writing production-grade code. · Knowledge o ...
6 days ago
Apply · Title: Lead Google Cloud Platform Engineer: AI Platforms & Development · Location - Remote in USA · Role Summary: As a Lead Google Cloud Platform Engineer, you are the resident expert and engineering heart of our AI & Agentic capabilities. You serve as the lead specialist ...
21 hours ago
Job summary · Azure Al foundry Engineer/Developer cum lead with python strong. · Qualifications5-10+ years in software engineering with strong Python (async, typing, packaging, testing) and API development. · ...
1 month ago