AI EVAL Engineer - Bellevue, WA
4 weeks ago

Job summary
Akkodis is seeking anAI EVAL Engineer
for a
Contract
with a client in Bellevue, WA(Remote). Candidates must have strong Python programming skills and hands-on experience with AI evaluation frameworks and metrics in a Linux environment.
Candidates will design, implement and automate evaluation test suites to measure LLM accuracy; define robust evaluation metrics such as precision/recall BLEU/ROUGE F1 hallucination rate throughput cost-per-output; build datasets ground-truth references benchmarks maintain versioned test cases for consistent repeatable scoring.
Job description
, consectetur adipiscing elit. Nullam tempor vestibulum ex, eget consequat quam pellentesque vel. Etiam congue sed elit nec elementum. Morbi diam metus, rutrum id eleifend ac, porta in lectus. Sed scelerisque a augue et ornare.
Donec lacinia nisi nec odio ultricies imperdiet.
Morbi a dolor dignissim, tristique enim et, semper lacus. Morbi laoreet sollicitudin justo eget eleifend. Donec felis augue, accumsan in dapibus a, mattis sed ligula.
Vestibulum at aliquet erat. Curabitur rhoncus urna vitae quam suscipit
, at pulvinar turpis lacinia. Mauris magna sem, dignissim finibus fermentum ac, placerat at ex. Pellentesque aliquet, lorem pulvinar mollis ornare, orci turpis fermentum urna, non ullamcorper ligula enim a ante. Duis dolor est, consectetur ut sapien lacinia, tempor condimentum purus.
Access all high-level positions and get the job of your dreams.
Similar jobs
Director, Agentforce Testing Center Engineering
3 weeks ago
We are looking for a technical leader who understands that building an AI agent is only 10% of the work-the real engineering challenge is measuring it. We need a thought leader who can solve the problem nobody talks about: evaluating non-deterministic agentic systems in productio ...
Technical Program Manager, Quality
1 week ago
We are seeking a Technical Program Manager to bridge the gap between LLM research and a polished user-ready product. · Project Orchestration: Manage the end-to-end lifecycle of LLM projects. · Metric Definition & Design: Transform subjective user feedback into objective metrics a ...
Principal Software Engineer, AI
6 days ago
We want to reshape security and empower every user, customer, and developer with a security cloud that protects them with end-to-end, simplified solutions. · The team's mission is to significantly improve analyst productivity, lower the barrier to advanced threat hunting, and ena ...