Economist – AI Model Evaluation
4 days ago

Job description
, consectetur adipiscing elit. Nullam tempor vestibulum ex, eget consequat quam pellentesque vel. Etiam congue sed elit nec elementum. Morbi diam metus, rutrum id eleifend ac, porta in lectus. Sed scelerisque a augue et ornare.
Donec lacinia nisi nec odio ultricies imperdiet.
Morbi a dolor dignissim, tristique enim et, semper lacus. Morbi laoreet sollicitudin justo eget eleifend. Donec felis augue, accumsan in dapibus a, mattis sed ligula.
Vestibulum at aliquet erat. Curabitur rhoncus urna vitae quam suscipit
, at pulvinar turpis lacinia. Mauris magna sem, dignissim finibus fermentum ac, placerat at ex. Pellentesque aliquet, lorem pulvinar mollis ornare, orci turpis fermentum urna, non ullamcorper ligula enim a ante. Duis dolor est, consectetur ut sapien lacinia, tempor condimentum purus.
Access all high-level positions and get the job of your dreams.
Similar jobs
Candidate will be familiar with RAG and agentic systems, including end-to-end query-to-response workflowsExperience in fine-tuning language models, tracing, analysis, and debugging. · ...
1 month ago
We are seeking Model Evaluators with experience in fine-tuning language models, tracing analysis and debugging. Strong proficiency in Python and Jupyter Notebooks is required. · familiar with RAG and agentic systems including end-to-end query-to-response workflows · experience i ...
1 month ago
We are seeking Model Evaluators with experience in fine-tuning language models, tracing, analysis, and debugging. Strong proficiency in Python and Jupyter Notebooks is required. · ...
1 month ago
Evaluate LLM outputs for accuracy relevance bias and safety Design test cases and evaluation benchmarks for AI models Analyze model behavior and document findings Collaborate with ML engineers and data scientists to improve models Provide structured feedback to enhance model perf ...
1 month ago
· ...
1 month ago
We want AI to be safe and beneficial for our users and for society as a whole.Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. · Design novel evaluation methodologies to ...
1 month ago
About Anthropic's mission is to create reliable, interpretable and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. · ...
3 weeks ago
The Payment Model Evaluation role supports the Payment Model Evaluation process, which assesses merchant relationships in alignment with Global Merchant Acquisition policies within a rapidly evolving Fintech and payments environment. · Supporting the interpretation and applicatio ...
1 month ago
We are seeking an LLM-GenAI Model Evaluator position for our team in Austin tx & Sunnyvale. The ideal candidate will have strong understanding of LLMs, generative AI, and transformer-based architectures. · Strong understanding of LLMs, generative AI, and transformer-based archite ...
1 month ago
We are looking for someone with natural English writing ability who can evaluate AI model responses and grade responses on a Likert scale. · ...
3 weeks ago
As Model Evaluation QA Lead you'll ensure model quality assurance across Deepgram's AI pipeline. · Design automated evaluation frameworks · Integrate model quality gates into release pipelines · ...
3 weeks ago
Immediate need for a talented LLM-GenAI Model Evaluator. · Evaluate LLM-GenAI models. · ...
1 month ago
Design and implement evaluation pipelines for large generative AI models. · ...
1 month ago
Job summary · The Payment Model Evaluation role is an individual contributor position that supports the Payment Model Evaluation (PME) process assessing merchant relationships in alignment with Global Merchant Acquisition policies within a rapidly evolving Fintech and payments en ...
1 month ago
· Key Responsibilities: · Perform scoring and qualitative evaluations of · LLM-generated responses across multiple use cases. · Develop and maintain scoring guidelines and rubrics to · ensure consistency and objectivity. · Collaborate with data scientists, product managers, and ...
1 week ago
...
1 month ago
Company Overview · Deepgram is the leading platform underpinning the emerging trillion-dollar Voice AI economy, providing real-time APIs for speech-to-text (STT), text-to-speech (TTS), and building production-grade voice agents at scale. More than 200,000 developers and 1,300+ or ...
2 days ago
We are looking for an LLM-GenAI Model Evaluator position. · ...
1 month ago
We are a forward thinking company looking for a Remote Financial AnalystEvaluate the output of AI systems for accuracy and performanceTrain and enhance AI models used for financial analysisCollaborate with team members on project selection and goalsEvaluate the output of AI syste ...
1 month ago
The freelancer will work on finding if user responses are on or off topic and rehearsed or not. The scripts are provided but need optimization for better accuracy. · ...
2 weeks ago