Economist – AI Model Evaluation

United States

4 days ago

About the Project · Pareto.AI is a human data-collection platform connecting leading AI researchers with trusted industry experts to collaborate on AI alignment, safety, and training projects. We are partnering with a frontier AI lab to evaluate an AI model's ability to replicate ...

Similar jobs

Work in company

Model Evaluator

Candidate will be familiar with RAG and agentic systems, including end-to-end query-to-response workflows. Experience in fine-tuning language models, tracing, analysis, and debugging. · ...

Austin

1 month ago

Work in company

Model Evaluators

We are seeking Model Evaluators with experience in fine-tuning language models, tracing, analysis, and debugging. Strong proficiency in Python and Jupyter Notebooks is required. · Familiar with RAG and agentic systems, including end-to-end query-to-response workflows · experience i ...

Austin

1 month ago

Work in company

Model Evaluators

We are seeking Model Evaluators with experience in fine-tuning language models, tracing, analysis, and debugging. Strong proficiency in Python and Jupyter Notebooks is required. · ...

Austin, TX

1 month ago

Work in company

LLM Model Evaluator

Evaluate LLM outputs for accuracy, relevance, bias, and safety · Design test cases and evaluation benchmarks for AI models · Analyze model behavior and document findings · Collaborate with ML engineers and data scientists to improve models · Provide structured feedback to enhance model perf ...

Austin

1 month ago

Work in company

Senior Engineer – Model Evaluation

· ...

Morrisville

1 month ago

Work in company

Research Engineer, Model Evaluations

We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. · Design novel evaluation methodologies to ...

New York $300,000 - $405,000 (USD)

1 month ago

Work in company

Research Engineer, Model Evaluations

Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. · ...

San Francisco $300,000 - $405,000 (USD)

3 weeks ago

Work in company

Payment Model Evaluation Analyst

The Payment Model Evaluation role supports the Payment Model Evaluation process, which assesses merchant relationships in alignment with Global Merchant Acquisition policies within a rapidly evolving Fintech and payments environment. · Supporting the interpretation and applicatio ...

Phoenix

1 month ago

Work in company

LLM-GenAI Model Evaluator

We are seeking an LLM-GenAI Model Evaluator for our team in Austin, TX and Sunnyvale. The ideal candidate will have a strong understanding of LLMs, generative AI, and transformer-based architectures. · Strong understanding of LLMs, generative AI, and transformer-based archite ...

Austin

1 month ago

Work in company Remote job

AI Model Response Evaluation

We are looking for someone with natural English writing ability who can evaluate AI model responses and grade responses on a Likert scale. · ...

$7 - $10 (USD) per hour

3 weeks ago

Work in company Remote job

Model Evaluation QA Lead

As Model Evaluation QA Lead you'll ensure model quality assurance across Deepgram's AI pipeline. · Design automated evaluation frameworks · Integrate model quality gates into release pipelines · ...

3 weeks ago

Work in company

LLM-GenAI Model Evaluator

Immediate need for a talented LLM-GenAI Model Evaluator. · Evaluate LLM-GenAI models. · ...

Austin

1 month ago

Work in company

Senior Engineer – Model Evaluation

Design and implement evaluation pipelines for large generative AI models. · ...

Morrisville, NC

1 month ago

Work in company

Payment Model Evaluation Analyst

Job summary · The Payment Model Evaluation role is an individual contributor position that supports the Payment Model Evaluation (PME) process assessing merchant relationships in alignment with Global Merchant Acquisition policies within a rapidly evolving Fintech and payments en ...

Phoenix, AZ

1 month ago

Work in company

AI Model Evaluation Specialist

Key Responsibilities: · Perform scoring and qualitative evaluations of LLM-generated responses across multiple use cases. · Develop and maintain scoring guidelines and rubrics to ensure consistency and objectivity. · Collaborate with data scientists, product managers, and ...

New York, New York, United States

1 week ago

Work in company

New Model Evaluation Engineer

...

Irvine $70,000 - $93,000 (USD)

1 month ago

Work in company

Model Evaluation QA Lead

Company Overview · Deepgram is the leading platform underpinning the emerging trillion-dollar Voice AI economy, providing real-time APIs for speech-to-text (STT), text-to-speech (TTS), and building production-grade voice agents at scale. More than 200,000 developers and 1,300+ or ...

USA | Remote

2 days ago

Work in company

LLM-GenAI Model Evaluator

We are looking for an LLM-GenAI Model Evaluator. · ...

Austin, TX

1 month ago

Work in company

Remote AI model evaluator

We are a forward-thinking company looking for a Remote Financial Analyst · Evaluate the output of AI systems for accuracy and performance · Train and enhance AI models used for financial analysis · Collaborate with team members on project selection and goals · Evaluate the output of AI syste ...

North Liberty

1 month ago

Work in company Remote job

GenAI model finetuning and evaluation

The freelancer will work on determining whether user responses are on or off topic and whether they are rehearsed. The scripts are provided but need optimization for better accuracy. · ...

$250 - $0 (USD) budget

2 weeks ago
