Economist – AI Model Evaluation

Only for registered members United States

4 days ago

Default job background
About the Project · Pareto.AI is a human data-collection platform connecting leading AI researchers with trusted industry experts to collaborate on AI alignment, safety, and training projects. We are partnering with a frontier AI lab to evaluate an AI model's ability to replicate ...
Lorem ipsum dolor sit amet
, consectetur adipiscing elit. Nullam tempor vestibulum ex, eget consequat quam pellentesque vel. Etiam congue sed elit nec elementum. Morbi diam metus, rutrum id eleifend ac, porta in lectus. Sed scelerisque a augue et ornare.

Donec lacinia nisi nec odio ultricies imperdiet.
Morbi a dolor dignissim, tristique enim et, semper lacus. Morbi laoreet sollicitudin justo eget eleifend. Donec felis augue, accumsan in dapibus a, mattis sed ligula.

Vestibulum at aliquet erat. Curabitur rhoncus urna vitae quam suscipit
, at pulvinar turpis lacinia. Mauris magna sem, dignissim finibus fermentum ac, placerat at ex. Pellentesque aliquet, lorem pulvinar mollis ornare, orci turpis fermentum urna, non ullamcorper ligula enim a ante. Duis dolor est, consectetur ut sapien lacinia, tempor condimentum purus.
Get full access

Access all high-level positions and get the job of your dreams.



Similar jobs

  • Work in company

    Model Evaluator

    Only for registered members

    Candidate will be familiar with RAG and agentic systems, including end-to-end query-to-response workflowsExperience in fine-tuning language models, tracing, analysis, and debugging. · ...

    Austin

    1 month ago

  • Work in company

    Model Evaluators

    Only for registered members

    We are seeking Model Evaluators with experience in fine-tuning language models, tracing analysis and debugging. Strong proficiency in Python and Jupyter Notebooks is required. · familiar with RAG and agentic systems including end-to-end query-to-response workflows · experience i ...

    Austin

    1 month ago

  • Work in company

    Model Evaluators

    Only for registered members

    We are seeking Model Evaluators with experience in fine-tuning language models, tracing, analysis, and debugging. Strong proficiency in Python and Jupyter Notebooks is required. · ...

    Austin, TX

    1 month ago

  • Work in company

    LLM Model Evaluator

    Only for registered members

    Evaluate LLM outputs for accuracy relevance bias and safety Design test cases and evaluation benchmarks for AI models Analyze model behavior and document findings Collaborate with ML engineers and data scientists to improve models Provide structured feedback to enhance model perf ...

    Austin

    1 month ago

  • Work in company

    Senior Engineer – Model Evaluation

    Only for registered members

    · ...

    Morrisville

    1 month ago

  • Work in company

    Research Engineer, Model Evaluations

    Only for registered members

    We want AI to be safe and beneficial for our users and for society as a whole.Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. · Design novel evaluation methodologies to ...

    New York $300,000 - $405,000 (USD)

    1 month ago

  • Work in company

    Research Engineer, Model Evaluations

    Only for registered members

    About Anthropic's mission is to create reliable, interpretable and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. · ...

    San Francisco $300,000 - $405,000 (USD)

    3 weeks ago

  • Work in company

    Payment Model Evaluation Analyst

    Only for registered members

    The Payment Model Evaluation role supports the Payment Model Evaluation process, which assesses merchant relationships in alignment with Global Merchant Acquisition policies within a rapidly evolving Fintech and payments environment. · Supporting the interpretation and applicatio ...

    Phoenix

    1 month ago

  • Work in company

    LLM-GenAI Model Evaluator

    Only for registered members

    We are seeking an LLM-GenAI Model Evaluator position for our team in Austin tx & Sunnyvale. The ideal candidate will have strong understanding of LLMs, generative AI, and transformer-based architectures. · Strong understanding of LLMs, generative AI, and transformer-based archite ...

    Austin

    1 month ago

  • Work in company Remote job

    AI Model Response Evaluation

    Only for registered members

    We are looking for someone with natural English writing ability who can evaluate AI model responses and grade responses on a Likert scale. · ...

    $7 - $10 (USD) per hour

    3 weeks ago

  • Work in company Remote job

    Model Evaluation QA Lead

    Only for registered members

    As Model Evaluation QA Lead you'll ensure model quality assurance across Deepgram's AI pipeline. · Design automated evaluation frameworks · Integrate model quality gates into release pipelines · ...

    3 weeks ago

  • Work in company

    LLM-GenAI Model Evaluator

    Only for registered members

    Immediate need for a talented LLM-GenAI Model Evaluator. · Evaluate LLM-GenAI models. · ...

    Austin

    1 month ago

  • Work in company

    Senior Engineer – Model Evaluation

    Only for registered members

    Design and implement evaluation pipelines for large generative AI models. · ...

    Morrisville, NC

    1 month ago

  • Work in company

    Payment Model Evaluation Analyst

    Only for registered members

    Job summary · The Payment Model Evaluation role is an individual contributor position that supports the Payment Model Evaluation (PME) process assessing merchant relationships in alignment with Global Merchant Acquisition policies within a rapidly evolving Fintech and payments en ...

    Phoenix, AZ

    1 month ago

  • Work in company

    AI Model Evaluation Specialist

    Only for registered members

    · Key Responsibilities: · Perform scoring and qualitative evaluations of · LLM-generated responses across multiple use cases. · Develop and maintain scoring guidelines and rubrics to · ensure consistency and objectivity. · Collaborate with data scientists, product managers, and ...

    New York, New York, United States

    1 week ago

  • Work in company

    New Model Evaluation Engineer

    Only for registered members

    ...

    Irvine $70,000 - $93,000 (USD)

    1 month ago

  • Work in company

    Model Evaluation QA Lead

    Only for registered members

    Company Overview · Deepgram is the leading platform underpinning the emerging trillion-dollar Voice AI economy, providing real-time APIs for speech-to-text (STT), text-to-speech (TTS), and building production-grade voice agents at scale. More than 200,000 developers and 1,300+ or ...

    USA | Remote

    2 days ago

  • Work in company

    LLM-GenAI Model Evaluator

    Only for registered members

    We are looking for an LLM-GenAI Model Evaluator position. · ...

    Austin, TX

    1 month ago

  • Work in company

    Remote AI model evaluator

    Only for registered members

    We are a forward thinking company looking for a Remote Financial AnalystEvaluate the output of AI systems for accuracy and performanceTrain and enhance AI models used for financial analysisCollaborate with team members on project selection and goalsEvaluate the output of AI syste ...

    North Liberty

    1 month ago

  • Work in company Remote job

    GenAI model finetuning and evaluation

    Only for registered members

    The freelancer will work on finding if user responses are on or off topic and rehearsed or not. The scripts are provided but need optimization for better accuracy. · ...

    $250 - $0 (USD) budget

    2 weeks ago