Staff Machine Learning Research Scientist, LLM Evals - Seattle
1 month ago

Similar jobs
Job summary · As a Technical Program Manager for model evaluations, you'll own end-to-end coordination of our evaluation ecosystem, building a feedback loop from shaping eval strategy during early model development through launch execution. · Responsibilities · Standardize how eva ...
1 month ago
We're looking for an AI Research Engineer who deeply understands LLMs from the inside: someone who doesn't just call APIs, but tinkers, probes, breaks, fine-tunes and rebuilds models to understand how and why they behave the way they do. · Design, experiment with, and optimize LLM ...
3 weeks ago
We're looking for an AI Researcher who deeply understands LLMs from the inside: someone who doesn't just call APIs, but tinkers, probes, breaks, fine-tunes, builds and rebuilds models to understand how and why they behave the way they do. · Deep hands-on experience working with L ...
2 weeks ago
We're looking for an AI Researcher who deeply understands LLMs from the inside — someone who doesn't just call APIs, but tinkers, probes, breaks, fine-tunes, builds and rebuilds models to understand how and why they behave the way they do. · Design, experiment with, and optimize ...
2 weeks ago
Working at Atlassian · Atlassians have flexibility in where they work – whether in an office, from home, or a combination of the two. · ...
1 month ago
Our Mission · Casium is building the fastest, most comprehensive business immigration solution. We bring together human expertise and AI innovation to help companies secure top global talent with speed, clarity, and compliance. · Talent is global. But the immigration system wasn' ...
1 week ago
About Feather · Feather is building AI agents that do real work for enterprises. · Not chatbots. Not IVR. Not copilots. · Autonomous agents that can reason, take actions, communicate across channels (voice, text, email), and complete end-to-end business workflows. · We operate at ...
1 week ago
We're hiring engineers to build open-source libraries that customers use to observe, understand, and optimize how AI operates inside their production applications. · Build elegant, idiomatic, and resilient SDKs that power Braintrust's LLM evaluation and AI observability platform. ...
2 weeks ago
Atlassian is seeking a Senior Principal Machine Learning Engineer to join our GenAI Platform organization, focusing on the quality and reliability of Rovo Chat. · ...
1 month ago
About Feather · Feather is building AI agents that do real work for enterprises. · Not chatbots. Not IVR. Not copilots. · Autonomous agents that can reason, take actions, communicate across channels (voice, text, email), and complete end-to-end business workflows. · We operate at ...
1 week ago
We're looking for bold, entrepreneurial talent ready to help build something extraordinary — and reshape the future of building products distribution. · QXO is a publicly traded company founded by Brad Jacobs with the goal of building the market-leading company in the building pr ...
3 days ago
Scale works with the industry's leading AI labs to provide high quality data and accelerate progress in GenAI research. We are looking for Research Scientists and Research Engineers with expertise in LLM post-training (SFT, RLHF, reward modeling). This role will focus on optimizi ...
16 hours ago
We don't just ship features; we build digital-first products that matter. As a Senior Forward Deploy Engineer, you'll join a team that values deep craft, cross-functional collaboration, and relentless focus on quality. · ...
2 weeks ago
We are looking for a Software Engineer to design and implement core platform capabilities for AI/ML and AI Agents in SingleStore Cloud. · You'll work on services that enable model/tool orchestration (e.g. MCP style tool discovery and execution), agent workflows, retrieval pipelin ...
1 month ago
About Anthropic · Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. · As a member of the Applied AI team at Anthropic, you will be a technical Product Engineer focused on becoming ...
1 month ago
Lead AI - Software Engineer · At AT&T we reimagine the communications and technologies that connect the world. Our Consumer Technology eXperience team delivers innovative and reliable technology solutions to power differentiated, simplified customer experiences. Bring your bold i ...
1 week ago
Robots & Pencils is seeking an outcome oriented Forward Deployed Senior Software Engineer to partner with strategic clients on high-impact agentic AI applications. Robots and Pencils builds digital-first products that matter. · ...
1 month ago
Overview: · We're looking for bold, entrepreneurial talent ready to help build something extraordinary — and reshape the future of building products distribution. · QXO is a publicly traded company founded by Brad Jacobs with the goal of building the market-leading company in the ...
2 days ago
Robots & Pencils is seeking an outcome oriented Forward Deployed Senior Software Engineer to partner with strategic clients on high-impact agentic AI applications. You'll embed directly with customers throughout the product lifecycle, working hands-on from design to production ...
18 hours ago
We're building AI that connects people with others who've achieved similar goals. Our platform helps users find mentors, peers and guides based on shared experiences—not just keywords. · You'll be the technical owner of our entire stack. · Audit and document the current stack; id ...
2 weeks ago