- Designing, proposing and running experiments on our systems. These will range from developing basic probes for different capabilities in models (such as assessing a given system for fairness traits); to designing comparative studies to understand how well these systems complement humans (for example, by analyzing how interacting with an AI system alters human behavior); to developing comprehensive suites of tests we can run complicated systems through (such as developing ways to evaluate the broad capabilities displayed by modern language models).
- Partnering closely with our other researchers to fully understand, analyze and evaluate state-of-the-art research across a broad range of capabilities and safety research.
- Interfacing with, providing feedback on and publicly communicating about our internal technical infrastructure and tools, with a goal of them better supporting the societal impact analysis of AI systems.
- Developing tools, evaluation suites and tests that enable understanding of AI systems for policymakers, academia, civil society and more.
- Working with the policy and other research teams to share these tools to the above stakeholders and incorporating their feedback and requests.
- Sharing your insights from your work by contributing to Anthropic research publications, giving presentations to external groups, and partnering with our policy team to articulate insights to governments.
- Generating net-new insights about the potential societal impact of systems being developed by Anthropic, and using this understanding to inform Anthropic strategy, research and policy campaigns.
- You have experience assessing machine learning models for unknown traits within known capabilities (for instance, assessing the outputs of a generative model to discover what parts of a dataset are being magnified and minimized).
- You have a track record of using technical infrastructure to interface effectively with machine learning models
- You enjoy and are skilled at writing up and communicating your results, even when they're null.
- You're comfortable creating your own research agenda and executing against it.
- You find it exciting to partner with colleagues on 'big science' projects, where the whole company works to build one AI artefact and then analyze it.
- You have a prior background in data science, or another technical field which involves interfacing with technical artefacts and generating insights about them.
- You're passionate about communicating the insights from your research to external stakeholders, ranging from academics to policymakers to other research labs.
- You have an interest in AI policy; this role offers the chance to partner with Anthropic policy to turn your research insights into actionable recommendations for governments.
- Predictability and Surprise in Large Generative Models
- Red Teaming Language Models to
- The Capacity for Moral Self-Correction in Large Language Models
- Opportunities and Risks of LLMs for Scalable Deliberation with Polis
- Towards Measuring the Representation of Subjective Global Opinions in Language Models
- Challenges in evaluating AI systems
- Collective Constitutional AI: Aligning a Language Model with Public Input
- Unpredictable Abilities Emerging from Large AI Models [MIT Tech Review]
- Language models might be able to self-correct biases-if you ask them [Wired]
- Collective Constitutional AI [NYTimes]
- Red team data - used to make open source models, e.g., Lama-2 more harmless
- GlobalOpinionQA - measuring whose global values LMs are aligned with.
-
Research Scientist, Societal Impacts
2 weeks ago
Anthropic San Francisco, CA, United StatesAs a societal impacts research scientist, you'll devise new ways to measure and assess Anthropic systems for societally or policy-relevant traits and capabilities, and will discuss what we discover in our research publications and policy campaigns. · Strong candidates will have ...
-
Research Scientist, Societal Impacts
2 weeks ago
Karkidi San Francisco, CA, United StatesAs a societal impacts research scientist, you'll devise new ways to measure and assess Anthropic systems for societally or policy-relevant traits and capabilities, and will discuss what we discover in our research publications and policy campaigns. · Strong candidates will have ...
-
Research Scientist, Societal Impacts
2 weeks ago
Anthropic Limited San Francisco, CA, United StatesAs a societal impacts research scientist, you'll devise new ways to measure and assess Anthropic systems for societally or policy-relevant traits and capabilities, and will discuss what we discover in our research publications and policy campaigns. · Strong candidates will have ...
-
Accounts Receivable Specialist
4 days ago
BKF Engineers Oakland, United StatesBKF is searching for a skilled **Accounts Receivable Specialist** to join our accounting team to help collect, process, track, and record payments in an accurate, efficient, and timely manner. The accounts receivable specialist will have both a day-to-day and ongoing impact on fi ...
-
Knewin San Francisco, United StatesPlease Note: This role requires the ability to work on site 3 days per week per company policy . · The annual salary range for this role is $140,000-160,000 in San Francisco. · The World Economic Forum, committed to improving the state of the world, is the international organizat ...
-
Client Advisor
2 weeks ago
Acne Studios San Francisco, United StatesAcne Studios is a progressive luxury house and through collaboration and curiosity, we set high aspirations and strive for excellence. We value everyone's contribution and embrace feedback to develop ourselves and others. We aim to minimise our environmental impact across all our ...
-
Performance Engineer
2 weeks ago
Anthropic Limited San Francisco, United StatesRunning machine learning (ML) algorithms at our scale often requires solving novel systems problems. As a Performance Engineer, you'll be responsible for identifying these problems, and then developing systems that optimize the throughput and robustness of our largest distributed ...
-
Scientist Full-time
2 weeks ago
Karkidi San Francisco, CA, United States Full timeAs a societal impacts research scientist, you'll devise new ways to measure and assess Anthropic systems for societally or policy-relevant traits and capabilities, and will discuss what we discover in our research publications and policy campaigns. · Strong candidates will have ...
-
Web Content/Policy Analyst
3 weeks ago
US Tech Solutions San Francisco, United StatesJob Description: · The Analyst will proactively identify emerging ethical and socio-technical abuse trends, investigate machine learning (ML) Fairness incidents, perform deep dives, and identify patterns. Youre able to surface actionable insight to mitigate and combat ethical AI ...
-
Project Manager I
2 weeks ago
Genesis10 San Francisco, United StatesOur client, the world's leading search engine and technology company, is seeking a Project Manager. This is a 6 month + contract hybrid remote position located in San Francisco, CA. · Summary: · s an analyst, your responsibility is to partner with AI product development teams to ...
-
Project Manager I
3 weeks ago
Genesis10 San Francisco, United StatesOur client, the world's leading search engine and technology company, is seeking a Project Manager. This is a 6 month + contract hybrid remote position located in San Francisco, CA. · Summary: s an analyst, your responsibility is to partner with AI product development teams to a ...
-
Project Manager I
3 weeks ago
Genesis Corp./New Journey AI LLC San Francisco, United StatesOur client, the world's leading search engine and technology company, is seeking a Project Manager. This is a 6 month + contract hybrid remote position located in San Francisco, CA. · Summary: · As an analyst, your responsibility is to partner with AI product development teams ...
-
Client Advisor
1 week ago
Acne Studios San Francisco, United StatesAcne Studios is a progressive luxury house and through collaboration and curiosity, we set high aspirations and strive for excellence. We value everyone's contribution and embrace feedback to develop ourselves and others. We aim to minimise our environmental impact across all our ...
-
Anthropic Limited San Francisco, CA, United StatesYou want to build the cutting-edge systems that train AI models like Claude. You're excited to work at the frontier of machine learning, implementing and improving advanced techniques to create ever more capable, reliable and steerable AI. As an ML Engineer on our Finetuning Infr ...
-
Client Advisor
6 days ago
Acne Studios San Francisco, United StatesAcne Studios is a progressive luxury house and through collaboration and curiosity, we set high aspirations and strive for excellence. We value everyones contribution and embrace feedback to develop ourselves and others. We aim to minimise our environmental impact across all our ...
-
Product Marketing Lead
3 weeks ago
Humanloop, Inc. San Francisco, CA, United StatesWe are hiring our first marketer to help us build our marketing foundation to scale our growth. You will work directly with Humanloop's CEO to set the marketing strategy and then execute it. We are seeing strong early traction, but have a lot of work to build the foundation for o ...
-
Product Fairness
2 weeks ago
ICONMA San Francisco, United StatesProduct Fairness & Safety Analyst · Location: San Francisco, CA/Hybrid · Duration: 12 months · Description: · Project Overview: · The Product Fairness Analyst will proactively identify emerging ethical and socio-technical abuse trends, investigate machine learning (Client) F ...
-
OpenAI San Francisco, CA, United StatesFrontier AI models have the potential to benefit all of humanity, but also pose increasingly severe risks. To ensure that AI promotes positive change, we have dedicated a team to help us best prepare for the development of increasingly capable frontier AI models. This team, Prepa ...
-
Director of Product, Creators
3 weeks ago
VRChat Inc San Francisco, CA, United StatesJoin the VRChat Team · VRChat offers a first-of-its-kind, game-changing platform that provides an endless collection of social VR experiences and gives the power of creation to its robust community. With over 250,000 worlds and growing, VRChat's vision is to allow users to bring ...
-
Principal Machine Learning Engineer
2 weeks ago
Altana AI San Francisco, United StatesAltana provides the world's only dynamic, intelligent map of the global supply chain - the Altana Atlas - using AI and machine learning models to connect with and learn from massive sets of public and private data. Through the Atlas, companies and governments can understand the d ...
Research Scientist, Societal Impacts - San Francisco, United States - Anthropic
Description
About the role:
As a societal impacts research scientist, you'll devise new ways to measure and assess Anthropic systems for societally or policy-relevant traits and capabilities, and will discuss what we discover in our research publications and policy campaigns.
Strong candidates will have a track record of running experiments relating to machine learning systems, working in a fast-paced startup environment, and an eagerness to develop their own technical skills so they can best interface with our systems.
The ideal candidate will enjoy a mixture of running experiments and doing research, developing new tools and evaluation suites, and evangelizing this and other work to key stakeholders and communities.
Responsibilities:
You may be a good fit if:
Some examples of our work:
Reduce Harms:
Methods, Scaling Behaviors, and Lessons Learned
Deadline to apply:
None. Applications will be reviewed on a rolling basis.
For this role, we are open to a 6-month residency at Anthropic with the expectation that residents convert to full-time if they do well in their residency.