Jobs
>
San Francisco

    Research Scientist, Societal Impacts - San Francisco, United States - Anthropic

    Default job background
    Description

    About the role:


    As a societal impacts research scientist, you'll devise new ways to measure and assess Anthropic systems for societally or policy-relevant traits and capabilities, and will discuss what we discover in our research publications and policy campaigns.


    Strong candidates will have a track record of running experiments relating to machine learning systems, working in a fast-paced startup environment, and an eagerness to develop their own technical skills so they can best interface with our systems.

    The ideal candidate will enjoy a mixture of running experiments and doing research, developing new tools and evaluation suites, and evangelizing this and other work to key stakeholders and communities.


    Responsibilities:

    • Designing, proposing and running experiments on our systems. These will range from developing basic probes for different capabilities in models (such as assessing a given system for fairness traits); to designing comparative studies to understand how well these systems complement humans (for example, by analyzing how interacting with an AI system alters human behavior); to developing comprehensive suites of tests we can run complicated systems through (such as developing ways to evaluate the broad capabilities displayed by modern language models).
    • Partnering closely with our other researchers to fully understand, analyze and evaluate state-of-the-art research across a broad range of capabilities and safety research.
    • Interfacing with, providing feedback on and publicly communicating about our internal technical infrastructure and tools, with a goal of them better supporting the societal impact analysis of AI systems.
    • Developing tools, evaluation suites and tests that enable understanding of AI systems for policymakers, academia, civil society and more.
    • Working with the policy and other research teams to share these tools to the above stakeholders and incorporating their feedback and requests.
    • Sharing your insights from your work by contributing to Anthropic research publications, giving presentations to external groups, and partnering with our policy team to articulate insights to governments.
    • Generating net-new insights about the potential societal impact of systems being developed by Anthropic, and using this understanding to inform Anthropic strategy, research and policy campaigns.

    You may be a good fit if:

    • You have experience assessing machine learning models for unknown traits within known capabilities (for instance, assessing the outputs of a generative model to discover what parts of a dataset are being magnified and minimized).
    • You have a track record of using technical infrastructure to interface effectively with machine learning models
    • You enjoy and are skilled at writing up and communicating your results, even when they're null.
    • You're comfortable creating your own research agenda and executing against it.
    • You find it exciting to partner with colleagues on 'big science' projects, where the whole company works to build one AI artefact and then analyze it.
    • You have a prior background in data science, or another technical field which involves interfacing with technical artefacts and generating insights about them.
    • You're passionate about communicating the insights from your research to external stakeholders, ranging from academics to policymakers to other research labs.
    • You have an interest in AI policy; this role offers the chance to partner with Anthropic policy to turn your research insights into actionable recommendations for governments.

    Some examples of our work:

    • Predictability and Surprise in Large Generative Models
    • Red Teaming Language Models to

    Reduce Harms:
    Methods, Scaling Behaviors, and Lessons Learned

    • The Capacity for Moral Self-Correction in Large Language Models
    • Opportunities and Risks of LLMs for Scalable Deliberation with Polis
    • Towards Measuring the Representation of Subjective Global Opinions in Language Models
    • Challenges in evaluating AI systems
    • Collective Constitutional AI: Aligning a Language Model with Public Input
    • Unpredictable Abilities Emerging from Large AI Models [MIT Tech Review]
    • Language models might be able to self-correct biases-if you ask them [Wired]
    • Collective Constitutional AI [NYTimes]
    • Red team data - used to make open source models, e.g., Lama-2 more harmless
    • GlobalOpinionQA - measuring whose global values LMs are aligned with.

    Deadline to apply:
    None. Applications will be reviewed on a rolling basis.


    For this role, we are open to a 6-month residency at Anthropic with the expectation that residents convert to full-time if they do well in their residency.



  • Anthropic San Francisco, CA, United States

    As a societal impacts research scientist, you'll devise new ways to measure and assess Anthropic systems for societally or policy-relevant traits and capabilities, and will discuss what we discover in our research publications and policy campaigns. · Strong candidates will have ...


  • Karkidi San Francisco, CA, United States

    As a societal impacts research scientist, you'll devise new ways to measure and assess Anthropic systems for societally or policy-relevant traits and capabilities, and will discuss what we discover in our research publications and policy campaigns. · Strong candidates will have ...


  • Anthropic Limited San Francisco, CA, United States

    As a societal impacts research scientist, you'll devise new ways to measure and assess Anthropic systems for societally or policy-relevant traits and capabilities, and will discuss what we discover in our research publications and policy campaigns. · Strong candidates will have ...


  • BKF Engineers Oakland, United States

    BKF is searching for a skilled **Accounts Receivable Specialist** to join our accounting team to help collect, process, track, and record payments in an accurate, efficient, and timely manner. The accounts receivable specialist will have both a day-to-day and ongoing impact on fi ...


  • Knewin San Francisco, United States

    Please Note: This role requires the ability to work on site 3 days per week per company policy . · The annual salary range for this role is $140,000-160,000 in San Francisco. · The World Economic Forum, committed to improving the state of the world, is the international organizat ...

  • Acne Studios

    Client Advisor

    2 weeks ago


    Acne Studios San Francisco, United States

    Acne Studios is a progressive luxury house and through collaboration and curiosity, we set high aspirations and strive for excellence. We value everyone's contribution and embrace feedback to develop ourselves and others. We aim to minimise our environmental impact across all our ...

  • Anthropic Limited

    Performance Engineer

    2 weeks ago


    Anthropic Limited San Francisco, United States

    Running machine learning (ML) algorithms at our scale often requires solving novel systems problems. As a Performance Engineer, you'll be responsible for identifying these problems, and then developing systems that optimize the throughput and robustness of our largest distributed ...

  • Karkidi

    Scientist Full-time

    2 weeks ago


    Karkidi San Francisco, CA, United States Full time

    As a societal impacts research scientist, you'll devise new ways to measure and assess Anthropic systems for societally or policy-relevant traits and capabilities, and will discuss what we discover in our research publications and policy campaigns. · Strong candidates will have ...


  • US Tech Solutions San Francisco, United States

    Job Description: · The Analyst will proactively identify emerging ethical and socio-technical abuse trends, investigate machine learning (ML) Fairness incidents, perform deep dives, and identify patterns. Youre able to surface actionable insight to mitigate and combat ethical AI ...

  • Genesis10

    Project Manager I

    2 weeks ago


    Genesis10 San Francisco, United States

    Our client, the world's leading search engine and technology company, is seeking a Project Manager. This is a 6 month + contract hybrid remote position located in San Francisco, CA. · Summary: · s an analyst, your responsibility is to partner with AI product development teams to ...

  • Genesis10

    Project Manager I

    3 weeks ago


    Genesis10 San Francisco, United States

    Our client, the world's leading search engine and technology company, is seeking a Project Manager. This is a 6 month + contract hybrid remote position located in San Francisco, CA. · Summary: s an analyst, your responsibility is to partner with AI product development teams to a ...

  • Genesis Corp./New Journey AI LLC

    Project Manager I

    3 weeks ago


    Genesis Corp./New Journey AI LLC San Francisco, United States

    Our client, the world's leading search engine and technology company, is seeking a Project Manager. This is a 6 month + contract hybrid remote position located in San Francisco, CA. · Summary: · As an analyst, your responsibility is to partner with AI product development teams ...

  • Acne Studios

    Client Advisor

    1 week ago


    Acne Studios San Francisco, United States

    Acne Studios is a progressive luxury house and through collaboration and curiosity, we set high aspirations and strive for excellence. We value everyone's contribution and embrace feedback to develop ourselves and others. We aim to minimise our environmental impact across all our ...


  • Anthropic Limited San Francisco, CA, United States

    You want to build the cutting-edge systems that train AI models like Claude. You're excited to work at the frontier of machine learning, implementing and improving advanced techniques to create ever more capable, reliable and steerable AI. As an ML Engineer on our Finetuning Infr ...

  • Acne Studios

    Client Advisor

    6 days ago


    Acne Studios San Francisco, United States

    Acne Studios is a progressive luxury house and through collaboration and curiosity, we set high aspirations and strive for excellence. We value everyones contribution and embrace feedback to develop ourselves and others. We aim to minimise our environmental impact across all our ...


  • Humanloop, Inc. San Francisco, CA, United States

    We are hiring our first marketer to help us build our marketing foundation to scale our growth. You will work directly with Humanloop's CEO to set the marketing strategy and then execute it. We are seeing strong early traction, but have a lot of work to build the foundation for o ...

  • ICONMA

    Product Fairness

    2 weeks ago


    ICONMA San Francisco, United States

    Product Fairness & Safety Analyst · Location: San Francisco, CA/Hybrid · Duration: 12 months · Description: · Project Overview: · The Product Fairness Analyst will proactively identify emerging ethical and socio-technical abuse trends, investigate machine learning (Client) F ...


  • OpenAI San Francisco, CA, United States

    Frontier AI models have the potential to benefit all of humanity, but also pose increasingly severe risks. To ensure that AI promotes positive change, we have dedicated a team to help us best prepare for the development of increasingly capable frontier AI models. This team, Prepa ...


  • VRChat Inc San Francisco, CA, United States

    Join the VRChat Team · VRChat offers a first-of-its-kind, game-changing platform that provides an endless collection of social VR experiences and gives the power of creation to its robust community. With over 250,000 worlds and growing, VRChat's vision is to allow users to bring ...


  • Altana AI San Francisco, United States

    Altana provides the world's only dynamic, intelligent map of the global supply chain - the Altana Atlas - using AI and machine learning models to connect with and learn from massive sets of public and private data. Through the Atlas, companies and governments can understand the d ...