Jobs
>
San Francisco

    Machine Learning Systems Engineer, Finetuning Infrastructure - San Francisco, CA, United States - Anthropic Limited

    Default job background
    Description
    You want to build the cutting-edge systems that train AI models like Claude.

    You're excited to work at the frontier of machine learning, implementing and improving advanced techniques to create ever more capable, reliable and steerable AI.

    As an ML Engineer on our Finetuning Infrastructure team, you'll be responsible for the critical algorithms and infrastructure that our researchers depend on to train models.

    Your work will directly enable breakthroughs in AI capabilities and safety.

    You'll focus obsessively on improving the performance, robustness, and usability of these systems so our research can progress as quickly as possible.

    You're energized by the challenge of supporting and empowering our research team in the mission to build beneficial AI systems.

    About the role
    Our finetuning researchers train our production Claude models, and internal research models, using RLHF and other related methods.

    Your job will be to build, maintain, and improve the algorithms and systems that these researchers use to train models.

    You'll be responsible for improving the speed, reliability, and ease-of-use of these systems.
    You may be a good fit if you:
    Have significant software engineering experience
    Are results-oriented, with a bias towards flexibility and impact
    Pick up slack, even if it goes outside your job description
    Enjoy pair programming (we love to pair)
    Want to learn more about machine learning research
    Like working on systems and tools that make other people more productive
    Care about the societal impacts of your work
    Strong candidates may also have experience with:
    High performance, large scale distributed systems
    Kubernetes
    Python
    Machine learning
    Implementing LLM finetuning algorithms, such as RLHF


    Representative projects:
    Profiling our reinforcement learning pipeline to find opportunities for improvement
    Building a system that regularly launches training jobs in a test environment so that we can quickly detect problems in the training pipeline
    Making changes to our finetuning systems so they work on new model architectures
    Building instrumentation to detect and eliminate Python GIL contention in our training code
    Diagnosing why training runs have started slowing down after some number of steps, and fixing it
    Implementing a stable, fast version of a new training algorithm proposed by a researcher

    Deadline to apply:
    None. Applications will be reviewed on a rolling basis.
    #J-18808-Ljbffr


  • Karkidi San Francisco, United States

    Anthropics Finetuning Infrastructure team builds the systems that support our large-scale distributed finetuning runs. As manager of the team, youll support a team of machine learning and distributed systems experts, with the goal of making these systems highly efficient, support ...


  • Django Rest Framework San Francisco, United States

    Cohere is hiring a · Remote Software Engineer Finetuning Infrastructure · Who are we?Our mission is to scale intelligence to serve humanity. We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like ...


  • Karkidi San Francisco, United States

    Anthropic's Finetuning Infrastructure team builds the systems that support our large-scale distributed finetuning runs. As manager of the team, you'll support a team of machine learning and distributed systems experts, with the goal of making these systems highly efficient, suppo ...


  • Inflection AI San Francisco, United States

    What We're Building · As Inflection embarks on a new stage of growth, we are focusing on collaborating with commercial partners to adapt and fine-tune our cutting-edge models for their unique business requirements. Our accomplishments in developing, aligning, and deploying state ...

  • Anthropic Limited

    ML Engineer

    2 weeks ago


    Anthropic Limited San Francisco, United States

    As a ML Engineer on our Solutions Architect team, you will drive the adoption of frontier AI by developing bespoke and finetuned LLM solutions for top enterprises. You'll leverage your customer-facing engineering experience and technical skills to architect innovative solutions t ...


  • Inflection AI San Francisco, United States

    What We're Building · As Inflection embarks on a new stage of growth, we are focusing on collaborating with commercial partners to adapt and fine-tune our cutting-edge models for their unique business requirements. Our accomplishments in developing, aligning, and deploying state ...

  • Anthropic

    ML Engineer

    1 week ago


    Anthropic San Francisco, United States

    As a ML Engineer on our Solutions Architect team, you will drive the adoption of frontier AI by developing bespoke and finetuned LLM solutions for top enterprises. You'll leverage your customer-facing engineering experience and technical skills to architect innovative solutions t ...


  • Inflection AI San Francisco, United States

    What We're Building · As Inflection embarks on a new stage of growth, we are focusing on collaborating with commercial partners to adapt and fine-tune our cutting-edge models for their unique business requirements. Our accomplishments in developing, aligning, and deploying state- ...


  • Tata Consultancy Services San Francisco, United States

    JobTitle · Machine Learning Engineer/AI Engineer · RelevantExperience(inyrs) · 10+ · WorkLocation (State, City and Zip) · San Francisco · Technical/FunctionalSkills · Azure, Python, AIML, Kubernetes, Devops · Roles& Responsibilities · Responsibilities: · Hands on experi ...

  • Twelve Labs

    Software Engineer

    2 weeks ago


    Twelve Labs San Francisco, United States

    Who we are · We're a fast-moving, diverse team pushing the frontiers of artificial intelligence. At Twelve Labs, our mission is to help developers build programs that can see, listen, and understand the world as we do by bringing the world's most powerful video understanding inf ...

  • Venchr

    Principal Engineer

    1 day ago


    Venchr San Francisco, United States

    2 days ago · Be among the first 25 applicants · This range is provided by Venchr. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more. · Base pay range · $150,000.00/yr - $300,000.00/yr · Direct message the job poster from Venchr ...


  • Anthropic Limited San Francisco, United States

    About Anthropic · Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and busin ...

  • Venchr

    Head of Engineering

    2 weeks ago


    Venchr San Francisco, United States

    Save this job with your existing LinkedIn profile, or create a new one. · Save this job with your existing LinkedIn profile, or create a new one. · Your job seeking activity is only visible to you. · Email · Welcome back · Sign in to save · Head of Engineering · at · Vench ...


  • Addison Group Alameda, United States

    Senior Cloud Security Engineer · Alameda, CA · $200K-$220K/year · Visa Transfer for those that have 1 year or more remaining on an H1B Visa · Job Description: · The Senior Cloud Security Engineer will be a member of the Information Security & Compliance team. This role will ...


  • Brillio San Ramon, United States

    Technical SpecialistPrimary SkillsAzure - Database, Azure - Developer Tools, Azure - Identity, Azure - Integration, Azure - Migration, Azure Administration (IaaS), Azure Analytics Services, Azure Data Catalog, Azure Data Platform (DaaS), Azure Event Hub, Azure Platform Administra ...

  • Brillio

    Technical Specialist

    14 hours ago


    Brillio San Ramon, United States

    About Brillio: Brillio is one of the fastest growing digital technology service providers and a partner of choice for many Fortune 1000 companies seeking to turn disruption into a competitive advantage through innovative digital adoption. Brillio, renowned for its world-class pro ...


  • Addison Group Alameda, United States

    Job Description · Job DescriptionSenior Cloud Security Engineer · Alameda, CA · $200K-$220K/year · Visa Transfer for those that have 1 year or more remaining on an H1B Visa · Job Description: · The Senior Cloud Security Engineer will be a member of the Information Security & Com ...