Jobs
>
San Francisco

    Engineering Manager, AI Inference Systems - San Francisco, United States - OpenAI

    OpenAI
    OpenAI San Francisco, United States

    Found in: Lensa US 4 C2 - 3 days ago

    Default job background
    Description
    About the Team

    The Applied AI team safely brings OpenAI's technology to the world. We released ChatGPT, Plugins, DALL
    •E, and the APIs for GPT-4, GPT-3, embeddings, and fine-tuning. We also operate inference infrastructure at scale. There's a lot more on the immediate horizon.

    We seek to learn from deployment and distribute the benefits of AI, while ensuring that this powerful tool is used responsibly and safely. Safety is more important to us than unfettered growth.

    We serve end-users directly through ChatGPT, and serve developers through our APIs, which power product features that were never before possible.

    About the Role

    Model inference at OpenAI is powered through a single service we call our "Engine". The Engine wraps the PyTorch transformers which are GPT-4 and ChatGPT. We are looking for an engineering manager to help lead some of the critical work for this service and grow the team.

    In this role, you will:
    • Own substantial portions of our inference stack
    • Ensure we have the ability to run GPT-4, ChatGPT, and future models at increasingly high scale with increasing efficiency
    • Hire world-class AI systems engineers in one of the most competitive hiring markets
    • Coordinate the inference needs of OpenAI's teams and products
    • Create a diverse, equitable, and inclusive culture that makes all feel welcome while enabling radical candor and the challenging of group think
    You might thrive in this role if you:
    • Have 3+ years of experience in engineering management and 7+ years as an IC working with high scale distributed systems and ML systems.
    • Have experience with ML systems, particularly high scale distributed inference for modern LLMs.
    • Have experience with highly available, reliable, production grade systems at scale
    • Have familiarity with the latest AI research and working knowledge of how these systems are efficiently implemented
    • Care deeply about diversity, equity, and inclusion, and have a track record of building inclusive teams
    • Have experience closing extremely competitive candidates for your team, and the ability to craft and convey compelling visions of the future
    • Have a voracious and intrinsic desire to learn and fill in missing skills-and an equally strong talent for sharing learnings clearly and concisely with others
    • Are comfortable with ambiguity and rapidly changing conditions. You view changes as an opportunity to add structure and order when necessary
    As technical context: at the heart of our infrastructure is a large-scale deployment of GPU nodes running in dozens of Kubernetes clusters across regions. Some core technologies we build with include Python, PyTorch, CUDA, Triton, Redis, Infiniband, NCCL, NVLink

    This role is exclusively based in our San Francisco HQ. We offer relocation assistance to new employees.

    About OpenAI

    OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.

    We are an equal opportunity employer and do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or any other legally protected status.

    For US Based Candidates: Pursuant to the San Francisco Fair Chance Ordinance, we will consider qualified applicants with arrest and conviction records.

    We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link.

    OpenAI Global Applicant Privacy Policy

    At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.

  • Money Fit by DRS

    Engineering Manager, AI Inference Systems

    Found in: Jooble US O C2 - 3 days ago


    Money Fit by DRS San Francisco, CA, United States

    Location · San Francisco, CA · Type · Full time · Department · Applied AI · Compensation · $350K – $500K · In addition to the salary range listed above, total compensation also includes generous equity and benefits. · Medical, dental, and vision insurance for you and you ...

  • Genai Works

    Engineering Manager, AI Inference Systems

    Found in: Jooble US O C2 - 20 hours ago


    Genai Works San Francisco, CA, United States

    About the Team · The Applied AI team safely brings OpenAI's technology to the world. We released ChatGPT, Plugins, DALL · E, and the APIs for GPT-4, GPT-3, embeddings, and fine-tuning. We also operate inference infrastructure at scale. There's a lot more on the immediate horizon ...

  • Together AI

    Systems Research Engineer, Machine Learning Systems

    Found in: Lensa US 4 C2 - 4 days ago


    Together AI San Francisco, United States

    Role · As a Systems Research Engineer specialized in Machine Learning Systems, you will play a crucial role in researching and building the next generation AI platform at Together. Working closely with the modeling, algorithm, and engineering teams, you will design large-scale d ...

  • Together AI

    Systems Research Engineer, Machine Learning Systems

    Found in: Lensa US 4 C2 - 1 day ago


    Together AI San Francisco, United States

    Role · As a Systems Research Engineer specialized in Machine Learning Systems, you will play a crucial role in researching and building the next generation AI platform at Together. Working closely with the modeling, algorithm, and engineering teams, you will design large-scale d ...

  • Genai Works

    Platform ML Engineering Manager, Inference

    Found in: Jooble US O C2 - 1 day ago


    Genai Works San Francisco, CA, United States

    About the Team · The Platform ML team builds the ML side of our state-of-the-art internal training framework used to train our cutting-edge models. We work on distributed model execution as well as the interfaces and implementation for model code, training, and inference. · Our ...

  • Together AI

    Systems Research Engineer, Machine Learning Systems

    Found in: Lensa US 4 C2 - 2 days ago


    Together AI San Francisco, United States

    Role · As a Systems Research Engineer specialized in Machine Learning Systems, you will play a crucial role in researching and building the next generation AI platform at Together. Working closely with the modeling, algorithm, and engineering teams, you will design large-scale di ...

  • Genai Works

    Principal - Platform Engineering

    Found in: Jooble US O C2 - 1 day ago


    Genai Works San Francisco, CA, United States

    The Platform ML team builds the ML side of our state-of-the-art internal training framework used to train our cutting-edge models. We work on distributed model execution as well as the interfaces and implementation for model code, training, and inference. · Our priorities are to ...

  • Money Fit by DRS

    Platform ML Engineering Manager, Inference

    Found in: Jooble US O C2 - 3 days ago


    Money Fit by DRS San Francisco, CA, United States

    Location · San Francisco, CA · Type · Full time · Department · Research · Compensation · $440K – $530K · In addition to the salary range listed above, total compensation also includes generous equity and benefits. · Medical, dental, and vision insurance for you and your ...

  • Together AI

    Systems Research Engineer, Machine Learning Systems

    Found in: Jooble US O C2 - 1 day ago


    Together AI San Francisco, CA, United States

    Role · As a Systems Research Engineer specialized in Machine Learning Systems, you will play a crucial role in researching and building the next generation AI platform at Together. Working closely with the modeling, algorithm, and engineering teams, you will design large-scale d ...

  • Pinterest

    Staff Software Engineer, ML Serving Platform

    Found in: One Red Cent US C2 - 2 days ago


    Pinterest San Francisco, United States

    About Pinterest: · Millions of people across the world come to Pinterest to find new ideas every day. It's where they get inspiration, dream about new possibilities and plan for what matters most. Our mission is to help those people find their inspiration and create a life they l ...

  • HireIO, Inc.

    Tech lead, senior software engineer,AR/VR

    Found in: Appcast Linkedin GBL C2 - 3 days ago


    HireIO, Inc. San Francisco, United States

    Language:bilingual · Requirements · B.Sc/M.Sc/PhD in Computer Science or in a related field · 5+ years of industrial C/C++ experience, solid CS fundamentals (algorithms, data structures, OOD/DOD, TDD) and problem-solving skills · Solid algorithm system design and implementation e ...

  • Ritual

    Remote - Machine Learning Engineer - Remote

    Found in: Lensa US 4 C2 - 1 day ago


    Ritual San Francisco, CA, United States

    About Ritual · Ritual is the network for open AI infrastructure. We build groundbreaking, new architecture on a crowdsourced governance layer aimed to handle safety, funding, alignment, and model maintenance. · Join us on the journey to decentralize AI · About the role · We ar ...

  • Acceler8 Talent

    Principal Machine Learning Infrastructure Engineer

    Found in: Appcast Linkedin GBL C2 - 5 hours ago


    Acceler8 Talent San Francisco, United States

    Introduction: · Join a pioneering team at the forefront of AI and ML technology, where human-computer collaboration is not just a concept but a reality. Our team is dedicated to revolutionizing user experiences by innovating at every level, from user interfaces down to the most e ...

  • Surya Systems

    Data Scientist

    Found in: Lensa US 4 C2 - 5 days ago


    Surya Systems San Francisco, United States

    Title: Data Scientist · Location : SFO, CA (Remote) · Duration: Long Term contract · Remote · Make sure to apply with all the requested information, as laid out in the job overview below. · About you · • You are passionate about solving complex problems within the Trust & ...

  • Tubi TV

    Senior Software Engineer, Machine Learning Infrastructure

    Found in: Lensa US 4 C2 - 4 days ago


    Tubi TV San Francisco, United States

    About the Role: · This Software Engineering team works closely with Machine Learning and Product teams to assist in building world-class machine-learning operating systems for the Tubi platform. The efforts of this team help take inference systems to the next level of low latenc ...

  • Anthropic Limited

    Performance Engineer

    Found in: Lensa US 4 C2 - 7 hours ago


    Anthropic Limited San Francisco, United States

    Running machine learning (ML) algorithms at our scale often requires solving novel systems problems. As a Performance Engineer, you'll be responsible for identifying these problems, and then developing systems that optimize the throughput and robustness of our largest distributed ...

  • HireIO, Inc.

    Software/ML Engineer-NLP in San Francisco Bay Area/Seattle

    Found in: Appcast Linkedin GBL C2 - 3 days ago


    HireIO, Inc. San Francisco, United States

    Requirements · Improve the working efficiency of e-commerce merchants, influencers, and account managers to boost the growth of sellers' products, customers, and revenue · Participate in the optimization of sellers' operations, such as marketing insights, assortment planning, cus ...

  • OpenAI

    Research Engineer, AI Security

    Found in: Lensa US 4 C2 - 3 days ago


    OpenAI San Francisco, United States

    About the Team · The Safety Systems team is responsible for various safety work to ensure our best models can be safely deployed to benefit society. It is at the forefront of OpenAI's mission to build and deploy safe AGI, driving our commitment to AI safety and fostering a cultu ...

  • Together AI

    Senior Developer Advocate

    Found in: Lensa US 4 C2 - 5 days ago


    Together AI San Francisco, United States

    Role: · As a member of the Developer Relations team at Together AI, you will improve the overall developer experience of our platform and deeply impact our product. You'll do this by coding example applications, writing technical guides, collecting feedback from our customers and ...

  • Syntegra

    Senior Software Engineer

    Found in: Lensa US 4 C2 - 3 days ago


    Syntegra San Francisco, United States

    San Francisco, CA based or remote (US only) · Job Overview · We are looking for a senior Software Engineer / Architect to help us scale our synthetic data engine and platform. The ideal candidate will have strong Python coding skills, expertise in cloud technologies (AWS, Azure, ...