Jobs
>
San Francisco

    Remote Software Engineer, Data - San Francisco, United States - Imbue

    Default job background
    Full time
    Description

    Summary:

    Imbue believe that high-quality data is the most important part of creating high-performance machine learning systems, regardless of whether they are simple classifiers or state-of-the-art reasoning agents. Unlike many other organizations, they view this work and this role as one of the most important at the company.

    In this role, you will work on the most important part of Imbue's system: the software infrastructure for collecting, preprocessing, generating, analyzing, and distilling the wide variety of data sources that go into both their primary pretraining data corpus, as well as the datasets for all of the other ancillary and secondary models and system. You will make a meaningful, measurable impact on the performance of Imbue's systems, and experience the joy of spending time to make high-quality software that makes high-quality data.

    Example projects:

  • Incorporate new sources of high-quality text data into our existing data pipelines
  • Develop models for accurately classifying and extracting meaningful text from raw HTML
  • Create a high-quality OCR pipeline for pulling pretraining text from images and scans
  • Collect a ludicrous amount of multimodal data(ex: transcripts for thousands of years of video)
  • Design unique data generation pipelines that leverage existing data(ex: convert code from one language to another)
  • Integrate multiple annotation service providers into a sensible interface for researchers
    You are:
  • Detail oriented. Data mistakes are easy to make and hard to catch.
  • Passionate about data. You should be happy to look at and deeply engage with the raw data.
  • An excellent software engineer. We care about engineering best practices.
  • Familiar with Python.
    Benefits:
  • Work on the most important part of Imbue's system
  • Work at a place that deeply cares about data quality
  • Work directly on creating software with human-like intelligence
  • Very generous compensation
  • Flexible working hours
  • Work remotely
  • Time and budget for learning and self-improvement
    About the company:

    Imbue builds AI systems that reason and code, enabling AI agents to accomplish larger goals and safely work in the real world. Imbue trains its own foundation models optimized for reasoning and prototype agents on top of these models. By using these agents extensively, they can gain insights into improving both the capabilities of the underlying models and the interaction design for agents.

    Imbue aims to rekindle the dream of the *personal* computer, where computers become truly intelligent tools that empower us, giving us freedom, dignity, and agency to pursue the things we love.

    WorksHub are not directly affiliated/are not a direct part of Imbue


  • WPRO TALENTS

    Data Engineer

    3 days ago


    WPRO TALENTS San Francisco, United States

    This is a remote position. · Our client is at the forefront of web3 innovation. Our mission is to establish The Graph as the unbreakable foundation of open data. Our pioneering sub graphs set the industry standard and solidify The Graph as the premier solution for organizing and ...

  • WPRO TALENTS

    Data Engineer

    1 day ago


    WPRO TALENTS San Francisco, United States

    This is a remote position. · Our client is at the forefront of web3 innovation. Our mission is to establish The Graph as the unbreakable foundation of open data. Our pioneering sub graphs set the industry standard and solidify The Graph as the premier solution for organizing and ...

  • Nexus Solutions

    Data Engineer

    1 week ago


    Nexus Solutions Emeryville, United States

    Soda is teaming up with a leading E-commerce platform based in Germany that specializes in creating lottery platforms. With 20 years of experience, they have continuously grown and innovated across Europe. In 2022, they raised an impressive €286 million for social projects and ch ...

  • Infinity Ventures

    Data Engineer

    6 days ago


    Infinity Ventures Oakland, United States

    Oakland Infomotion GmbH is looking for a Data Engineer (m/w/d) to join our team. We are passionate about extracting the maximum value from data and are looking for someone who shares our enthusiasm and has a strong understanding of digital technology. · As a Data Engineer, you wi ...

  • BayOne Solutions

    Data Engineer

    5 days ago


    BayOne Solutions San Francisco, United States Full time

    Senior Data Engineer · Location: (100% Remote) · Duration: Full time · The Opportunity: · Technology · Our technology team works fast and smart. With San Francisco as our home, we take bringing new tech to market seriously, developing the latest in mobile technologies, scalable a ...

  • Triune Infomatics Inc

    Data Engineer

    3 days ago


    Triune Infomatics Inc San Francisco, United States

    Role: Data Engineer · Location: Hybrid in Pleasanton, CA · Duration: 6 months · Areas of Responsibility Include: · • Build test and implement Data Engineering - ETL, Data Integrations, Data Governance, Data Quality. · • Build reusable Data Engineering assets and frameworks. ...

  • Bayone

    Data Engineer

    4 days ago


    Bayone San Francisco, United States

    Your role at Sephora: · As a Sr. Software Engineer you will design and implement innovative analytical solutions and work alongside the product engineering team, evaluating new features and architecture. Reporting to the Engineering Manager, Data Platform, you will work closely ...

  • Tekfortune Inc

    Data Engineer

    5 days ago


    Tekfortune Inc San Francisco, United States

    Tekfortune is a fast-growing consulting firm specialized in permanent, contract & project-based staffing services for world's leading organizations in a broad range of industries. In this quickly changing economic landscape, virtual recruiting and remote work are critical for the ...

  • VRChat Inc

    Data Engineer

    3 days ago


    VRChat Inc San Francisco, United States

    Join the VRChat team · VRChat offers a first-of-its kind, game-changing platform that provides an endless collection of social immersive experiences and gives the power of creation to its robust community. With over 250,000 worlds and growing, VRChat's vision is to allow users t ...

  • FocusKPI Inc.

    Data Engineer

    20 hours ago


    FocusKPI Inc. San Francisco, United States

    Job Description · Job DescriptionFocusKPI is looking for a Data Engineer to join one of our client's team and to support them. As a member of the Data Engineering, you own the ETL/ELT pipelines and data warehouse that are used to analyze trends and activity in our client's produc ...

  • Legalist

    Data Engineer

    4 hours ago


    Legalist San Francisco, United States

    Job Description · Job DescriptionIntro description:Legalist is an institutional alternative asset management firm. Founded in 2016 and incubated at Y Combinator, the firm uses data-driven technology to invest in credit assets at scale. We are always looking for talented people to ...

  • ConsultNet

    Data Engineer

    4 days ago


    ConsultNet San Francisco, United States

    Data Engineer · Direct Hire · 100% Remote · Salary: $130, ,000 a year + company equity · Job Description: · Seeking Data Software Engineer who is well versed in Python and SQL and who loves to code. No specific industry experience required but Healthcare background is a plus. Mus ...

  • TEKsystems

    Data Engineer

    1 week ago


    TEKsystems San Francisco, United States

    : · The data engineer will collaborate closely with the growth and data foundation teams to define and create foundational data tables, migrate test tables while adhering to EDW standards, and build self-serve ETL frameworks capable of handling streaming and batch processing. The ...

  • ConsultNet

    Data Engineer

    3 days ago


    ConsultNet San Francisco, United States

    Data Engineer · Direct Hire · 100% Remote · Salary: $130, ,000 a year + company equity · Job Description: · Seeking Data Software Engineer who is well versed in Python and SQL and who loves to code. No specific industry experience required but Healthcare background is a plus ...

  • Asana

    Data Engineer

    3 days ago


    Asana San Francisco, United States

    As part of our dynamic Data Engineering team, you will play a pivotal role in aiding company-wide, data-informed decision making. Product and Business teams leverage our data assets to optimize user adoption, growth, and experience. We partner with other Data Infrastructure teams ...

  • Saxon Global

    Data Engineer

    1 week ago


    Saxon Global San Francisco, United States

    Client: T he County of Sacramento · Position: Data Engineer - ETL, Spark, Airflow Specialist · Location: Sacramento, Ca ONLY CANDIDATES LOCAL TO SACRAMENTO WILL BE CONSIDERED. 100% ONSITE WORK · Rate:$65/hr. on C2C (Some Flex) · Visa: Open · Duration: 12+ Months · Looking f ...

  • Speak

    Data Engineer

    4 hours ago


    Speak San Francisco, United States

    Job Description · Job DescriptionAbout usOur mission is to become the de facto way people learn foreign languages. We begin by teaching the next billion people English and Spanish. · English is the global language of business, culture, and communication, and over 1.5 billion peop ...

  • Rigil Corporation

    Data Engineer

    5 days ago


    Rigil Corporation San Francisco, United States

    Job Description · Job DescriptionRole: Data Engineer · About Rigil: · Rigil is an award-winning, woman-owned, small business that specializes in technology consulting, strategy consulting and product development. We value teamwork and strive to build strong leaders. · Location: S ...

  • Speak LLC

    Data Engineer

    3 days ago


    Speak LLC San Francisco, United States

    About us · Our mission is to become the de facto way people learn foreign languages. We begin by teaching the next billion people English and Spanish. · English is the global language of business, culture, and communication, and over 1.5 billion people around the world are activ ...

  • Hazel Health, Inc.

    Data Engineer

    6 days ago


    Hazel Health, Inc. San Francisco, United States

    Hazel Health, the national leader in school-based telehealth, was founded in 2015 to address systemic inequities in healthcare access, and ensure all children can get the quality care they need and deserve. We leverage digital health technology to provide on-demand physical and m ...