- Incorporate new sources of high-quality text data into our existing data pipelines
- Develop models for accurately classifying and extracting meaningful text from raw HTML
- Create a high-quality OCR pipeline for pulling pretraining text from images and scans
- Collect a ludicrous amount of multimodal data(ex: transcripts for thousands of years of video)
- Design unique data generation pipelines that leverage existing data(ex: convert code from one language to another)
- Integrate multiple annotation service providers into a sensible interface for researchers
You are: - Detail oriented. Data mistakes are easy to make and hard to catch.
- Passionate about data. You should be happy to look at and deeply engage with the raw data.
- An excellent software engineer. We care about engineering best practices.
- Familiar with Python.
Benefits: - Work on the most important part of Imbue's system
- Work at a place that deeply cares about data quality
- Work directly on creating software with human-like intelligence
- Very generous compensation
- Flexible working hours
- Work remotely
- Time and budget for learning and self-improvement
About the company:Imbue builds AI systems that reason and code, enabling AI agents to accomplish larger goals and safely work in the real world. Imbue trains its own foundation models optimized for reasoning and prototype agents on top of these models. By using these agents extensively, they can gain insights into improving both the capabilities of the underlying models and the interaction design for agents.
Imbue aims to rekindle the dream of the *personal* computer, where computers become truly intelligent tools that empower us, giving us freedom, dignity, and agency to pursue the things we love.
WorksHub are not directly affiliated/are not a direct part of Imbue
-
Data Engineer
3 days ago
WPRO TALENTS San Francisco, United StatesThis is a remote position. · Our client is at the forefront of web3 innovation. Our mission is to establish The Graph as the unbreakable foundation of open data. Our pioneering sub graphs set the industry standard and solidify The Graph as the premier solution for organizing and ...
-
Data Engineer
1 day ago
WPRO TALENTS San Francisco, United StatesThis is a remote position. · Our client is at the forefront of web3 innovation. Our mission is to establish The Graph as the unbreakable foundation of open data. Our pioneering sub graphs set the industry standard and solidify The Graph as the premier solution for organizing and ...
-
Data Engineer
1 week ago
Nexus Solutions Emeryville, United StatesSoda is teaming up with a leading E-commerce platform based in Germany that specializes in creating lottery platforms. With 20 years of experience, they have continuously grown and innovated across Europe. In 2022, they raised an impressive €286 million for social projects and ch ...
-
Data Engineer
6 days ago
Infinity Ventures Oakland, United StatesOakland Infomotion GmbH is looking for a Data Engineer (m/w/d) to join our team. We are passionate about extracting the maximum value from data and are looking for someone who shares our enthusiasm and has a strong understanding of digital technology. · As a Data Engineer, you wi ...
-
Data Engineer
5 days ago
BayOne Solutions San Francisco, United States Full timeSenior Data Engineer · Location: (100% Remote) · Duration: Full time · The Opportunity: · Technology · Our technology team works fast and smart. With San Francisco as our home, we take bringing new tech to market seriously, developing the latest in mobile technologies, scalable a ...
-
Data Engineer
3 days ago
Triune Infomatics Inc San Francisco, United StatesRole: Data Engineer · Location: Hybrid in Pleasanton, CA · Duration: 6 months · Areas of Responsibility Include: · • Build test and implement Data Engineering - ETL, Data Integrations, Data Governance, Data Quality. · • Build reusable Data Engineering assets and frameworks. ...
-
Data Engineer
4 days ago
Bayone San Francisco, United StatesYour role at Sephora: · As a Sr. Software Engineer you will design and implement innovative analytical solutions and work alongside the product engineering team, evaluating new features and architecture. Reporting to the Engineering Manager, Data Platform, you will work closely ...
-
Data Engineer
5 days ago
Tekfortune Inc San Francisco, United StatesTekfortune is a fast-growing consulting firm specialized in permanent, contract & project-based staffing services for world's leading organizations in a broad range of industries. In this quickly changing economic landscape, virtual recruiting and remote work are critical for the ...
-
Data Engineer
3 days ago
VRChat Inc San Francisco, United StatesJoin the VRChat team · VRChat offers a first-of-its kind, game-changing platform that provides an endless collection of social immersive experiences and gives the power of creation to its robust community. With over 250,000 worlds and growing, VRChat's vision is to allow users t ...
-
Data Engineer
20 hours ago
FocusKPI Inc. San Francisco, United StatesJob Description · Job DescriptionFocusKPI is looking for a Data Engineer to join one of our client's team and to support them. As a member of the Data Engineering, you own the ETL/ELT pipelines and data warehouse that are used to analyze trends and activity in our client's produc ...
-
Data Engineer
4 hours ago
Legalist San Francisco, United StatesJob Description · Job DescriptionIntro description:Legalist is an institutional alternative asset management firm. Founded in 2016 and incubated at Y Combinator, the firm uses data-driven technology to invest in credit assets at scale. We are always looking for talented people to ...
-
Data Engineer
4 days ago
ConsultNet San Francisco, United StatesData Engineer · Direct Hire · 100% Remote · Salary: $130, ,000 a year + company equity · Job Description: · Seeking Data Software Engineer who is well versed in Python and SQL and who loves to code. No specific industry experience required but Healthcare background is a plus. Mus ...
-
Data Engineer
1 week ago
TEKsystems San Francisco, United States: · The data engineer will collaborate closely with the growth and data foundation teams to define and create foundational data tables, migrate test tables while adhering to EDW standards, and build self-serve ETL frameworks capable of handling streaming and batch processing. The ...
-
Data Engineer
3 days ago
ConsultNet San Francisco, United StatesData Engineer · Direct Hire · 100% Remote · Salary: $130, ,000 a year + company equity · Job Description: · Seeking Data Software Engineer who is well versed in Python and SQL and who loves to code. No specific industry experience required but Healthcare background is a plus ...
-
Data Engineer
3 days ago
Asana San Francisco, United StatesAs part of our dynamic Data Engineering team, you will play a pivotal role in aiding company-wide, data-informed decision making. Product and Business teams leverage our data assets to optimize user adoption, growth, and experience. We partner with other Data Infrastructure teams ...
-
Data Engineer
1 week ago
Saxon Global San Francisco, United StatesClient: T he County of Sacramento · Position: Data Engineer - ETL, Spark, Airflow Specialist · Location: Sacramento, Ca ONLY CANDIDATES LOCAL TO SACRAMENTO WILL BE CONSIDERED. 100% ONSITE WORK · Rate:$65/hr. on C2C (Some Flex) · Visa: Open · Duration: 12+ Months · Looking f ...
-
Data Engineer
4 hours ago
Speak San Francisco, United StatesJob Description · Job DescriptionAbout usOur mission is to become the de facto way people learn foreign languages. We begin by teaching the next billion people English and Spanish. · English is the global language of business, culture, and communication, and over 1.5 billion peop ...
-
Data Engineer
5 days ago
Rigil Corporation San Francisco, United StatesJob Description · Job DescriptionRole: Data Engineer · About Rigil: · Rigil is an award-winning, woman-owned, small business that specializes in technology consulting, strategy consulting and product development. We value teamwork and strive to build strong leaders. · Location: S ...
-
Data Engineer
3 days ago
Speak LLC San Francisco, United StatesAbout us · Our mission is to become the de facto way people learn foreign languages. We begin by teaching the next billion people English and Spanish. · English is the global language of business, culture, and communication, and over 1.5 billion people around the world are activ ...
-
Data Engineer
6 days ago
Hazel Health, Inc. San Francisco, United StatesHazel Health, the national leader in school-based telehealth, was founded in 2015 to address systemic inequities in healthcare access, and ensure all children can get the quality care they need and deserve. We leverage digital health technology to provide on-demand physical and m ...
Remote Software Engineer, Data - San Francisco, United States - Imbue
Description
Summary:
Imbue believe that high-quality data is the most important part of creating high-performance machine learning systems, regardless of whether they are simple classifiers or state-of-the-art reasoning agents. Unlike many other organizations, they view this work and this role as one of the most important at the company.
In this role, you will work on the most important part of Imbue's system: the software infrastructure for collecting, preprocessing, generating, analyzing, and distilling the wide variety of data sources that go into both their primary pretraining data corpus, as well as the datasets for all of the other ancillary and secondary models and system. You will make a meaningful, measurable impact on the performance of Imbue's systems, and experience the joy of spending time to make high-quality software that makes high-quality data.
Example projects: