- Lead the design and implementation of advanced NLP techniques and methodologies to extract intricate scientific concepts and reasoning from vast textual sources.
- Lead the design and implementation of advanced NLP techniques and methodologies to extract chemical information, including SMILES notations, properties, and interleaved text, for multimodal language model training and chemical property prediction.
- Develop and refine language-based data labeling pipelines tailored for scientific discovery, ensuring high-quality annotated datasets for training large language models and AI agents.
- Collaborate closely with cross-functional teams to identify key research areas and define labeling strategies to capture nuanced scientific insights effectively.
- Spearhead the development of innovative approaches for data annotation, incorporating state-of-the-art NLP algorithms to enhance accuracy and efficiency.
- Provide expert guidance on data annotation best practices, ensuring consistency and quality across labeled datasets.
- Conduct thorough analyses to evaluate the effectiveness of labeling pipelines and make continuous improvements to optimize performance.
- Stay abreast of the latest advancements in NLP and data annotation techniques, integrating emerging methodologies to enhance our data labeling capabilities.
- Experience with AI agent research, using knowledge-based Retrieval-Augmented Generation (RAG) to improve the accuracy of language generation.
- Experience with cloud computing platforms and services (e.g., AWS, Azure, Google Cloud) for scalable data processing and storage.
- Knowledge of data visualization techniques and tools for exploring and presenting scientific insights.
- Advanced degree (master's or PhD preferred) in computer science, data science, or a related field.
- Extensive hands-on experience in natural language processing, with a strong emphasis on designing and implementing language-based data labeling pipelines.
- Proven track record of leveraging NLP techniques to extract complex scientific concepts and reasoning from textual sources.
- Familiarity with deep learning models and architectures for NLP tasks, such as transformer-based models (e.g., BERT, GPT).
- Proficiency with Git and Linux-based systems, and with programming languages such as Python, R, or Java, along with expertise in relevant libraries and frameworks (e.g., PyTorch, NLTK, TensorFlow).
- Exceptional problem-solving skills with meticulous attention to detail, coupled with a passion for advancing scientific discovery through data science.
- Excellent communication and collaboration skills, with the ability to effectively convey complex technical concepts to diverse stakeholders.
Senior Data Scientist - Remote, United States - SES AI Corp
4 days ago
Description
Senior Data Scientist, Natural Language Processing and Data Annotation Expert
About us
At SES AI, we are at the forefront of revolutionizing lithium-metal battery creation with our groundbreaking approach that integrates cutting-edge machine learning techniques into our research and development processes. Our mission is to lead the next wave of scientific discovery in material science, powered by advanced AI technologies with a dedication to AI for Science.
To learn more about SES, please visit:
Position Scope
We are seeking a seasoned (senior) Data Scientist specializing in Natural Language Processing (NLP) and Data Annotation to spearhead our innovative projects. The ideal candidate will possess exceptional expertise in NLP, using state-of-the-art multimodal language models to retrieve and extract intricate chemical information in material and battery science. We are also looking for an individual with a profound understanding of designing language-based data labeling pipelines that extract scientific chains of thought for advanced reasoning and discovery. This pivotal role will involve leading the creation of labeled datasets crucial for training our cutting-edge language models and AI agents.
This role will be remote.
Responsibilities
Preferred Qualifications
Qualifications