Senior Data Scientist - Nashville, United States - XOi Technologies

    XOi Technologies Nashville, United States

    1 month ago

    Description

    Applicants must be authorized to work for ANY employer in the U.S. We are unable to sponsor or take over sponsorship of an employment visa at this time.
    Senior Data Scientist


    XOi Technologies is powering a world in which people and equipment are connected, decisions are transparent, and quality outcomes are predictable.

    XOi is the market leader in "curb-to-curb" technician enablement solutions, empowering techs to safely capture—and access—critical job site information, launch remote support, and provide customers photos and videos of recommended and completed work.

    Field services technicians across the nation utilize our suite of products to increase efficiency and performance on the job, streamline communication, build skills and gain data insights to be unlocked and shared with the entire ecosystem.


    We are looking for a Senior Data Scientist who is well versed in traditional Machine Learning (ML), Deep Learning (DL), NLP, and OCR, and who has experience leveraging Large Language Models (LLMs) to drive powerful insights and actionable recommendations that empower our customers and users.

    You will work with unstructured and structured data from millions of jobs completed by our users, including documents, images, and videos.

    You will own complex data problems from concept to deployment. You will have an opportunity to leverage your expertise as a Data Scientist to impact critical business solutions.


    Our ideal candidate:

    Customer first:

    You love tackling complex and challenging data questions and delivering innovative solutions to real-world data problems


    Able to work effectively cross-functionally (Product, Engineering, etc.) to scope data science projects and problems and to meet defined deadlines


    Full-cycle AI Solution Development:

    You have built end-to-end solutions from ideation to deployment, including testing and ongoing training of models.


    Proficient with OCR, Computer Vision, NLP, Search, Recommendation, and Generative machine learning systems.


    Solid understanding of Large Language Models (LLMs) and foundation models, with practical experience in related concepts like RAG, fine-tuning, and prompt engineering


    Comfortable with DevOps/MLOps for model release management and deployment


    Technical:

    Hands-on experience developing data science solutions using open-source tools, languages, and cloud computing platforms, preferably AWS.


    Proficient with the following (or related): Python, scikit-learn, NumPy, pandas, spaCy


    You understand the algorithms underlying DNNs, GANs, GPTs, CNNs, RNNs, and NLP


    Proven track record implementing ML experiments using both structured and unstructured data


    Committed to creating strong evaluations and keeping quality at the forefront of model development and training


    Able to communicate complex technical concepts to non-technical audiences


    Tech Stack: Git, Python and Python frameworks, TensorFlow, PyTorch, AWS (EC2, RDS, DynamoDB, S3, Lambda, and SageMaker), Snowflake, and Docker

    Optimized Model Performance:

    Analysis and Interpretation:

    Experienced in establishing project success metrics and in analyzing and interpreting model performance


    Experienced in predictive modeling


    Innovative:

    You continually research and evaluate emerging technologies, published state-of-the-art methods, and applications and seek out opportunities to apply them.


    You aren't afraid to experiment and have a track record of implementing ML experiments using both structured and unstructured data


    Data Analysis and Modeling:

    Using analytic modeling, statistical analysis, programming, and/or another appropriate scientific method, you develop and implement qualitative and quantitative methods for characterizing, exploring, and assessing large datasets in various states of organization, cleanliness, and structure, taking into account the unique features and limitations of the data


    You have outstanding data skills to extract, connect, and analyze data from a variety of structured and unstructured sources. You know great data science begins with a great understanding of the data.


    Qualifications:

    3+ years of hands-on experience in Data Science and Machine Learning


    Master's degree in Computer Science, Statistics, Mathematics or related fields


    Strong programming skills using Python and Python-based frameworks (e.g., TensorFlow, PyTorch)


    Strong understanding of commonly used Machine Learning and Deep Learning algorithms, processes, tools, and platforms (e.g., Hugging Face)


    Proficient working with large structured and unstructured data sets incorporating NLP and OCR to solve problems. Computer Vision is a bonus.


    Practical experience working with Large Language Models and using GPT or other Transformer models for natural language (NL) search and information retrieval


    Proficient using AWS and appropriate services, including but not limited to Lambda, EC2, S3, and DynamoDB


    Proven experience working with complex data structures to drive query efficiency for search and analysis while executing ML tasks


    Experienced in leveraging modern tools for building predictive models from the ground up using ML pipelines at each stage, including data processing, training, inference, deployment, and evaluation


    You thrive working independently or as part of a team