Data Engineer - Hartford, United States - Caremark Ltd

    Caremark Ltd Hartford, United States

    1 month ago

    Description
    Caremark LLC, a CVS Health company, is hiring for the following role in Hartford, CT: Data Engineer to design, build, and manage large-scale data structures, pipelines, and efficient Extract/Transform/Load (ETL) workflows to support business applications.


    Duties include:

    Develop large-scale data structures and pipelines to organize, collect, and standardize data to generate insights and address reporting needs; write ETL (Extract/Transform/Load) processes, design database systems, and develop tools for real-time and offline analytic processing; collaborate with the Data Science team to transform data and integrate algorithms and models into automated processes; leverage knowledge of Hadoop architecture, HDFS commands, and query design and optimization to build data pipelines; utilize programming skills in Python, Java, or similar languages to build robust data pipelines and dynamic systems; build data marts and data models to support Data Science and other internal customers; integrate data from a variety of sources and ensure adherence to data quality and accessibility standards; analyze current information technology environments to identify and assess critical capabilities and recommend solutions; and experiment with available tools and advise on new tools to provide optimal solutions that meet the requirements dictated by the model or use case.

    Telecommuting Available. Multiple Openings.

    Requirements:

    Master's degree (or foreign equivalent) in Computer Science, Data Science, Information Systems, Mathematics, Analytics, or a related field and completion of a university-level course, research project, internship, or thesis in each of the following: Jenkins and Git; Java, R, and Python; Agile methodologies; Hive; MySQL and NoSQL; analyzing large data sets from multiple data sources; statistical analysis and predictive modeling; and infrastructure components (GPUs or CPUs).


    Apply online or by email. Must reference job title, location, and Req ID when applying.


    Requisition ID: R0205189