Data Engineer - Topeka, United States - GradBay

    Description

    Title:
    Data Engineer


    Location:
    North Carolina

    *Competitive salary + benefits package*

    The Company

    A healthcare research institute committed to conducting innovative basic and clinical research, rapidly translating breakthrough discoveries to patient care and population health, providing a unique educational experience to future clinical and scientific leaders, and improving the health of populations around the world.

    Job Description
    This position reports to the Data Partnerships Director of Data and Analytics Platforms.

    This position will be responsible for managing and executing the deidentification processes applied to the University Health System data assets to be included in the Federated Clinical Applications Platform (FCAP), and for managing and administering the FCAP deidentification cloud environment.

    Solutions will focus on state-of-the-art data and analytics tools, including traditional and near real-time data warehousing, big data, and relational and document-based databases, using extract, load, and transform (ELT) toolsets as well as REST APIs and FHIR.

    Essential Tasks/Responsibilities
    Create and follow defined procedures in the deidentification of patient medical information
    Maintain and tune the deidentification environment to perform optimally and comply with DUHS and DHTS policies and standards
    Collaborate with the Cloud Team on troubleshooting issues
    Assemble large, complex data sets that meet business requirements
    Identify, design, and implement internal process improvements: automating manual processes, optimizing data delivery, re-designing infrastructure for greater scalability, etc.
    Recommend designs for analytics solutions that improve data integration, data quality, and data delivery, with an eye toward reusable components
    Create and maintain an optimal data pipeline architecture
    Articulate differences, advantages, and disadvantages between architectural solution methods
    Work with Agile team members to document and execute test plans for data loading and data validation scripts.
    Support the code promotion process through development and production as required by using standard CI/CD processes
    Develop, implement, and maintain schedule/dependency logic for automated ETL processing
    Develop monitoring, logging, and error notification processes to ensure data is updated as expected and processing metrics are reported
    Participate in the creation and maintenance of standards for coding, documentation, error handling, error notification, logging, etc.
    Conform to established architectural, developmental, and operational standards and practices, including the creation of metadata
    Build analytics tools that utilize the data pipeline to provide actionable insights into customer acquisition, operational efficiency, and other key business performance metrics
    Work with stakeholders including the Executive, Product, Data, and Design teams to assist with data-related technical issues and support their data infrastructure needs
    Required Skills and Experience
    Bachelor's degree in a related field, or four years of equivalent technical experience required
    5+ years of experience in a Data Engineer role, including hands-on, professional experience with the following:
    Relational SQL and NoSQL databases
    Writing and executing Python programs and shell scripts on Linux
    Intermediate Linux administration
    Data Engineering on Microsoft Azure
    Data pipeline and orchestration tools such as Azure Data Factory and SQL Server Integration Services
    Developing on cloud-based analytic platforms such as Azure Synapse
    Performing root cause analysis on internal and external data and processes to answer specific business questions and identify opportunities for improvement
    Supporting and working with cross-functional teams in a dynamic environment
    A successful history of manipulating, processing, and extracting value from large disconnected datasets
    Intermediate to advanced skills in Python programming
    Intermediate to advanced skills in the Azure Cloud Data Engineering Stack
    Intermediate Linux administration and shell scripting
    Advanced working SQL knowledge and experience working with relational and non-relational database systems
    Strong analytic, documentation, and organizational skills

    Desired Skills:
    Prior experience in health care IT
    Working knowledge of Azure DevOps & Automation/Orchestration
    Knowledge of open-source software solutions and open-source as a business model
    Technical breadth across application development, enterprise architecture, or application integration
    Understanding of Agile methodology
    Knowledge of APIs, API Integration, and API Management
    How to Apply?
    Interested candidates are invited to submit their resume by following the apply link.
