    Azure Data Lake Principal Architect/Principal Data Engineer - Somerville, United States - Mastech Digital

    Technology / Internet
    Description

    TITLE: Azure Data Lake Principal Architect/Principal Data Engineer

    LOCATION: Somerville MA (HYBRID): 1-2x a week onsite - Hours: 8:30am-5pm

    Type: PERM

    Salary range: $140k-$170k (Depending on Exp)

    *** Candidate needs to have experience doing solution architecture on Azure. The customer has a mature data platform currently and this person will be responsible for all new designs moving forward.***

    OVERVIEW

    • The Data Lake Team plays a pivotal role in harnessing the power of data to drive innovation, enhance patient care, and advance research initiatives. As a leading integrated health system, the organization relies on a robust data infrastructure to make informed decisions and optimize healthcare delivery.
    • The mission of the Data Lake Team is to establish and maintain a centralized and scalable data platform, the Data Lake, that consolidates diverse datasets from various sources across the healthcare system. This initiative aims to break down data silos, foster collaboration, and empower stakeholders with timely and accurate information.
    • The Data Lake Team collaborates closely with various stakeholders, including clinicians, researchers, administrators, and IT professionals. Cross-functional partnerships ensure that the Data Lake aligns with the evolving needs of the community.
    • Embracing a culture of innovation, the team is committed to exploring emerging technologies and methodologies to enhance the capabilities of the Data Lake, Lakehouse, and Enterprise Data Warehouse. Continuous improvement initiatives are undertaken to refine processes, optimize performance, and stay at the forefront of healthcare data management.
    • The Data Lake Team is at the forefront of revolutionizing healthcare through effective data management. By fostering collaboration, ensuring data integrity, and facilitating advanced analytics, the team contributes significantly to providing high-quality patient care and advancing medical knowledge.

    DESCRIPTION

    Seeking a highly skilled and experienced Principal Data Engineer who will play a crucial role in designing, developing, and maintaining data infrastructure and platform integration solutions. The Principal Data Engineer will be responsible for advancing data engineering capabilities, ensuring data quality, and contributing to the success of data-driven initiatives.

    RESPONSIBILITIES

    Infrastructure, Architecture and Design:

    • Design end-to-end data solutions on the Azure data platform, considering scalability, security, compliance, and performance.
    • Contribute to the technical foundation of the Data Lake platform, expanding and optimizing the data ecosystem and pipeline architecture.
    • Provide expertise in selecting and implementing appropriate data platform services (e.g. Azure, Databricks, Snowflake) for various data processing and storage needs.
    • Collaborate with cross-functional teams to integrate data solutions with existing platform systems and applications. Integrate new data management technologies aligned with the latest vendor products and capabilities (Microsoft Azure and Fabric, Databricks, Snowflake, Collibra, etc.), along with software engineering tools, into existing structures.
    • Design, develop, construct, test, and maintain Data Lake architectures and large-scale data processing systems.
    • Support big data, data lake, lakehouse ecosystem related tool selection and POC analysis.

    Data Pipeline Development:

    • Gather and process raw data at scale, including large, complex data sets, meeting functional and non-functional business requirements (using ADF, Databricks, Python, PySpark, scripts, REST API calls, SQL queries, etc.).
    • Develop data set processes for data modeling, mining, and production.
    • Create and maintain optimal data pipeline architecture on cloud-based platforms (e.g., Azure) and relational data systems (SQL Server, SSIS, Snowflake).

    Cross-Functional Collaboration:

    • Work on cross-functional teams delivering enterprise solutions for internal and external clients.
    • Collaborate with Software Developers, Database Architects, Data Analysts, and Data Scientists on data initiatives.
    • Support stakeholders, including the Management team, Product owners, and Architecture teams, in addressing data-related technical issues and fulfilling data infrastructure needs.
    • Partner with integrated platform & data science teams to define integrated solutions to meet the evolving needs of data analysts, developers & consumers (e.g. data ingestion framework, AI/ML capabilities, scalable compute), contributing to the innovation and leadership of the organization.

    Data Optimization and Automation:

    • Identify, design, and implement internal process improvements, automation of manual processes, optimization of data delivery, etc.
    • Build the data infrastructure required for optimal extraction, transformation, and loading of data from traditional/legacy sources.

    Subject Matter Expertise and Leadership:

    • Act as a subject matter expert for internal or external data products.
    • Contribute to solution architecture design and advise on engineering solution best practices.
    • Mentor and guide junior data engineers by leading code reviews and documenting best practices.

    QUALIFICATIONS:

    • Bachelor's or Master's degree in Computer Science, Information Technology, or related fields; or comparable work experience
    • 8+ years of related professional experience including 5+ years in data lake development in large reporting environment(s)
    • Expert in Azure cloud computing, specializing in Azure data engineering stacks like ADF (Azure Data Factory), ADLS, Event Hubs, Snowflake, Databricks, streaming, Azure PowerShell, and Log Analytics.
    • Hands-on development experience with design and architecture of big data frameworks/tools: Azure Data Lake, Snowflake, Azure Databricks.
    • Expert experience with Hadoop-based technology (e.g. Spark) and programming (Python, SQL, PySpark)
    • Knowledgeable about cloud computing costs and performance and capable of providing ongoing suggestions for cost & performance optimization.
    • Solid understanding of Snowflake computing, including its integration with Azure Data Lake, utilizing ADLS as a source for data processing.
    • Experience with design and architecture of relational SQL and NoSQL databases, including MS SQL Server and Snowflake.
    • Experience with Azure DevOps, familiar with CI/CD (Continuous Integration/Continuous Deployment) processes and capable of scaling them across engineering teams, leveraging Azure native resources for data ingestion.
    • Experience leading and working with cross-functional teams in a dynamic environment, with a demonstrated track record of team leadership, technical acumen, innovation, and tactical and strategic thinking.
    • Proven verbal communication and presentation skills, with the ability to clearly and concisely communicate complex technical concepts to both technical and non-technical audiences.
    • Proven ability to work independently.

    SKILLS

    • Advanced hands-on SQL, Spark, Python, and PySpark knowledge, and experience working with relational databases for data querying and retrieval on multiple platforms.
    • Proficiency in Data Modeling tools (e.g. Erwin, Visio).
    • Strong interpersonal and communication skills, both written and verbal.
    • Strong Scrum/Agile development experience.
    • Excellent organizational skills and attention to detail, with the ability to manage multiple tasks and projects, meet deadlines, follow through, and keep to schedule.
    • Strong innovation capabilities and the ability to think creatively.
    • Strong collaboration and team building skills within, across and outside of an organization. Maintain and promote a positive team environment.
    • Maintains stable performance under pressure, demonstrating sensitivity to diverse organizational culture.
    • Ability to cope effectively with change, remain flexible and adaptable in a fast-paced environment with rapidly changing requirements, and negotiate situations where the big picture is not clearly defined.

    Note: If interested please email me your resume at or call me at .

    Best Regards,

    Anil Chamoli

    Team Lead (Recruitment & Sales)


  • Puma

    Engineer IT Data

    2 weeks ago


    Puma Somerville, United States

    YOUR MISSION · Design and Implementation of Spark Solutions end-to-end, from Data Ingestion, Data Lake Design and ELT Design. · Building of prototypes for PUMA Solution Architects in technological areas of Apache Spark, Synapse Pipelines, Python and GitHub DevOps. · Deep underst ...

  • Tessera Therapeutics

    Data Engineer

    3 weeks ago


    Tessera Therapeutics Cambridge, United States

    Company Summary: · Arana Biosciences is one of the latest companies founded through Flagship Pioneering's venture creation engine, where companies such as Moderna Inc. and Indigo Ag were conceived and created. Since Flagship's founding in 2000, the firm has originated and fostere ...

  • Catalytic Data Science

    Data Engineer

    2 weeks ago


    Catalytic Data Science Cambridge, United States

    Job Description · Salary: · Data Engineer · Engineering · REMOTE OPPORTUNITY · About Catalytic Data Science (CDS): · Catalytic Data Science is a groundbreaking cloud R&D platform designed to integrate the volumes of scientific resources, data, and analytic tools ...

  • GlaxoSmithKline

    Data Engineer

    1 week ago


    GlaxoSmithKline Cambridge, United States Full time

    Site Name: USA - Massachusetts - Cambridge · Posted Date: May · Hybrid role at Cambridge Park Drive, MA - requires 2-3 days on-site per week. · We are seeking a Software Engineer/Data Engineer who is passionate about improving patients' quality of life. The successful candidate ...

  • TetraScience

    Data Engineer

    2 weeks ago


    TetraScience Boston, United States

    Job Description · Who We Are · TetraScience is the Scientific Data and AI Cloud company with a mission to radically improve and extend human life. TetraScience combines the world's only open, purpose-built, and collaborative scientific data and AI cloud with deep s ...

  • Gunderson Dettmer

    Data Engineer

    2 weeks ago


    Gunderson Dettmer Boston, United States

    Job Description · Gunderson Dettmer is the only business law firm of its kind - exclusively serving the global venture capital and emerging technology marketplace. With 400 attorneys in eleven offices - from Silicon Valley to Singapore - we innovate for innovators, ...

  • IQVIA

    Data Engineer

    4 hours ago


    IQVIA Boston, United States Full time

    Lasso, an IQVIA company, is growing and looking for super curious, passionate, and driven individuals to join the team. Our people are our greatest asset and we're committed to creating an environment where we all thrive doing what we love. · Lasso is the world's most comprehensi ...

  • TetraScience

    Data Engineer

    1 week ago


    TetraScience Boston, United States

    Job Description · Who We Are · TetraScience is the Scientific Data and AI Cloud company with a mission to radically improve and extend human life. TetraScience combines the world's only open, purpose-built, and collaborative scientific data and AI cloud with deep s ...

  • Gunderson Dettmer

    Data Engineer

    1 week ago


    Gunderson Dettmer Boston, United States Full time

    Gunderson Dettmer is the only business law firm of its kind - exclusively serving the global venture capital and emerging technology marketplace. With 400 attorneys in eleven offices - from Silicon Valley to Singapore - we innovate for innovators, accelerate entrepreneurship, an ...

  • TD Garden

    Data Engineer

    2 weeks ago


    TD Garden Boston, United States Full time

    The Opportunity · The Boston Bruins and TD Garden seek a technical professional to join our dynamic data-driven department. The Data Engineer will support a group that is responsible for strategy and business intelligence across all facets of the business side of an NHL team and ...

  • firsthand

    Data Engineer

    3 weeks ago


    firsthand Boston, United States Full time

    firsthand is changing the way individuals living with Serious Mental Illness (SMI) get care. We are focusing on delivering real outcomes for a cohort that has historically been underserved, stigmatized, and deprioritized. By building a service focused on whole-person care, firsth ...

  • Northeastern University

    Data Engineer

    3 weeks ago


    Northeastern University Boston, United States Full time

    The College of Social Sciences and Humanities is a leader in the Experiential Liberal Arts. The College is strongly committed to fostering excellence through diversity and enthusiastically welcomes applications from members of groups underrepresented in higher education administr ...


  • Formlabs Somerville, United States

    To reinvent an industry, you have to build the best team. Join Formlabs if you want to bring groundbreaking professional 3D printers to the desktop of every designer, engineer, researcher, and artist in the world. · Formlabs is growing fast, and that means a lot of data to crunch ...

  • Evolution Recruitment Solutions, USA

    Senior Data Engineer

    3 weeks ago


    Evolution Recruitment Solutions, USA Cambridge, United States

    AI Focused Start Up: Senior Data Engineer with AI focus (Cambridge, MA - 2 days onsite) · SPONSORSHIP UNAVAILABLE · $160,000-170,000 · Evolution is partnering with a cutting-edge AI-focused startup dedicated to revolutionizing the healthtech industry through the power of artifici ...


  • Mass General Brigham Somerville, United States

    About Us: · As a not-for-profit organization, Mass General Brigham is committed to supporting patient care, research, teaching, and service to the community by leading innovation across our system. Founded by Brigham and Women's Hospital and Massachusetts General Hospital, Mass ...

  • PA Consulting

    Data Engineer

    1 week ago


    PA Consulting Boston, United States Full time

    Company Description · We believe in the power of ingenuity to build a positive human future. · As strategies, technologies and innovation collide, we create opportunity from complexity. · Our diverse teams of experts combine innovative thinking and breakthrough use of technologie ...

  • Sanofi Group

    Lead Data Engineer

    3 weeks ago


    Sanofi Group Cambridge, United States

    Lead Data Engineer - AI-Driven Solutions (IICS Proficiency) · About Sanofi: · We are an innovative global healthcare company, driven by one purpose: we chase the miracles of science to improve people's lives. Our team, across some 100 countries, is dedicated to transforming the p ...

  • Evolution Recruitment Solutions

    Senior Data Engineer

    4 weeks ago


    Evolution Recruitment Solutions Cambridge, United States

    AI Focused Start Up: Senior Data Engineer with AI focus (Cambridge, MA - 2 days onsite) · SPONSORSHIP UNAVAILABLE · $160,000-170,000 · All the relevant skills, qualifications and experience that a successful applicant will need are listed in the following description. · Evolu ...

  • EF World Journeys

    Lead Data Engineer

    3 weeks ago


    EF World Journeys Cambridge, United States

    At EF World Journeys, we believe that the best way to learn about the world is to experience it, and we strive to help as many people as possible share that experience. EF Go Ahead Tours and EF Ultimate Break are divisions of EF World Journeys and we make world travel easy. We wa ...


  • BlueWave Solutions Winthrop, United States

    What You'll Do at Base Camp · Conceptualize, implement, and optimize business processes and systems in various areas of expertise within the private equity industry. · Consult and support clients in projects to optimize control-related content in areas such as improving corporate ...