Data Engineer - Palo Alto
1 week ago

Job description
We are so glad you are interested in joining Sutter Health.
Organization:
SHSO-Sutter Health System Office-Bay
Position Overview:
Responsible for developing Sutter Health's research analytic data infrastructure. This includes all aspects of how data is ingested, stored, collected, governed, cleansed, accessed, and used. It includes making sure that the data used by the organization is of the highest quality and is made available as soon as possible in a format that allows the business (researchers) to make critical decisions based on the data. Utilizes tools and infrastructure such as scalable data pipelines to manage high-volume, high-speed data storage and retrieval, as well as automated testing and tools for improving data quality. Works with all types of data including batch and streaming data, structured, semi-structured and unstructured data, files, web downloads, and other sources of data. Creates and improves processes required by other data-dependent functions including analytics, strategic business intelligence, and data science. Uses state-of-the-art methods to capture, route and store data, combining information from different sources, transforming it to improve the data's reliability, quality and usability. Develops and tests new architectures that enable data extraction, automation, and modeling for predictive or prescriptive analytic purposes. Sets the standard for high-value, high-quality datasets that are accurate, timely, secure and well-suited to the strategic analytic purposes of the research organization.
Work on IRB-approved research studies, providing accurate and timely curated data.
Work closely with Principal investigators and statisticians.
Work in accordance with Research Privacy and HIPAA regulations and methods for safeguarding PHI and PII.
This position is part of a new, exciting strategic initiative within Sutter Health Research. While this role is designated as limited-term, there is a strong possibility that qualified incumbents may be considered for extensions based on organizational needs and performance.
This is a hybrid role with both work from home and onsite requirements. The successful candidate will live in the Sutter footprint.
Job Description:
EDUCATION:
- Bachelor's degree in Computer Science, Engineering, Information Management, or Healthcare Administration
TYPICAL EXPERIENCE:
- 8 years recent relevant experience.
SKILLS AND KNOWLEDGE:
- Experience creating data pipelines on big data platforms and data integrations in databases and data lakes, working with various cloud and on-premises technologies.
- Experience leveraging scalable data platforms to build secure infrastructure; experience building batch or streaming data ingestion pipelines.
- Ability to assess and profile raw data and reassemble raw data from multiple sources into a single, enterprise model.
- Hands on experience with data management tools (Cloudera, Spark, Python, Databricks, etc.); fluency with SQL programming, scripting, and data architecture.
- Extensive familiarity with relational database concepts / technologies (SQL, Oracle, etc.) including data design, table design, partitioning, as well as determining the technology to use in any given scenario.
- Experience ensuring data quality and implementing tools and frameworks for automating identification of data quality issues.
- Strong understanding of data engineering and data traceability best practices and frameworks.
- Ability to work in a consulting role, building technology and communicating with end-users and customers of varying levels of technical capability.
- Ability to produce high-quality, professional documentation and communication materials.
- Strong knowledge in the development of Business Intelligence and Reporting solutions.
- Ability to translate data into Management reports and presentations.
- Strong problem solving, organization, and prioritization skills.
- Detail-oriented, producing timely results; able to work both independently with minimal supervision and as a member of a scrum/product team.
- Track-record of successful project delivery, building collaborative cross-functional relationships, and an ability to find creative ways to solve business problems.
- Ability to balance the competing needs of multiple priorities and work in a dynamic environment; ability to perform under pressure and in stressful situations.
- Demonstrable capacity for learning technical concepts and adapting to new technologies quickly; ability to stay current with evolving best practices in data management.
- Familiar with healthcare provider data structures and sources; experienced with HIPAA regulations and methods for safeguarding PHI and PII through mitigation of data exposure risk.
- Knowledge of health care operations and structure, general requirements in an integrated delivery.
- The role requires significant technical skills, including multiple programming languages and extensive knowledge of SQL (T-SQL, ANSI SQL), as well as experience with modern, distributed, scalable data platforms.
- Build tables and analytical datasets for the research team by partnering with the Principal Investigator and statistician for IRB approved studies.
- Propose, develop and implement algorithms for investigating and cleaning data collected from healthcare systems.
- Work with Research Privacy and Security staff as necessary.
- Write clear and logical documentation on the sources and methodologies used to extract and transform the information for the researchers, other analysts and programming staff.
Job Shift:
Days
Schedule:
Full Time
Days of the Week:
Monday - Friday
Weekend Requirements:
Occasionally
Benefits:
Yes
Unions:
No
Position Status:
Exempt
Weekly Hours:
40
Employee Status:
Limited Term (Fixed Term)
Sutter Health is an equal opportunity employer EOE/M/F/Disability/Veterans.
Pay Range is $145,204.80 to $217,796.80 / annual salary. Sacramento Valley Pay Range is $126,256.00 to $189,384.00 / annual salary.
The compensation range may vary based on the geographic location where the position is filled. Total compensation considers multiple factors, including, but not limited to a candidate's experience, education, skills, licensure, certifications, departmental equity, training, and organizational needs. Base pay is only one component of Sutter Health's comprehensive total rewards program. Eligible positions also include a comprehensive benefits package.