
sreeja p
Technology / Internet
About sreeja p:
I work across big data ecosystems, including Spark, Hive, Athena, Python, PySpark, Redshift, AWS Kinesis, and DynamoDB. My current project uses an architecture that ingests data both in batch and in real time. For batch, we ingest data from Teradata and SQL Server and land it in S3. I write PySpark ETL scripts that read the data from S3, run all the computations in Spark on EMR, and write the results back to Hive and Athena. I also export the latest snapshot data to Redshift for analytical purposes, where visualization developers build Power BI and Tableau dashboards on top of it. For real time, I write Spark Streaming applications that ingest data from AWS Kinesis and write it to DynamoDB for API consumption. Our downstream consumers are stakeholders, data scientists, and other engineering teams. We use Airflow DAGs to schedule all the workflows, which we develop in Python.
Education
Master's in Data Science
Bachelor's in Computer Science and Engineering