Summary
Overview
Work History
Education
Skills
Certification
Timeline
Generic

Sreeja T

Fremont,USA

Summary

Experienced Senior Data Engineer with over 8 years of expertise in Big Data, Cloud, ETL, and data analysis. Eager to continuously enhance the company's data ecosystem, transforming challenges into valuable opportunities by leveraging data engineering and analytical skills to support the delivery of critical business metrics, enable self-serve reporting and analysis capabilities, contributing to the company's mission of maintaining an industry-leading analytics stack.

Overview

11
11
years of professional experience
1
1
Certification

Work History

Senior Data Engineer

Country Financial
06.2019 - Current
  • Translated business requirements in to technical requirements and created design artifacts like Data model, Technical specification documents and Application distribution diagrams.
  • Designed and built Spark framework that is extensible and reusable between on-prem and cloud to ingest and transform structured/semi-structured data in Hadoop.
  • Reengineered existing ETL workflows to improve performance by identifying bottlenecks and optimizing Hive queries accordingly in cost effective way.
  • Developed ETL pipelines in PySpark and responsible for maintaining data integrity, and verifying pipeline stability.
  • Driving initiative to automate recurring manual activities for monitoring and operations using Unix Scripting or python scripts.

Data Engineer

CUNA Mutual Group
02.2017 - 06.2019
  • Developed data ingestion, aggregation, integration and advanced analytics using Snowflake and Azure Data Factory.
  • Integrated semi-structured data like XML with structured data from multiple sources, storing as ORC files on blob.
  • Developed ETL flows in Spark using python within Microsoft Azure Cloud.
  • Built ETL pipelines in PySpark on Azure to load data in Snowflake for optimized reporting needs replacing Netezza.

Software Development Engineer

Ratiocination Ltd.
08.2014 - 07.2015
  • Worked on developing ETL processes to load data from multiple data sources to HDFS using Sqoop, perform structural modifications using Map-Reduce, analyze data using Hive and visualizing in dashboards.
  • Developed pig scripts to transform the data into structured format

Systems Engineer Intern

Infosys Technologies
03.2014 - 07.2014
  • Worked on Informatica Designer, Workflow Manager, and Workflow Monitor.
  • Created and Designed Data Source and Data Source Views Using SQL Server Analysis Services 2008 (SSAS).
  • Created Dimensions and Cubes using Star schema and Snowflake schema.
  • Used standard reports, sub reports, cross tab reports, bar chart, line graphs in Cognos.
  • Supported Datastage processes for enhancements.

Education

Artificial Intelligence Graduate Certificate -

Stanford College of Engineering
California
12.2025

MS Computer Science -

University of Illinois
Springfield, IL
07.2016

BTech Computer Science -

JNTUH, India
India
06.2013

Skills

  • Big Data Ecosystems: Hadoop, MapReduce, HDFS, Hive, Pig, Sqoop, Spark, YARN
  • Programming Languages: Java, SQL, Python
  • Software: Informatica, SSAS, IBM Cognos, SSIS, IBM DataStage
  • Databases: NoSQL, Oracle, My SQL, MS SQL, Snowflake
  • Cloud: Amazon AWS, Microsoft Azure

Certification

  • MapR Certified Hadoop Developer (MCHD), 09/2016
  • Professional Scrum Master I (PSM-I), 09/2018
  • Data Analytics from Cornell University, 10/2018
  • Certified SAFe 5 Practitioner, 11/2020
  • Certified SAFe Product Owner/Product Manager, 02/2021

Timeline

Senior Data Engineer

Country Financial
06.2019 - Current

Data Engineer

CUNA Mutual Group
02.2017 - 06.2019

Software Development Engineer

Ratiocination Ltd.
08.2014 - 07.2015

Systems Engineer Intern

Infosys Technologies
03.2014 - 07.2014

Artificial Intelligence Graduate Certificate -

Stanford College of Engineering

MS Computer Science -

University of Illinois

BTech Computer Science -

JNTUH, India
  • MapR Certified Hadoop Developer (MCHD), 09/2016
  • Professional Scrum Master I (PSM-I), 09/2018
  • Data Analytics from Cornell University, 10/2018
  • Certified SAFe 5 Practitioner, 11/2020
  • Certified SAFe Product Owner/Product Manager, 02/2021
Sreeja T