Shashank K

Houston,TX

Summary

Around 7+ years of experience as a Data Engineer and coding with analytical programming using Python. Highly skilled Data Engineering with a proven track record in leading high-performing teams and architecting advanced data solutions. I bring extensive expertise in AWS, ECS, EMR, Snowflake, Redshift, CI/CD automation tools, and programming languages such as Python, and PySpark. Recognized for designing and implementing scalable data solutions, optimizing pipelines, and ensuring unmatched data quality. I am committed to delivering actionable insights and spearheading data-driven decision-making initiatives for organizations. Ability to lead cross functional teams, foster collaboration, and architect robust data solutions that exceed organizational objectives. A results-driven professional with a passion for staying at the forefront of data engineering advancements. Ready to contribute skills and strategic vision to drive success in challenging and dynamic environments. Responsive expert experienced in monitoring database performance, troubleshooting issues and optimizing database environment. Possesses strong analytical skills, excellent problem-solving abilities, and deep understanding of database technologies and systems. Equally confident working independently and collaboratively as needed and utilizing excellent communication skills.

Overview

years of professional experience

Work History

Data Engineer/AWS Developer

Fannie Mae

Reston, VA

03.2019 - Current

Collaborated on ETL (Extract, Transform, Load) tasks, maintaining data integrity and verifying pipeline stability.
Fine-tuned query performance and optimized database structures for faster, more accurate data retrieval and reporting.
Enhanced data quality by performing thorough cleaning, validation, and transformation tasks.
Streamlined complex workflows by breaking them down into manageable components for easier implementation and maintenance.
Optimized data processing by implementing efficient ETL pipelines and streamlining database design.
Migrated legacy systems to modern big-data technologies, improving performance and scalability while minimizing business disruption.
Provided technical guidance and mentorship to junior team members, fostering a collaborative learning environment within the organization.
Designed compliance frameworks for multi-site data warehousing efforts to verify conformity with state and federal data security guidelines.
Led end-to-end implementation of multiple high-impact projects from requirements gathering through deployment and post-launch support stages.
Collaborated with data scientists to develop machine learning models by providing the necessary data infrastructure and preprocessing tools.

Data Engineer/AWS Developer

Global Foundries

Malta, NY

07.2018 - 02.2019

ETL Process Design with SQL Server: Designed ETL processes using Informatica to load data from Flat Files, SQL server, and Excel files into the target SQL Server Data Warehouse database.
Support and Collaboration: Provided support for code/design analysis, strategy development, and project planning.Collaborated with infrastructure, network, database, application, and BI teams to ensure data quality and availability.
Spark for Interactive Queries and Streaming: Leveraged Spark for interactive queries, streaming data processing, and seamless integration with NoSQL databases to handle massive data volumes.Developed bespoke ETL solutions, encompassing batch processing and real-time data ingestion pipelines, utilizing Python and shell scripting to facilitate smooth data movement in and out of Hadoop.
Hadoop Ecosystem Deployment: Contributed to streamlining business processes for a regional bank by developing, installing, and configuring Hadoop ecosystem components.

Education

MS Information Technology and Management -

Campbellsville University

Campbellsville, KY

Skills

Database: Databricks, Snowflake,Redshift,GoogleBigQuery
Big Data Processing
Spark Framework
Hadoop Ecosystem
Programming: Pyspark , Python, Scala, SQL,NoSQL
Tool: DBT, Airflow,Kafka, Rest API

Reporting: Tableau, Quick sights
Devops: GitLab,Jenkins , Terraform, serverless,Docker
AWS: EMR, Glue, RDS, IAM, Lambda,S3,EC2, ECS, Redshift, Stepfunction
ETL development
Data Warehousing

Accomplishments

Optimized Data Processing: Enhanced data pipeline performance, reducing processing times by 70% through the implementation of parallel processing and efficient data handling techniques.

Timeline

Data Engineer/AWS Developer

Fannie Mae

03.2019 - Current

Data Engineer/AWS Developer

Global Foundries

07.2018 - 02.2019

MS Information Technology and Management -

Campbellsville University