
Swathi K

Chicago, IL

Summary

  • Detail-oriented Data Engineer who designs, develops, and maintains highly scalable, secure, and reliable data structures. Accustomed to working closely with system architects, software architects, and design analysts to understand business or industry requirements and develop comprehensive data models. Proficient at developing database architectural strategies at the modeling, design, and implementation stages.
  • Organized and dependable candidate, successful at managing multiple priorities with a positive attitude. Willing to take on added responsibilities to meet team goals.
  • Detail-oriented team player with strong organizational skills. Able to handle multiple projects simultaneously with a high degree of accuracy.

Overview

8 years of professional experience

Work History

Big Data Engineer

Samsung
10.2022 - Current
  • Developed Spark Streaming applications for cloud-to-Hive data transfer, configured Spark for optimized processing, and designed Scala workflows for data pull and transformation.
  • Created shell scripts for HDFS data ingestion, used Pig scripts to analyze semi-structured data, and contributed to planning a star schema data warehouse.
  • Developed Tableau dashboards, created workflows with Informatica ETL, implemented data visualizations with Python and Tableau, and designed configurable data delivery pipelines.
  • Supported production with UNIX shell scripting, implemented MapReduce programs, and contributed to MongoDB design and maintenance.

Data Engineer

Albertsons Companies
02.2021 - 10.2022
  • Collaborated with business users to gather and define requirements, and developed Spark scripts in Python and Scala for scalable data solutions.
  • Used AWS and GCP services, implemented ETL frameworks, and created Tableau visualizations.
  • Processed batch and streaming data loads while working with Snowflake schemas and data warehousing.
  • Handled data pre-processing, cleaning, and analysis in Python, wrote Pig Latin scripts, and set up databases in GCP.
  • Participated actively in daily scrum meetings and sprint planning in an Agile environment.

Data Engineer

Arvest Bank
06.2019 - 01.2021
  • Wrote source-to-target data mapping documents and developed Spark programs in Scala for batch processing, applying functional programming principles.
  • Used Spark transformations for data cleansing, monitored jobs, and analyzed large datasets with MapReduce.
  • Owned ETL solutions and data modeling: developed Sqoop scripts, designed PySpark ETL pipelines on AWS EMR, and created reports in Tableau.
  • Implemented Pig Latin scripts, worked on real-time processing with Kafka, and worked with the AWS stack.
  • Managed batch and streaming data loads while working with Snowflake schemas and data warehousing, created HBase tables, and worked within Agile methodologies.

Data Engineer

Nextbrain
07.2016 - 05.2019
  • Designed compliance frameworks for multi-site data warehousing efforts to verify conformity with state and federal data security guidelines.
  • Streamlined complex workflows by breaking them down into manageable components for easier implementation and maintenance.
  • Collaborated with data scientists to develop machine learning models by providing the necessary data infrastructure and preprocessing tools.

Education

Master of Science - Information Technology

Valparaiso University, Valparaiso, IN

Skills

  • R Programming
  • Python
  • SQL
  • Scala
  • Red Hat Linux
  • Unix
  • Snowflake
  • Teradata
  • Oracle
  • MySQL, Microsoft SQL Server, PostgreSQL
  • AWS
  • Azure
  • GCP
  • ETL Development
  • Data Quality Assurance
  • Data Warehousing
  • Data Visualization
  • Data Migration
  • Data Modeling
  • Big Data Processing
  • Machine Learning
  • Performance Tuning
  • Database Management
  • Project Management
  • NoSQL Databases
  • Real-time Analytics
  • SQL Server Integration Services (SSIS)
  • User Acceptance Testing (UAT)
  • Apache Spark

Timeline

Big Data Engineer - Samsung
10.2022 - Current
Data Engineer - Albertsons Companies
02.2021 - 10.2022
Data Engineer - Arvest Bank
06.2019 - 01.2021
Data Engineer - Nextbrain
07.2016 - 05.2019
Valparaiso University - Master of Science, Information Technology