Summary
Overview
Work History
Education
Skills
Certification
Languages
Timeline
Generic

SATWIK V

Summary

Results-driven Data Engineer with 4+ years of experience in building scalable data pipelines, cloud migration, and containerized data processing solutions. Proficient in Python, Spark, Snowflake, SQL, and cloud platforms (AWS, Azure). Hands-on expertise in containerizing ETL jobs with Docker, orchestrating workflows with Airflow, and designing robust data architectures. Strong skills in Git, CI/CD, and troubleshooting distributed processing across Kubernetes-based environments.

Overview

6
6
years of professional experience
1
1
Certification

Work History

Data Engineer

One Community Global Inc
, USA
09.2024 - 12.2024
  • Built ETL pipelines using ADF, Kafka, Spark & Snowflake
  • Designed Snowpipes for real-time ingestion
  • Created predictive models & dashboards with Python & Tableau
  • Containerization: Docker, Kubernetes (EKS), Airflow in Docker
  • Migrated on-prem ETL workflows to Azure Data Factory with containerized Spark processing in Docker.
  • Deployed batch jobs in Airflow using KubernetesPodOperator for isolated execution and scale.
  • Troubleshot container issues like memory bottlenecks and Spark executor failures in Dockerized pipelines.

Data Analyst

ScRibezonee Pvt. Ltd
, India
01.2022 - 12.2022
  • Cleaned & analyzed datasets using Python (Pandas, NumPy)
  • Built Power BI dashboards & automated reports
  • Designed containerized ETL prototypes using Python and Docker for local Spark testing environments.
  • Refactored poorly documented Python scripts, reverse engineered data logic, and migrated to reusable modules.
  • Delivered actionable insights from structured & unstructured data
  • Big Data: Hadoop, Hive, Spark, MapReduce, Kafka
  • Cloud: AWS (S3, EMR, Lambda, Redshift), Azure, GCP
  • ETL Tools: Informatica, Talend, Azure Data Facto

Data Engineer

Cognizant (Client: Zoetis)
, India
02.2021 - 06.2022
  • Developed Spark & Snowflake pipelines on AWS
  • Containerized Spark jobs using Docker and deployed in AWS EMR for repeatable data processing environments.
  • Used Git for version control of ETL scripts and coordinated branch-based workflows in CI/CD pipeline.
  • Conducted reverse engineering of legacy S3-based pipelines to improve performance in Snowflake.
  • Automated ETL using Apache Airflow & Scala
  • Migrated data from S3 to Snowflake; performed data profiling
  • Databases: Snowflake, MongoDB, HBase, Cassandra, PostgreSQL
  • Visualization: Power BI, Tableau
  • Tools: Apache Airflow, Docker, Git, Jenkins

Data Analyst

Creators Touch
, India
07.2019 - 12.2020
  • Designed ETL pipelines using Hadoop & cloud tools
  • Performed EDA and built dashboards in Tableau
  • Used Sqoop, Spark & Hive for data migration and transformation

Python Intern

Subrain Solutions
, India
01.2019 - 05.2019
  • Built data pipelines & APIs with Python
  • Assisted in ETL automation using Airflow
  • Gained experience in Spark, Hadoop, and cloud services

Education

M.S. - Information Technology (MIS)

University of Memphis
USA
12.2024

B.Tech - Civil Engineering

Guru Nanak Institute of Technology
India
07.2022

Diploma - Civil Engineering

St. Mary’s Integrated Campus
India
04.2019

Secondary School Certificate (SSC) -

Board of Secondary Education
Telangana
01.2016

Skills

  • ETL pipelines
  • ADF
  • Kafka
  • Spark
  • Snowflake
  • Snowpipes
  • Predictive models
  • Dashboards
  • Python
  • Tableau
  • Big Data
  • Hadoop
  • Hive
  • MapReduce
  • Cleaned datasets
  • Analyzed datasets
  • Pandas
  • NumPy
  • Power BI
  • Automated reports
  • Actionable insights
  • Structured data
  • Unstructured data
  • Cloud
  • AWS
  • S3
  • EMR
  • Lambda
  • Redshift
  • Azure
  • GCP
  • ETL Tools
  • Informatica
  • Talend
  • Azure Data Factory
  • Databases
  • MongoDB
  • HBase
  • Cassandra
  • PostgreSQL
  • Visualization
  • Looker
  • Tools
  • Apache Airflow
  • Docker
  • Git
  • Jenkins

Certification

Google Professional Machine Learning Engineer, https://www.credly.com/earner/earned/badge/d6a7b330-53ad-4675-95db-cb4a5701e2b9

Languages

  • English
  • Telugu
  • Hindi

Timeline

Data Engineer

One Community Global Inc
09.2024 - 12.2024

Data Analyst

ScRibezonee Pvt. Ltd
01.2022 - 12.2022

Data Engineer

Cognizant (Client: Zoetis)
02.2021 - 06.2022

Data Analyst

Creators Touch
07.2019 - 12.2020

Python Intern

Subrain Solutions
01.2019 - 05.2019

M.S. - Information Technology (MIS)

University of Memphis

B.Tech - Civil Engineering

Guru Nanak Institute of Technology

Diploma - Civil Engineering

St. Mary’s Integrated Campus

Secondary School Certificate (SSC) -

Board of Secondary Education
SATWIK V