Summary
Overview
Work History
Education
Skills
Websites Social Links - Linkedin
Timeline
Generic

Suchitra Anand Gunisetty

Charlotte,NC

Summary

Strategic and results-driven Data Engineer with 3+ years of expertise in designing and implementing data architectures. Hands-on in ETL processes, data warehousing, and optimizing data pipelines. Adept at leveraging cutting-edge technologies to ensure data accuracy, availability, and scalability. Worked on transforming raw data into actionable insights to drive informed business decisions. Collaborative team player with excellent communication abilities. Committed to delivering high-quality, efficient data solutions to meet organizational goals. Organized and dependable candidate successful at managing multiple priorities with a positive attitude. Willingness to take on added responsibilities to meet team goals.

Overview

4
4
years of professional experience

Work History

Data Engineer

Global Logic
08.2022 - 04.2023
  • Developed scalable data pipelines using Scala and Spark to analyze, develop, and test potential use cases for the business in a fast-paced agile development environment.
  • Leveraged AWS services such as EMR, S3, and MWAA for data ingestion on Airflow.
  • Implemented functional programming principles with strong Scala programming skills, emphasizing DataFrames, Datasets, and Hadoop Filesystem.
  • Curated and maintained data layers on the data lake, ensuring data quality checks and optimizing Spark jobs for performance and cost efficiency.
  • Created reusable YAML configs for faster development of ingestion pipelines and participated in data architecture design using DBT components with AWS services.
  • Engineered multiple Glue jobs to consume data from the S3 standard layer, loading it into the Postgres master table.
  • Applied Test-Driven Development (TDD) to prevent data leakage and dirty reads.
  • Implemented data quality checks to ensure accurate and valuable results, providing stakeholders confidence in the system design.
  • Worked on performance optimization of Postgres tables and views to reduce run time and deadlock occurrences.
  • Developed and deployed Lambda functions for a serverless data pipeline using SQS services, orchestrated on Airflow.
  • Implemented CI/CD best practices using Jenkins and participated in DevOps processes with Git and Azure DevOps.

Associate Data Engineer

Edge IT Soft
07.2019 - 11.2021
  • Built data engineering pipelines using Scala in AWS for financial data analysis.
  • Refactored Python codes to Pyspark to utilize Spark parallelization, emphasizing strong Scala programming skills.
  • Provided business data to users for analysis and conducted enhancements in data pipelines based on user requirements.
  • Performed performance and integration testing for platform upgrades.
  • Developed data ingestion jobs in Python for collecting data from multiple channels and external applications.
  • Implemented batch and streaming ingestion of data, optimizing Spark performance using broadcast variables, dynamic allocation, partitioning, and building custom Spark UDFs.
  • Collaborated on ETL tasks, ensuring data integrity and verifying pipeline stability.
  • Designed and implemented effective database solutions and models for storing and retrieving data.

Education

Master of Science - Computer Science

Jessup University
San Jose, CA
11.2023

Bachelor of Science - Computer Science

JNTUH
Hyderabad, India
09.2020

Skills

  • Team Management
  • Strong Communication
  • Data Structures and Algorithms
  • C
  • C
  • R
  • Python
  • Pyspark
  • SQL
  • NoSQL
  • MongoDB
  • Sqoop
  • Apache Spark
  • Hadoop
  • AWS
  • Amazon S3
  • AWS Glue
  • AWS Lambda
  • Power BI
  • MS Excel

Websites Social Links - Linkedin

https://www.linkedin.com/in/suchitra-anand-gunisetty

Timeline

Data Engineer

Global Logic
08.2022 - 04.2023

Associate Data Engineer

Edge IT Soft
07.2019 - 11.2021

Master of Science - Computer Science

Jessup University

Bachelor of Science - Computer Science

JNTUH
Suchitra Anand Gunisetty