Chandana Cheenepalle

West Haven, CT

Summary

Driven Data Engineer with experience designing, building, installing, testing, and maintaining highly scalable data management systems. Skilled in translating complex functional and technical requirements into detailed architecture and design. Strengths include troubleshooting, analytical thinking, and familiarity with data tools such as Snowflake, AWS, Spark, dbt, and Denodo. Contributed to improving system efficiency in a previous role through effective collaboration and innovative problem-solving.

Overview

6 years of professional experience
1 Certification

Work History

Data Engineer Intern

Pragmatic IT
Vernon, CT
06.2023 - 05.2024
  • Designed and implemented a comprehensive data pipeline, leveraging AWS Kinesis, Snowflake tables, stages, and dbt models, to integrate various data sources, ensuring efficient and seamless data flow across the enterprise.
  • Enhanced data processing efficiency by 30% through optimizing Kinesis Firehose, Snowflake external stages, and dbt transformations and tests, significantly reducing latency in real-time and batch processing scenarios.
  • Integrated Denodo for data virtualization to streamline access to various data sources, reducing data extraction times, and improving overall query performance across multiple environments.
  • Developed AWS Lambda functions in Python, triggered by SQS for complex data transformations and validations, integrated with Snowflake and Denodo, reducing false positives by 15% and enhancing system accuracy.
  • Increased data ingestion speed by 40% by fine-tuning Kinesis Firehose and Snowpipe configurations and leveraging Denodo's data federation capabilities, reducing overall processing time by 25%.
  • Implemented CI/CD pipelines with Git, reducing deployment time by 50%, integrating dbt for automated testing and deploying Snowflake objects, ensuring consistent and rapid delivery of updates.

Data Engineer

LUMEN Technologies
India
08.2018 - 08.2022
  • Utilized Airflow for ETL orchestration in Databricks, achieving a 16% improvement in processing efficiency and accuracy.
  • Orchestrated the development and continuous maintenance of data pipelines using PySpark, SparkSQL, and ADLS in Databricks, achieving a 30% reduction in data ingestion time and enhancing data integrity across multiple sources.
  • Collaborated with various business departments to design and implement 20+ data pipelines within Databricks, enabling comprehensive analysis of technical issues and improving operational decision-making.
  • Led the migration of data processing workflows from traditional ETL tools to Databricks and Delta Lake, resulting in a 14% increase in performance and annual cost savings of $678,000.
  • Crafted scalable data pipelines and ETL processes in Databricks using PySpark, processing large datasets (RDDs) with a daily volume of 10 TB and improving data ingestion speed by 67%.
  • Implemented data validation and quality checks within Databricks using PySpark and SparkSQL, ensuring the accuracy and reliability of transformed data from ADLS, which led to a 25% reduction in data discrepancies.
  • Utilized Delta Lake within Databricks to manage large-scale data, ensuring ACID transactions and optimizing storage, which improved query performance by 20% and facilitated efficient data handling.

Education

Master of Science - Data Science

University of New Haven
Connecticut, USA
05.2024

Skills

  • Python (NumPy, pandas, Seaborn, Matplotlib)
  • Java
  • SQL
  • PyTorch
  • PySpark
  • Hadoop
  • Spark (RDD)
  • Airflow
  • dbt (data build tool)
  • Informatica
  • Databricks
  • AWS (Glue, EC2, S3, EBS, EMR, IAM, Lambda, VPC, RDS, CloudWatch, SNS, SQS, Athena, Redshift, Step Functions)
  • Snowflake (External Tables, Clone, Data Sharing, Time Travel, Snowpipe, Streams and Tasks)
  • Natural Language Processing, Machine Learning, Large Language Models (LLMs), Retrieval-Augmented Generation (RAG)
  • Power BI

Certification

  • Databricks Certified Associate Developer for Apache Spark 3.0, Databricks
  • SnowPro Core Certification, Snowflake
