Suma Ch

Dallas, TX

Summary

  • Results-oriented Data Engineer with 5+ years of combined experience in data science, analytics, and backend portal development.
  • Proven expertise in building scalable data pipelines, ETL workflows, and automation scripts to support high-volume data environments.
  • Strong proficiency in Python and SQL for data manipulation, cleansing, and transformation.
  • Hands-on experience with ETL tools such as Talend, Airflow, Informatica, and Alteryx across diverse data integration projects.
  • Solid understanding of data warehousing concepts, dimensional modeling, and cloud-based data storage (AWS S3, Redshift).
  • Adept at working with relational databases including MySQL, Oracle, and SQL Server, with experience in query optimization and performance tuning.
  • Developed interactive dashboards and KPIs using Tableau and Power BI for business reporting and insights.
  • Excellent communicator and team collaborator with experience in Agile/Scrum environments, cross-functional coordination, and stakeholder engagement.

Overview

4 years of professional experience

Work History

Data Analyst

Frontier Communications
08.2024 - Current
  • Designed and implemented end-to-end data pipelines using Python and SQL to automate data ingestion from APIs, internal systems, and third-party sources, reducing manual data handling by 40%.
  • Developed robust ETL workflows using Apache Airflow and Talend to transform and cleanse large-scale datasets for analytics and reporting.
  • Collaborated with cross-functional teams to identify data needs, define data models, and build scalable data solutions that supported advanced analytics and machine learning initiatives.
  • Created reusable data validation scripts and anomaly detection checks to ensure data integrity across multiple sources and stages.
  • Partnered with business analysts and stakeholders to develop actionable dashboards and visualizations using Tableau and Power BI, enabling data-driven decision-making.
  • Supported the migration of on-premises data workloads to cloud-based architecture (AWS Redshift/S3), optimizing storage and query performance.
  • Actively participated in Agile ceremonies and collaborated in sprint planning to align data engineering deliverables with business priorities.

Data Analyst Intern

PNC Bank
04.2023 - 07.2024
  • Assisted in building ETL workflows to collect, clean, and transform financial data from internal databases using SQL and Excel, supporting reporting and analytics teams.
  • Created and maintained data dictionaries and documentation for internal datasets, contributing to improved data governance practices.
  • Developed Python scripts for data preprocessing and automated data quality checks, reducing manual intervention by 30%.
  • Worked with senior data engineers to design schema models and support data pipeline development for customer behavior analytics.
  • Supported dashboard creation in Tableau for reporting key KPIs and trends to business stakeholders.

Software Engineer

Citrix
07.2021 - 12.2022
  • Developed and maintained internal web portals using JavaScript and Java frameworks to streamline business operations and improve user experience.
  • Integrated backend APIs and optimized data exchange between frontend interfaces and databases, improving system responsiveness by 25%.
  • Collaborated with data engineering teams to implement data-driven features, including real-time data visualization and dynamic reporting dashboards.
  • Wrote SQL queries for data retrieval and manipulation to support user-specific content delivery and reporting functionalities.
  • Participated in performance tuning and debugging of data-intensive modules, ensuring reliable access to large datasets through the portal.
  • Contributed to Agile development cycles and cross-functional team meetings to align portal functionalities with data platform goals.

Education

Data Science

University of Maryland, Baltimore
Baltimore, MD

Skills

    Technical Skills

    Programming & Scripting:
    Python, SQL, Java, JavaScript, Shell Scripting

    Data Engineering & ETL:
    Apache Airflow, Talend, Alteryx, Informatica, AWS Glue, Data Pipelines, ETL Workflows

    Databases & Storage:
    MySQL, SQL Server, Oracle, PostgreSQL, AWS Redshift, Snowflake, Hive, S3

    Big Data & Cloud:
    AWS (S3, Redshift, EC2), Azure Data Lake, Hadoop (basics), Spark (beginner level)

    Data Modeling & Warehousing:
    Star/Snowflake Schema, Dimensional Modeling, Data Lakes, Data Marts

    BI & Visualization Tools:
    Tableau, Power BI, Microsoft Excel

    Version Control & Project Tools:
    Git, JIRA, Confluence

    Concepts & Methodologies:
    ETL/ELT, Data Governance, Data Quality, Data Cleaning, Agile, SDLC
