Overview
Work History
Education
Skills
Timeline
Generic

NEHA GALLA

Charlotte,NC

Overview

4
4
years of professional experience

Work History

Data Engineer- Intern

TEKPRISMIT
Cary, NC
09.2024 - Current
  • Assisted in building ETL pipelines using AWS Glue and Python
  • Developed SQL queries for data extraction and transformation
  • Created Tableau dashboards to provide business insights
  • Performed data quality checks and troubleshooting for pipeline issues
  • Automated repetitive data processing tasks using Python scripts
  • Collaborated with senior engineers to optimize existing data workflows.

Research Assistant

UNIVERSITY OF NORTH CAROLINA
Charlotte, NC
01.2024 - 05.2024
  • Lead the collection and coding of public estimates on conflict-related casualties across multiple months, ensuring data accuracy and consistency for ongoing research
  • Utilize SQLite3 to integrate monthly datasets, perform data validation, correct errors, and automate ingestion via
  • Python scripts, streamlining the data processing pipeline
  • Worked on a cloud-based data project using Azure services
  • Cleaned and analyzed large datasets using Python and SQL
  • Develop data visualizations, including time series plots and summary statistics, to provide insights into casualty trends over time, supporting the research team’s analysis and publications
  • Author detailed reports documenting the data collection and visualization processes, contributing to a manuscript for submission to a peer-reviewed journal
  • Assist in statistical models, applying Bayesian methods to resolve identifiability issues and ensure the robustness of model outputs.

Program Analyst

LEGATO HEALTH TECHNOLOGIES
07.2021 - 08.2021
  • Analyzed program data to track performance and identify areas for improvement
  • Developed interactive reports and dashboards using Power BI and Tableau
  • Assisted in budget analysis, cost tracking, and financial reporting for business programs
  • Optimized workflows using SQL and Python to improve operational efficiency
  • Collaborated with cross-functional teams to provide data-driven insights
  • Automated data collection and reporting processes to enhance decision-making.

BASED ETL PIPELIE USING AWS, University of North Carolina
01.2024 - 04.2024
  • Designed and implemented an ETL pipeline using AWS services (S3, Glue, Redshift)
  • Developed Python scripts to clean and transform raw data before storing it in Redshift
  • Created a Power BI dashboard to visualize key insights from processed data
  • Automated data ingestion and transformation processes for real-time reporting
  • Optimized query performance in Redshift using indexing and partitioning techniques.

SALES ANALYTICS

POWER BI
09.2023 - 12.2023
  • Extracted sales data from multiple sources and stored it in SQL Server
  • Designed interactive dashboards in Tableau and Power BI to track key business metrics
  • Automated data updates to ensure real-time reporting
  • Developed advanced DAX calculations for deeper data analysis
  • Created user-friendly reports and visualizations for stakeholders.

AZURE DATA PIPELINE FOR RETAIL DATA PROCESSING, University of North Carolina
08.2022 - 12.2022
  • Built an end-to-end data pipeline using Azure Data Factory and Azure SQL Database
  • Processed large retail datasets and performed transformations for business analysis
  • Used Power BI for visualization and reporting
  • Implemented data validation and error handling in Azure Data Factory
  • Configured Azure Blob Storage for efficient data storage and retrieval.

Education

Master’s - computer science, Data Science

UNIVERSITY OF NORTH CAROLINA, 2024
May

Bachelor’s - computer science

JAWAHARLAL NEHRU TECHNOLOGICAL UNIVERSITY Hyd
Jul 2021

Skills

  • Programming Languages:
  • Python, SQL, Cloud Platforms: AWS (S3, Lambda, Redshift, Glue), Azure (Data Factory, Data
  • Lake, Synapse),Data Engineering: ETL, Data Pipelines, Data Warehousing, Databases: Snowflake, PostgreSQL, SQL
  • Server, Visualization Tools: Tableau, Power BI, Big Data Technologies: Spark, Hive (Basic Understanding)Version
  • Control: Git, GitHub Program Analysis: Data-driven decision-making, Budget tracking, Performance analysis

Timeline

Data Engineer- Intern

TEKPRISMIT
09.2024 - Current

Research Assistant

UNIVERSITY OF NORTH CAROLINA
01.2024 - 05.2024

BASED ETL PIPELIE USING AWS, University of North Carolina
01.2024 - 04.2024

SALES ANALYTICS

POWER BI
09.2023 - 12.2023

AZURE DATA PIPELINE FOR RETAIL DATA PROCESSING, University of North Carolina
08.2022 - 12.2022

Program Analyst

LEGATO HEALTH TECHNOLOGIES
07.2021 - 08.2021

Master’s - computer science, Data Science

UNIVERSITY OF NORTH CAROLINA, 2024

Bachelor’s - computer science

JAWAHARLAL NEHRU TECHNOLOGICAL UNIVERSITY Hyd
NEHA GALLA