Assisted in building ETL pipelines using AWS Glue and Python
Developed SQL queries for data extraction and transformation
Created Tableau dashboards to provide business insights
Performed data quality checks and troubleshooting for pipeline issues
Automated repetitive data processing tasks using Python scripts
Collaborated with senior engineers to optimize existing data workflows.
Research Assistant
UNIVERSITY OF NORTH CAROLINA
Charlotte, NC
01.2024 - 05.2024
Lead the collection and coding of public estimates on conflict-related casualties across multiple months, ensuring
data accuracy and consistency for ongoing research
Utilize SQLite3 to integrate monthly datasets, perform data validation, correct errors, and automate ingestion via
Python scripts, streamlining the data processing pipeline
Worked on a cloud-based data project using Azure services
Cleaned and analyzed large datasets using Python and SQL
Develop data visualizations, including time series plots and summary statistics, to provide insights into casualty
trends over time, supporting the research team’s analysis and publications
Author detailed reports documenting the data collection and visualization processes, contributing to a manuscript for
submission to a peer-reviewed journal
Assist in statistical models, applying Bayesian methods to resolve identifiability issues and ensure the robustness of
model outputs.
Program Analyst
LEGATO HEALTH TECHNOLOGIES
07.2021 - 08.2021
Analyzed program data to track performance and identify areas for improvement
Developed interactive reports and dashboards using Power BI and Tableau
Assisted in budget analysis, cost tracking, and financial reporting for business programs
Optimized workflows using SQL and Python to improve operational efficiency
Collaborated with cross-functional teams to provide data-driven insights
Automated data collection and reporting processes to enhance decision-making.
BASED ETL PIPELIE USING AWS, University of North Carolina
01.2024 - 04.2024
Designed and implemented an ETL pipeline using AWS services (S3, Glue, Redshift)
Developed Python scripts to clean and transform raw data before storing it in Redshift
Created a Power BI dashboard to visualize key insights from processed data
Automated data ingestion and transformation processes for real-time reporting
Optimized query performance in Redshift using indexing and partitioning techniques.
SALES ANALYTICS
POWER BI
09.2023 - 12.2023
Extracted sales data from multiple sources and stored it in SQL Server
Designed interactive dashboards in Tableau and Power BI to track key business metrics
Automated data updates to ensure real-time reporting
Developed advanced DAX calculations for deeper data analysis
Created user-friendly reports and visualizations for stakeholders.
AZURE DATA PIPELINE FOR RETAIL DATA PROCESSING, University of North Carolina
08.2022 - 12.2022
Built an end-to-end data pipeline using Azure Data Factory and Azure SQL Database
Processed large retail datasets and performed transformations for business analysis
Used Power BI for visualization and reporting
Implemented data validation and error handling in Azure Data Factory
Configured Azure Blob Storage for efficient data storage and retrieval.