Detail-oriented Data Analyst with 4+ years of experience in extracting, transforming, and analyzing complex datasets to support data-driven decision-making in healthcare and financial services. Skilled in SQL, Python, R, and cloud platforms like AWS and GCP, with expertise in building ETL pipelines, developing predictive models, and creating interactive dashboards using Power BI and Tableau. Strong background in data warehousing, statistical analysis, and cross-functional collaboration to improve business intelligence and operational efficiency.
Overview
5
5
years of professional experience
1
1
Certification
Work History
Data Analyst
E&Y (Ernst & Young)
TX
06.2024 - Current
Extracted and prepared over 20 million financial transactions using SQL in BigQuery, consolidating customer activity, device metadata, and merchant profiles to create a unified dataset for fraud pattern analysis.
Designed and automated ETL workflows on Google Cloud Platform using Cloud Composer (Apache Airflow) and Dataflow, supporting daily ingestion and transformation of real-time fraud signals, and reducing manual intervention by over 90%.
Created custom fraud indicators such as location variance, transaction velocity, device ID frequency, and account behavior metrics using Python (pandas, NumPy) to enhance model features and improve signal-to-noise ratio in training data.
Trained and optimized classification models using scikit-learn (Random Forest, Logistic Regression) to score transactions by fraud risk; achieved an AUC of 0.91 and improved fraud detection precision by 38% over the legacy rules-based system.
Developed interactive dashboards in Power BI to visualize anomaly scores, regional fraud trends, and high-risk accounts; streamlined risk team workflows by enabling immediate access to daily alerts and priority scoring.
Integrated model outputs into the client's fraud case management system and collaborated with compliance and audit teams to define actionable thresholds; insights contributed to a $1.2M reduction in fraud losses over two quarters and improved investigative efficiency.
Data Analyst
Softage Group
India
07.2020 - 07.2023
Engineered a centralized data integration framework for over 3.2 million patient records from multiple hospital systems (Cerner, Meditech, in-house HIS) using SQL Server Integration Services (SSIS) and Python (pandas, pyodbc), improving data accessibility across departments.
Automated data ingestion and transformation pipelines with SSIS and Python, handling daily batch processing of clinical datasets, including demographics, lab results, diagnoses, and vitals, reducing ETL runtime by 40% and maintaining a 99.9% data load success rate.
Developed over 30 custom validation rules in SQL and Python to identify and resolve data quality issues such as duplicates, missing values, and format inconsistencies. This improved the accuracy of patient records and reduced manual cleanup efforts by 50%.
Standardized incoming data into HL7 and FHIR formats in collaboration with IT and clinical informatics teams, enabling seamless integration across platforms and ensuring compliance with HIPAA and internal governance standards.
Performed statistical and visual analysis using Python (pandas, seaborn, matplotlib) to evaluate patient outcomes and treatment efficiency, supporting the implementation of targeted care plans that led to a 17% decrease in 30-day readmissions.
Built executive-level dashboards in Power BI to monitor trends in infection rates, length of stay, and readmission by department and condition. These dashboards enabled real-time operational insights across 12 hospitals.
Automated weekly reporting workflows with Python and Excel VBA, reducing turnaround time by 70% and eliminating manual errors in recurring performance summaries sent to department heads and leadership.
Documented data architecture, lineage, and transformation logic using Confluence and SharePoint, facilitating easier onboarding for new team members and ensuring audit-readiness for compliance reviews.
Education
Master of Science - Business Analytics
East Texas A&M University
Bachelor of Business Administration - Information Technology
Parul University
Skills
SDLC
Agile
Waterfall
Kanban
Lean Six Sigma
Python
SQL
R
Visual Studio Code
PyCharm
Jupyter Notebook
NumPy
Pandas
Matplotlib
SciPy
Ggplot2
TensorFlow
Seaborn
Scikit-learn
Tableau
Power BI
Advanced Excel
Amazon Web Services (AWS)
Google Cloud Platform (GCP)
MySQL
SQL Server
PostgreSQL
Oracle
Ad hoc Report
EDA
ETL Tools
Informatica Power Center
Machine Learning Algorithms
Deep Learning
NLP
Big Data Technologies
Spark
Probability distributions
Predictive Modelling
Hypothesis Testing
Regression Analysis
Linear Algebra
Advance Analytics
SAS
SSIS
SSRS
SSMS
Data Mining
Data warehousing
Data transformation
Clustering
Classification
Regression
A/B Testing
Forecasting & Modelling
Data Cleaning
Data Wrangling
Jira
Confluence
GitHub
Bitbucket
Windows
Linux
Mac OS
Certification
Google Data Analytics Professional Certificate
CS50: CS50's Introduction to Computer Science, Harvard University
Work Availability
monday
tuesday
wednesday
thursday
friday
saturday
sunday
morning
afternoon
evening
swipe to browse
Accomplishments
“Automated the ingestion and transformation of over 20 million financial transactions using Python and GCP tools (Airflow, Dataflow), reducing manual intervention by 90%.”
“Built predictive fraud detection models using Random Forest and Logistic Regression (AUC = 0.91), improving detection precision by 38%.”
“Created custom fraud indicators like transaction velocity and device frequency, improving model feature quality and business insight.”
“Standardized 3.2M+ patient records into HL7 and FHIR formats using SSIS and Python, ensuring HIPAA compliance and interoperability.”
“Reduced ETL runtime by 40% by automating daily batch processing of clinical datasets with SSIS and Python.”
“Developed Power BI dashboards to visualize fraud trends and patient outcomes, enhancing decision-making across finance and healthcare departments.”
“Collaborated with cross-functional teams to implement data quality rules in SQL and Python, cutting manual cleanup time by 50%.”
“Reduced 30-day hospital readmissions by 17% through analytical support of targeted care plan strategies.”
“Improved weekly reporting efficiency by 70% using Python automation and Excel VBA scripting.”
“Integrated model outputs with fraud case management systems, contributing to a $1.2M reduction in fraud losses over two quarters.”
Languages
English
Native or Bilingual
Timeline
Data Analyst
E&Y (Ernst & Young)
06.2024 - Current
Data Analyst
Softage Group
07.2020 - 07.2023
Master of Science - Business Analytics
East Texas A&M University
Bachelor of Business Administration - Information Technology