Analytical and detail-oriented Data Analyst with hands-on experience at Fifth Third Bank and Humana, specializing in data pipeline development, database querying, and business reporting. Proven ability to collaborate cross-functionally and support high-impact projects involving credit risk modeling and healthcare claims analysis. Adept at using SQL, Python (Pandas, NumPy), Power BI, Tableau, PySpark, Snowflake, and Databricks to support data-driven decisions. Master’s in Data Science with deep knowledge of data mining, machine learning foundations, and modern ETL workflows. Strong communicator who delivers actionable insights through dashboards, reports, and executive presentations.
Designed scalable ETL pipelines with PySpark, Apache Airflow, and SQL Server to process millions of loan applications efficiently.
Built and validated machine learning models using Scikit-learn and Python, achieving an 11% reduction in loan delinquency rates.
Integrated Teradata, Snowflake, and Amazon Redshift data into Databricks for unified analysis and comprehensive risk feature engineering.
Automated model deployment and monitoring through Jenkins and Git, ensuring compliance with regulatory audit requirements.
Developed Power BI dashboards to visualize credit risk metrics, helping stakeholders monitor model performance and refine policies.
Automated claims data validation processes using SQL, Python, and Snowflake, reducing claim denial rates by 18%.
Streamlined CPT and ICD code mapping to enhance clinical documentation accuracy.
Designed Power BI and Tableau dashboards to visualize claims trends, enabling identification of compliance gaps.
Implemented ETL pipelines with Talend and Informatica for EHR data extraction from Epic and Cerner systems.
AWS Certified Data Engineer – Associate, Amazon Web Services