A Data Analyst with 3+ years of experience in finance and healthcare sectors, Proficient in analyzing complex datasets using SQL, Python (Pandas, NumPy, Scikit-Learn), R, and SAS. Experienced in creating dashboards and visualizations with
Tableau, Power BI, and QlikView to deliver actionable insights.
Skilled in applying statistical methods and predictive modeling using tools like Python (statsmodels, TensorFlow, scikit-learn),
R, and PySpark to identify trends and patterns.
Experienced in implementing ETL processes with Talend, Informatica, and Apache Spark. Proficient in data warehousing and querying databases using MySQL, PostgreSQL, MongoDB, and Oracle.
Knowledgeable in cloud platforms such as AWS (SageMaker, EMR, Redshift, and KMS) for data processing and security. Experienced in process automation using UiPath and scripting languages.
Adept at collaborating with cross-functional teams using JIRA, Confluence, and Agile methodologies. Ensures data compliance with healthcare regulations (HIPAA) and maintains version control with Git.
Research & Data Analysis
• Led the development of a data-driven inventory management system, optimizing stock levels across multiple distribution centers. Utilized SQL to extract, transform, and load (ETL) data from enterprise systems to reduce the stock discrepancies.
• Preprocessed and integrated structured and unstructured data from systems such as SAP, ensuring high data quality. Automated Excel-based reports using VBA for reducing manual effort.
• Conducted data audits using SQL queries to identify inconsistencies and discrepancies in datasets, improving data accuracy for reporting and analysis.
Data Science and Predictive Modeling
• Built predictive models in Python using scikit-learn and statsmodels to forecast transportation delays and optimize inventory planning, reducing late deliveries by 15%.
• Leveraged pandas, NumPy, and Matplotlib for exploratory data analysis (EDA) and trend visualization, identifying inefficiencies in supply chain operations.
Data Visualization / Reporting
• Designed and implemented interactive dashboards in Tableau to monitor key metrics for the EU MDR Program, providing real-time visibility into regulatory compliance and operational performance.
• Collaborated with stakeholders to gather reporting requirements and ensure dashboards met business needs, enhancing data-driven decision-making.
• Optimized logistics and customer service data analysis by integrating SQL-based queries into Tableau reports, streamlining data retrieval and visualization..
Data Management and Databases: SQL, NoSQL (MongoDB, Cassandra), ETL Processes
Programming Languages: Python (Pandas, NumPy, Matplotlib, Seaborn), R, SAS
Data Visualization: Tableau, Power BI, Excel (Advanced, VBA)
Data Warehousing Amazon Redshift, Snowflake, Google BigQuery
Statistical Analysis & Machine Learning: Statistical Software (SAS, SPSS), Machine Learning (TensorFlow, Scikit-learn), Data Mining
Cloud Platforms: AWS, Azure (Certified)
Data Processing and ETL Tools: ETL Tools (Talend, Informatica, SSIS), Apache Airflow
Version Control: Git/GitHub
Time Series Forecasting for Bitcoin (crypto currency) in TensorFlow
• Developed deep learning models using RNN and LSTMs models with TensorFlow to capture temporal correlations in historical data, successfully implemented and maintained Bit-Predict in a closed system.
• Performance measures such as MAE and MSE were used to evaluate and generalize predicting skills for previously unknown data.
Ship Detection in Satellite images using Neural Network
• Collaborated in a team of three to categorize 4000 RGB pictures from planet satellite photos using ANN and CNN models using python libraries and technologies.
• Achieved 99% accuracy in ship classification using the CNN model, displaying skills in dataset navigation, pixel value extraction, and deep learning approaches for computer vision and analysis.