Data Scientist with 7+ years of proven experience in data science and analysis in a data scientist role, building and deploying ML models. Expertise in applying machine learning concepts and techniques related to supervised and unsupervised learning. Hands-on experience developing deep learning models. Programming expertise in Python, R, PySpark, and SQL, and the ability to work with relational databases using SQL, coupled with big data architecture and pipeline, Hadoop, and Hive. Expertise in Databricks, Tableau, and Power BI. Expertise in articulating and translating business questions, and using statistical techniques to arrive at an answer using available data, including data visualization. Strong communication skills, ability to work with multi-functional teams.
key Accomplishments
Healthcare Patient Segmentation: Segmented patients based on health conditions and demographics using K-Means clustering. Improved resource allocation for preventive care by 22% in a simulated healthcare dataset. Analyzed over 500K records from a publicly available Kaggle EHR dataset. Developed a predictive model to flag high-risk patients for chronic diseases.
Key Tools: Python, Scikit-Learn, Matplotlib.
Market Direction Predictor:
Technologies: Python, Pandas, NumPy, Scikit-Learn, Matplotlib.
Description: Developed a predictive analytics model as part of a course project aimed at forecasting market trends. Utilized statistical analysis and machine learning algorithms to analyze historical market data and predict future directions.
Achievements: Achieved model accuracy of over 85%, outperforming baseline predictions by 15%. Enhanced personal expertise in quantitative finance and machine learning applications in economic forecasting.
Programming: Python (Pandas, NumPy, Scikit-Learn, TensorFlow), SQL, R (expert)
Machine Learning: Classification, Regression, Clustering, NLP (Expert)
Big Data: Spark, Hadoop, Power BI (Expert)
Data Visualization: Matplotlib, Seaborn, Tableau, Power BI (Expert)
Tools: Jupyter Notebooks, Watson Studio, OpenAI API, statistical analysis, data mining, predictive modeling, operational efficiency, process optimization, data governance, automation solutions (expert)
Project management
Team collaboration