Data-Driven Product Design, Volleyball, Traveling, Hiking
Results-driven Data Scientist with 6 years of experience in building scalable machine learning solutions across insurance, healthcare, finance, and consumer industries. Proven expertise in predictive modeling, real-time data pipelines, and cloud-based ML platforms like Azure and AWS. Skilled in deploying end-to-end solutions using Python, Spark, and Kafka, and translating insights into impactful business outcomes through dynamic dashboards and stakeholder collaboration. Adept at solving complex problems with a blend of data engineering, machine learning, and domain knowledge.
Machine Learning: Logistic Regression, Random Forest, SVM, GBT, XGBoost, K-means, Hierarchical Clustering, NLP, Fraud Detection, Churn Prediction, AUC, ROC, K-fold
Data Engineering: Spark, Kafka, ETL, CDC, Real-time Processing, CI/CD (Jenkins, Git)
Cloud & Deployment: Azure ML, Databricks, AWS Lambda, Docker, Kubernetes
Visualization: Power BI, Tableau, Tableau Server, Excel
Programming: Python, SQL, Bash
Tools & Platforms: Git, Jenkins, Azure AutoML, EHR Systems
Data-Driven Product Design, Volleyball, Traveling, Hiking