Data Scientist with over 5 years of experience applying data-driven, technology-focused methods. Skilled in communicating complex insights to stakeholders and building consensus around well-founded models. Proficient in developing applications and refining models for improved accuracy and efficiency.
Programming Languages: C, C, R, Java, Python, Scala
Patent : Apparatus and method for monitoring and recording disintegration times for pharmaceutical products
Link : https://patents.google.com/patent/WO2021067207A1/en
Paper : Elsevier Journal - Disintegration testing augmented by computer vision technology
Link : https://doi.org/10.1016/j.ijpharm.2022.121668
Multi-Modal RNN Prediction for Hemodialysis Patients. RNN/TensorFlow/Python
◦ Authored a thesis on designing a novel fractal-based multi-modal deep learning RNN in TensorFlow to analyze patients' heart rate variability (HRV) values and predict likely emergency events, surpassing a benchmark accuracy of 88.07%.
Airline Customer Satisfaction Analysis. Azure/Python/SQL/AutoML/Spark
◦ Built highly scalable ETL pipelines on Azure HDInsight to analyze 500K airline passenger records.
◦ Benchmarked ML algorithms such as Decision Trees and Random Forest using Azure AutoML for regression and employed gradient boosting for cross-validation.
Pennsylvania Health Insurance Analysis. Association Rules/Linear Regression/Decision Tree/AWS
◦ Analyzed a data set containing 65K records of citizens using linear regression, association rules (arules), and decision tree algorithms.
◦ Monitored metrics with AWS CloudWatch by creating dashboards to visualize the results.
Research Assistant : Martin J. Whitman School of Management, Syracuse University. 2019 - 2021
Student Supervisor : Sadler Dining, Syracuse University. 2019 - 2020
Head Event Organizer : Athenaeum Cultural Society, Anna University. 2017 - 2018
CTO & Treasurer : Computer Society of MIT, Anna University. 2016 - 2018