Predictive Maintenance Project (for a Concrete Ready-Mix Producer/Deliver): Currently leading an ongoing Predictive Maintenance Project for electrical equipment at a rock plant. Utilizing machine learning to predict motor failures, optimize performance, and reduce downtime. Analyzing sensor data (vibration, temperature, voltage, and amperage) and employing statistical modeling to establish normal motor benchmarks, resulting in cost savings and extended motor lifespan.
Optimization Algorithm Design, Implementation, and Deployment (for a Concrete Ready-Mix Producer/Deliver): Conceptualized and coded a multi-step optimization algorithm to enhance truck routing efficiency. This algorithm, developed using techniques from linear programming, convex optimization, and combinatorial optimization, effectively manages various constraints, including truck availability, material availability at different batch plants, and diverse order requirements. The deployed solution significantly reduces both total travel time and discrepancies from scheduled load times, thereby boosting operational efficiency and ensuring improved customer satisfaction
Environment: Python, cvxpy, scipy.optimize, Scikit-learn
Document Image Analysis project (for InterFirst Mortgage)
Environment: Python, OpenCV, Tensorflow, Keras
Stock trading strategy development (for InterFirst Mortgage)
Environment: Python, Scikit-learn, XGBoost
Environment: Python, AWS, SageMaker, Scikit-learn, Periscope Data, Redshift, Postgres SQL, MySql
Clients: PepsiCo, Bridgestone Corporation, Grupo Éxito, Guitar Center
Incorporating cannibalization sub-model into company's signature product increased overall retail demand prediction accuracy by 0.5-3%
Environment: SAS. SAS Macro Language, SAS HPF, R, Linux, Windows
Environment: C, C++, C# .NET, SSIS, T-SQL, Windows, Microsoft Windows Server, Linux, Unix, OpenVMS
Obtained speedups: ×20-30 times
Environment: NVIDIA TESLA 2050, CUDA framework, Cuda C
Obtained speedups: ×0.8×n times on n-core workstation
×10-20 times for complex double arithmetic,
×25-35 times for complex double-double arithmetic
Environment: multithreading: within Linux using C++, pthreads; GPGPU: on NVIDIA TESLA 2050 using CUDA framework/CUDA C
It improved elevation prediction accuracy from under 50% to 90%
Environment: R, R-studio
Environment: C/C++, Linux
Skilled, creative, and devoted data scientist with a PhD degree in Mathematics. Specializing in designing and implementing custom Machine Learning pipelines, while employing the most advanced statistical models and optimization algorithms, or developing new mathematical or statistical apparatus as necessary. Will help your business derive meaningful insights and optimized operational recommendations from data in hand. Will deploy, maintain, and do necessary improvements of the developed solution on your server or in the cloud, ensuring seamless integration and continued performance enhancement. Additional market data developer and parallel scientific computing experience.
eCornell Machine Learning Certificate
SAS Base Programmer