Mohd Omar Ali Khan

Data Scientist
Chicago, IL

Summary

Results-driven and versatile data professional with graduate-level expertise in machine learning, cloud computing, backend engineering, and advanced data analysis. Proven real-world experience at Amazon and successful delivery of large-scale capstone and academic projects in healthcare systems, virtualization, cybersecurity, and cloud deployment. Adept at leveraging Python, SQL, Azure, Power BI, and full-stack technologies (Flask, React.js, PostgreSQL) to build impactful solutions. Targeting roles in Machine Learning Engineering, Data Science, Data Analysis, or Data Engineering.

Work History

Machine Learning Associate

Amazon
09.2022 - 02.2023
  • Annotated and labeled complex image/video data to train ML models with high accuracy.
  • Played a pivotal role in enhancing model quality through precise data annotation.
  • Improved inventory accuracy and supported operational efficiency through detailed data checks.
  • Collaborated remotely with teams and consistently delivered high-quality work.

Education

Master of Science - Computer and Information Technology

Elmhurst University
Elmhurst, IL
05.2025

Skills

Languages/Frameworks: Python (Pandas, NumPy, Scikit-Learn, TensorFlow, Keras), SQL, JavaScript, Flask, React.js, REST APIs

Projects

Capstone Project: 

CareBridge – Healthcare Appointment Management System
Group Capstone Project, Elmhurst University (2025)

Tech Stack: React.js, Flask, PostgreSQL, JWT, Docker, Azure VMs 

• Developed a full-stack web application for doctors and patients to manage appointments, availability, and communications.

• Implemented secure JWT-based authentication, role-based dashboards, and dynamic availability scheduling.

• Deployed on Microsoft Azure using Docker containers and multi-tier architecture. 

• Designed normalized relational schema with robust data validation, encryption, and API security. 

• Planned future enhancements: telemedicine, mobile apps, multilingual support, HIPAA compliance. 

Course Projects:

Employee Data Analytics – Power BI

• Visualized gender, salary, and ethnic diversity trends across departments and cities.

• Delivered actionable insights to promote DEI and improve compensation strategies.

Automobile Acceptability Prediction – ML Modeling

• Achieved 99.81% accuracy using Gradient Boosted Trees.

• Conducted ROC, precision-recall, and confusion matrix analysis.

Tech Gear E-Commerce Analytics – SQL + DBMS

• Built normalized relational schemas and implemented complex SQL queries.

Azure VM & Monitoring Project

• Deployed VMs, created blob storage, configured CPU alerts, and validated alert triggers.

BingX Cryptocurrency Security Breach – Case Analysis

• Investigated $44M theft and proposed architecture-level security defenses.

Hyper-V Virtualization and Networking

• Managed VMs and resolved networking conflicts in Azure for remote VM access.

Current Project: 

Flash Rank – Memory-Efficient Global Ranking Engine (In Progress) 

Solo Project | Python, SVD, Count-Min Sketch, Bloom Filters, Graph Theory, CSR, Flask (planned) 

• Building a high-speed, scalable recommendation system designed to run in under 500 MB of memory.

• Applies SVD for dimensionality reduction and CSR (compressed sparse row) storage for similarity lookups.

• Uses Count-Min Sketch for frequency estimation and Bloom Filters for exclusion. 

• Implements quantized boosting to fine-tune ranking with minimal compute cost. 

• Upcoming: auto-tuned SVD rank and Flask/Streamlit API for real-time deployment.
