Results-driven graduate with an MS in Machine Learning and over 3 years of experience as a Data Engineer and Data Analyst. Demonstrated expertise in designing, developing, and optimizing data pipelines and warehouses, and in conducting in-depth data analysis to solve complex business challenges. Proficient in using Apache Spark, Airflow, AWS, SQL, Python, and Tableau to deliver scalable solutions and actionable insights. Committed to meeting deadlines and consistently delivering high-quality results.
Data Analysis, Reporting & Machine Learning
Data Pipelines & Automation
SQL Optimization & Data Preparation
Collaboration & Stakeholder Communication
Course Assistance & Student Mentorship
Machine Learning Research & Data Analysis
ETL Development & Data Pipeline Optimization
Automation & Workflow Management
Data Modeling & Machine Learning
CI/CD, Testing & Deployment
Collaboration & Stakeholder Engagement
Leadership & Mentorship
Data Governance & Security
Languages & ML Frameworks – Python, R, Java, Scala, Pandas, NumPy, Matplotlib, Seaborn, NLTK, Scikit-learn, Keras, TensorFlow, PyTorch, spaCy
Big Data & Data Warehousing – Apache Spark, Hadoop, Kafka, Hive, Cassandra, Airflow, Snowflake, dbt, PostgreSQL, MySQL, MongoDB, Oracle
Visualization & CI/CD – Tableau, Power BI, Arcadia Data, Git/GitHub, Bitbucket, SVN, Datadog, Docker, Kubernetes, Jenkins, Jira, Linear
Cloud Services – Azure (Databricks, Data Factory, Data Lake), AWS (EMR, S3, Redshift, Glue, Lambda), GCP (Compute Engine, BigQuery, Dataflow)
Smart Glove for Sign Language - Presented at the National Level Conference on Frontiers in Engineering and Technology (NSCFET) 2017