Career Objective
Distributed Big Data Processing for E-Commerce
Built a distributed data processing system for large-scale e-commerce analytics.
Ingested and preprocessed retail data in HDFS/cloud, enabling parallel processing.
Performed EDA and applied KMeans clustering with Spark MLlib to uncover transaction patterns.
Deployed Spark in a cloud-based cluster, optimizing performance and resource use.
Demonstrated Spark’s scalability and efficiency in real-world big data scenarios.
Machine Learning, Microsoft Excel, Python (Pandas, NumPy, Matplotlib, Scikit-learn), SQL (MySQL, PostgreSQL), Tableau, Power BI, Jupyter Notebooks, AWS (S3 for data storage)
MySQL, PostgreSQL, Microsoft SQL Server, Power BI, Tableau, Python (Pandas, NumPy, Matplotlib, Seaborn), Advanced Microsoft Excel, Word, PowerPoint, Basic knowledge of SAP BusinessObjects, Jupyter, Git, GitHub
Experienced in client coordination, requirement gathering, and cross-functional teamwork to meet project deliverables.