Experienced Data Analyst with a strong background in Big Data, Machine Learning, NLP, and Data Engineering. Proficient in SQL, Python, and R, with expertise in data processing, ETL pipelines, and database management. Skilled in building scalable data solutions, automating workflows, and creating data visualizations that surface business insights. Adept at problem-solving, performance optimization, and data-driven decision-making.
Programming: Python, R, SQL, Java, C
Databases: MySQL, PostgreSQL, MongoDB, Oracle 12c
Big Data & Cloud: AWS (S3, Glue, Redshift), Azure, GCP (BigQuery, Dataproc)
ETL & Data Pipelines: Apache Airflow, AWS Glue, Databricks
Machine Learning & NLP: Scikit-learn, TensorFlow, Spark MLlib, BERT, LightGBM, XGBoost
Data Mining & Business Analytics: RapidMiner, Excel (Analytic Solver), PCA, Clustering, Association Rule Mining
Visualization & BI Tools: Tableau, Power BI, Matplotlib, Seaborn, Plotly
Other Tools: Git, Hadoop, Kafka, Google Colab