
Over 7 years of experience as a Data Engineer, specializing in designing and implementing complex data solutions. Proficient in Big Data technologies: Hadoop, Spark, HDFS, Hive, and more. Skilled in programming languages: Python, SQL, Scala, PowerShell, and JavaScript. Extensive experience with databases: MySQL, SQL Server, Oracle, Teradata, Snowflake. Strong background in data modeling and ETL processes using SSIS and DBT transformations. Managed data workflows, ensuring data quality and compliance. Developed and deployed high-performance machine learning models in production. Designed and implemented end-to-end ETL pipelines in Azure Data Factory (ADF). Leveraged DBT for storing transformed data and creating SQL models. Utilized Azure Databricks for efficient data processing and transformation. Implemented data ingestion strategies into Snowflake using Snowpipe and bulk loading. Led the migration of on-premises databases to cloud platforms, minimizing downtime. Applied Spark Streaming for real-time data processing and transformation. Integrated Power BI with Snowflake for in-depth reporting and analysis. Developed Spark applications using PySpark and Spark-SQL for data extraction and transformation. Created Directed Acyclic Graphs (DAGs) in Airflow for workflow management. Automated regular AWS tasks using Python scripts for efficient operations.
AWS Certified Developer – Associate
Data Engineer