
Data Engineer with over 5 years of expertise in designing and optimizing data pipelines and ETL/ELT workflows across cloud and on-premises platforms. Demonstrated proficiency in Python, SQL, and big data technologies such as Spark, Hadoop, and Kafka, enabling effective real-time and batch data processing. Proven ability to convert complex datasets into actionable insights that enhance business decision-making and operational efficiency.
Environment: Azure Data Factory, Azure Databricks, Azure Data Lake, Azure Synapse Analytics, Azure SQL Database, Spark, PySpark, Python, SQL, Power BI, Azure DevOps, ETL/ELT pipelines
Environment: Google Cloud Platform (GCP), Dataflow, Dataproc, Pub/Sub, BigQuery, TensorFlow, Apache Beam, Apache Spark, Python, SQL, Google AI Platform, Cloud Build, Terraform, JavaScript
Environment: Apache Spark, Hadoop, ETL tools, SQL Server, PostgreSQL, Apache Kafka, Python, Tableau, Machine Learning (Python/Scikit-learn)
Cloud & Platforms: Azure (Data Factory, Databricks, Synapse, SQL DB, Data Lake), GCP (Dataflow, Dataproc, Pub/Sub, BigQuery, AI Platform)
Data Engineering & ETL: ETL/ELT pipelines, Data Modeling (Dimensional, Star/Snowflake), Batch & Streaming Processing, Data Governance & Quality
Big Data & Processing: Apache Spark, PySpark, Hadoop, Apache Beam, Apache Kafka
Databases & Querying: SQL Server, PostgreSQL, Spark SQL, Azure SQL Database, BigQuery
Programming & Scripting: Python, SQL, Scala, Shell, JavaScript
Machine Learning & Analytics: TensorFlow, Scikit-learn, Predictive Modeling, Data Analysis
Data Visualization & BI: Power BI, Tableau, SSIS, SSRS
DevOps & CI/CD: Azure DevOps, Cloud Build, Git, Terraform