
Results-driven Senior Big Data Engineer with 10+ years of experience designing scalable, cloud-native data architectures across AWS, GCP, and Azure. Expert in ETL/ELT development, data warehousing, distributed computing, and pipeline orchestration. Proven ability to lead data initiatives, mentor engineers, and collaborate with stakeholders to drive business decisions. Strong expertise in Snowflake, Databricks, Apache Spark, and Airflow. Passionate about performance optimization, cost efficiency, automation, and integrating AI/ML capabilities for data model enhancements and validation frameworks.
Cloud & Data Warehousing: Snowflake, Databricks, Redshift, BigQuery, Hive
ETL/ELT & Orchestration: Apache Airflow, Fivetran, Glue, Dataflow, ADF
Big Data Processing: Apache Spark, SparkSQL, PySpark, Kafka Streaming, Flink
Programming & Automation: Python (Pandas, NumPy, Scikit-learn), SQL, Shell Scripting
AI & Data Science Exposure: ML Model Integration, AI-driven Data Quality Validation, Feature Engineering
DevOps & CI/CD: Jenkins, Terraform, Azure DevOps, GitHub Actions, Kubernetes, Docker
Security & Governance: Snowflake RBAC, AWS Lake Formation, Data Compliance, GDPR/CCPA Standards
Reporting & Visualization: Tableau, Power BI, Looker
✅ Leadership & Mentorship
✅ Stakeholder Collaboration
✅ Data Strategy & Architecture
✅ Analytical Problem-Solving
✅ Communication & Documentation
✅ AI & ML Data Strategy