
Senior Software Engineer with 10+ years of experience designing scalable data platforms, data warehouses, and ELT pipelines across AWS, Azure, and GCP for enterprise analytics and regulatory reporting. Strong expertise in Python, SQL, and Spark, building high-performance batch and real-time data pipelines using Snowflake, BigQuery, and Synapse Analytics. Extensive experience in developing and optimizing data warehouses using Snowflake, Redshift, BigQuery, and Azure Synapse, enabling high-volume analytical workloads and reporting. Proficient in designing dimensional data models (star/snowflake schemas) and Data Vault architectures, delivering analytics-ready datasets for business intelligence and compliance use cases. Hands-on experience building scalable ETL/ELT pipelines using AWS Glue, Azure Data Factory, Databricks, and Apache Airflow with strong focus on performance and reliability. Developed real-time streaming pipelines using Kafka, Flink, Pub/Sub, and Kinesis, supporting fraud detection, IoT analytics, and event-driven architectures. Strong programming expertise in Python and PySpark, implementing modular, reusable, and production-grade data engineering solutions with robust error handling and monitoring. Designed and implemented cloud-native data lakes and lakehouse architectures using S3, ADLS Gen2, and GCS, enabling data ingestion, governance, and analytics. Experience with CI/CD and DevOps practices using Jenkins, Azure DevOps, GitHub Actions, Terraform, Docker, and Kubernetes for automated deployments and infrastructure provisioning. Implemented data quality frameworks using Great Expectations and custom validation techniques, ensuring data accuracy, completeness, and consistency across pipelines. Strong experience in metadata management, lineage, and governance using Apache Atlas, Amundsen, AWS Glue Catalog, and cloud-native cataloging solutions. Ensured data security and compliance using IAM, RBAC, encryption, and governance frameworks aligned with GDPR, HIPAA, and financial regulatory standards. Delivered BI and reporting solutions using Power BI, Tableau, and Looker, enabling business users with real-time insights and operational dashboards. Collaborated with business, analytics, and platform teams to translate requirements into scalable data solutions aligned with enterprise architecture standards. Supported machine learning and advanced analytics initiatives by delivering curated, feature-ready datasets and optimized data pipelines. Experienced in building fault-tolerant and highly available data systems with strong monitoring, alerting, and SLA management across distributed environments.