Highly accomplished and results-driven Data Engineering Leader with over 10+ years of experience spearheading complex data initiatives across various domains. Proven expertise in architecting, developing, and maintaining peta byte scale data platforms cloud eco-systemms (AWS, Azure) and Big Data (Hadoop, Spark) environments, consistently demonstrating a progressive mindset in adopting and mastering new technologies. Adept at driving significant cost savings, enhancing data quality and governance, ensuring regulatory compliance, and levaraging cutting edge GenAI models like Anthropic claude, AWS Nova models to boost productivity.
HBA Data Lake Implementation
WFI Reporting Data Model
Data quality framework
Data Platform Modernization
IFRS 17 Data Integration Platform
Finane Data Warehouse Implementation
Enterprise Hadoop Platform Administration
Programming Languages: Python, Scala, SQL, UNIX Shell Scripting, TypeScript
Big Data Technologies: Hadoop, Spark, Hive, Pig, Sqoop, MapReduce, Flink
ETL & Data Integration: Mainframes, SSIS, Informatica (PowerCenter, BDE), Apache Airflow
Databases & Data Warehousing: SQL Server, PostgreSQL, MySQL, Oracle, Snowflake, ,IBM DB2
Streaming & Real-Time Processing: Apache Kafka, Kafka Streams, Apache Flink, Amazon Kinesis, Spark Streaming
DevOps & CI/CD: Git, Jenkins, Docker
Azure (Azure Data Factory, Synapse, Azure Databricks, Blob Storage)
AWS (EC2, S3, Glue, Redshift, DynamoDB, RDS, Athena, Kinesis, Firehose, MWAA, EMR, CDK, CloudFormation, Route53, Glue, step functions, Lambda, IAM, LakeFormation, Bedrock, Sagemakes)