Experienced Data Engineer with over 4 years of expertise in designing and implementing scalable data pipelines and cloud-based data solutions. Demonstrated success across leading organizations including Caterpillar, Autodesk, and APSPDCL. Specialized in Python, AWS, Azure, and modern tools like Airflow, Snowflake, and Databricks. Strong in orchestration, automation, data modeling, and cross-functional collaboration to deliver high-quality, enterprise-grade data platforms.
At Caterpillar, I worked as part of an enterprise-level data engineering team responsible for building and maintaining dealer and customer data pipelines to support analytics and operational systems. My role involved:
At Autodesk, I contributed to the development of scalable, cloud-native data engineering solutions to support enterprise analytics and machine learning initiatives. My responsibilities included:
At APSPDCL, I was part of the data engineering team responsible for managing and modernizing the utility's data processing and reporting infrastructure. My role focused on delivering reliable and scalable data solutions to support operational, billing, and analytics systems. Key responsibilities included:
Languages: Python, Java, SQL, ABAP, SAP SQLScript
Cloud Platforms: AWS (S3, Lambda, Glue, EC2, RDS, Redshift), Azure (ADF, Functions, Blob Storage)
ETL & Orchestration: Apache Airflow, AWS Glue, Step Functions, SAP HANA SDI
Data Warehousing: Snowflake, Redshift, Azure Synapse
Big Data & Streaming: Databricks, PySpark, Apache Spark, Kinesis, HDFS, Spark Streaming
Databases: PostgreSQL, Oracle, Snowflake, SAP HANA, RDS
DevOps & CI/CD: GitHub, Azure DevOps, Jenkins, GitHub Copilot
Visualization Tools: Power BI, Tableau, SAP Analytics Cloud
Other Tools: JIRA, Terraform, REST APIs, JSON, XML, Linux