I bring over 12+ years of experience as a Senior Data Engineer specializing in Big Data engineering and cloud solutions. I have expertise in developing and optimizing ETL processes using Spark and leveraging various AWS services and with a background in production deployment and support. Moreover, I have coordinated with multiple teams and handled the scrum board for Jira stories, ensuring effective collaboration and task management.
I am well-versed working with team(s) to address intricate data challenges and pioneering innovative data pipeline architecture approaches. Additionally, my certification as an associate developer for Apache Spark 3.0 attests to my expertise in this area. I have extensive experience in writing and optimizing Spark SQL, leveraging it to efficiently query and process large-scale datasets, contributing to efficient data processing and analysis.
Furthermore, I have experience and expertise in architecting, configuring, scheduling, and monitoring pipelines using Apache Airflow. This includes developing intricate Directed Acyclic Graphs (DAGs) that orchestrate and automate workflow processes, ensuring efficient and reliable data processing and delivery.