Data Engineer with over 3+ years of experience designing, building, and maintaining data pipelines to support data integration, transformation, and reporting needs across enterprise systems.
Strong expertise in SQL and Python, with practical experience in writing complex queries, optimizing database performance, and automating data workflows.
Experienced in working with structured and semi-structured data using relational databases such as SQL Server, Snowflake, Redshift, and Oracle.
Proficient in developing scalable ETL solutions using tools like Apache Airflow, AWS Glue, Azure Data Factory, and Dataflow across AWS, Azure, and GCP environments.
Hands-on experience with real-time data streaming and processing using Kafka and Spark Streaming for time-sensitive business applications.
Proven ability to analyze and troubleshoot data pipeline issues, perform root cause analysis, and implement solutions that improve data accuracy and performance.
Background in implementing data validation and reconciliation processes to ensure consistency and reliability of data across systems.
Familiar with CI/CD pipelines, version control, and automated deployments using Jenkins, GitHub Actions, and Terraform.
Successfully collaborated on cross-functional teams, supporting multiple projects and aligning data solutions with business and technical requirements.
Committed to maintaining data security and compliance with regulatory standards including HIPAA and GDPR.