Data Engineer with 4+ years of experience designing and implementing scalable ETL/ELT pipelines, big data processing workflows, and cloud-based data platforms. Proficient in Python, PySpark, and SQL for data transformation and automation. Skilled in AWS services (S3, Glue, EMR, Redshift, Lambda, Athena, CloudWatch), with strong expertise in Airflow and Step Functions for orchestration. Adept at data modeling, data lakes, and data warehousing, with a proven ability to deliver cost-effective, reliable, business-focused data solutions. Collaborative team player experienced in Agile development, CI/CD pipelines, and Git-based workflows.
Environment: Python, PySpark, SQL, AWS (S3, Glue, EMR, Redshift, Lambda, Athena, CloudWatch), Airflow, Git, Docker, Terraform, Agile/Scrum.
Environment: Python, SQL, PySpark, AWS (S3, Glue, Redshift, CloudWatch), SQL Server, PostgreSQL, Airflow, Git, Jenkins, Agile/Scrum.