Data Engineer with over 3 years of experience in designing, building, and optimizing data pipelines, ETL processes, and data warehousing solutions. Skilled in developing scalable and efficient data architectures to support business intelligence and analytics initiatives. Proficient in SQL, Python, Spark, and cloud platforms (AWS, Azure, GCP), with expertise in handling large-scale datasets and ensuring data integrity. Strong background in database management, performance tuning, and automation of data workflows. Adept at collaborating with cross-functional teams to deliver high-quality, data-driven solutions that enhance decision-making, and operational efficiency.
Programming Languages: Python, SQL
Data Engineering Tools: PySpark, Snowflake
Apache Airflow
Big Data Technologies: Databricks
Cloud Platforms: AWS, GCP, Azure (Basic)
Knowledge
Database Management: MySQL, PostgreSQL
Version Control & CI/CD: Git, JIRA
Data Quality & Testing: Unit Testing, Data
Validation
Workflow Orchestration: Apache Airflow