Data Engineer with 2 years of experience in ETL development using PySpark and Snowflake, specializing in data transformation and workflow orchestration with Apache Airflow. Proficient in Python and SQL, with a solid foundation in Big Data technologies and cloud platforms. Passionate about designing scalable data solutions and optimizing pipelines to enhance business insights and decision-making.
Overview
2 years of professional experience
Work History
Data Engineer
Cognizant Technology Solutions (CTS)
Hyderabad, Telangana
05.2021 - 07.2023
Developed, maintained, and optimized ETL workflows and data pipelines.
Assisted in orchestrating ETL workflows with Apache Airflow.
Provided technical support and enhancements for data transformation processes.
Participated in code reviews to improve code quality and performance.
Debugged and optimized SQL queries for performance improvements.
Designed and implemented automated monitoring systems for data pipelines to ensure data integrity.
Developed documentation and best practices for data ingestion and transformation processes.
Collaborated with cross-functional teams to define and implement data governance policies.
Implemented data security measures and ensured compliance with company standards.
Analyzed system performance and proposed optimizations that led to a 20% reduction in data processing time.
Assisted in data migration and integration from on-premises systems to the cloud.
Developed unit tests and implemented data quality checks.
Documented data flow processes and created workflow diagrams.
Supported the maintenance of an internal application with unit testing and bug fixes.
Contributed to automation efforts that reduced manual workload by 30%.
Designed and implemented reusable data processing scripts, improving efficiency and accuracy.
Education
Master of Science - Computer Science
University of Central Missouri
Warrensburg, MO
12.2024
Skills
Programming Languages: Python, SQL
Data Engineering Tools: PySpark, Snowflake, Apache Airflow