Summary
Overview
Work History
Education
Skills
Certification
Timeline
Generic

Yougender Y

Denton,USA

Summary

Experienced Data Engineer with expertise in building scalable data pipelines, proficient in cloud platforms (Azure, AWS, GCP), big data processing frameworks, and robust ETL solutions. Skilled in using Python, PySpark, SQL, and Apache Spark to automate data extraction, transformation, and loading processes, enhancing data quality, integrity, and accessibility. Demonstrated success in designing and implementing cloud-based solutions using Azure Data Factory, Azure Databricks, Synapse Analytics, and Apache Airflow, optimizing efficiency and performance of large-scale data workflows. Proficient in data modeling, integrating structured and unstructured data sources, and performing advanced analytics and visualization with tools such as Power BI, Tableau, and Jupyter Notebooks. Strong expertise in CI/CD pipelines, automation, and infrastructure as code using Azure DevOps, Docker, Kubernetes, Terraform, and CloudFormation, significantly reducing deployment time and enhancing system reliability. Proven ability in applying machine learning and deep learning techniques, leveraging cloud-based ML services for predictive modeling and analytics to facilitate strategic business decisions. Excellent communicator with a proven track record in agile environments, collaborating across teams to deliver innovative data-driven solutions, achieving measurable improvements in business outcomes.

Overview

5
5
years of professional experience
1
1
Certification

Work History

Data Engineer

International Suppliers Network
Dallas, Texas
07.2024 - Current
  • Engineered big data pipelines using Python, Azure Synapse Analytics, and Azure Data Factory, effectively migrating workloads to Azure with streamlined processes.
  • Implemented robust ETL processes with Apache Airflow, automating data extraction and transformation into an Azure Data Lake, achieving a 15% increase in pipeline efficiency.
  • Developed and optimized data models and target mappings leveraging DBT, enhancing data query performance and decision-making effectiveness by approximately 20%.
  • Migrated data workflows from Google Analytics 3 to Azure Synapse Analytics, executing SQL operations on Google Cloud Platform (GCP), significantly reducing costs and improving overall system efficiency.
  • Automated CI/CD pipelines with Azure DevOps, facilitating smoother deployments and consistent delivery cycles.
  • Built interactive web interfaces using HTML, CSS, and TypeScript, enhancing user experience and functionality.
  • Deployed and managed Open Metadata Platform, resulting in a notable customer satisfaction rate of 98%.
  • Collaborated on ETL (Extract, Transform, Load) tasks, maintaining data integrity and verifying pipeline stability.
  • Streamlined complex workflows by breaking them down into manageable components for easier implementation and maintenance.
  • Optimized data processing by implementing efficient ETL pipelines and streamlining database design.
  • Collaborated with cross-functional teams for seamless integration of data sources into the company's data ecosystem.

Data Engineer

Cognizant
Hyderabad, India
08.2021 - 07.2023
  • I designed and delivered data pipelines to make sure data storage is accessible, high-performing, secure, and scalable.
  • I developed prototypes, test scripts, and conducted tests for data replication, extraction, loading, and cleansing.
  • I built sustainable data models optimized for performance in data warehouses. I ensured data integrity and security through robust pipeline design and validation.
  • I collaborated with teams to streamline and optimize data pipeline workflows for long-term scalability.
  • Enhanced data quality by performing thorough cleaning, validation, and transformation tasks.
  • Automated routine tasks using Python scripts, increasing team productivity and reducing manual errors.
  • Collaborated with system architects, design analysts and others to understand business and industry requirements.
  • Collaborated on ETL (Extract, Transform, Load) tasks, maintaining data integrity and verifying pipeline stability.
  • Developed database architectural strategies at modeling, design, and implementation stages to address business or industry requirements.
  • Designed data models for complex analysis needs.
  • Adaptable individual with exceptional interpersonal skills and talent for building relationships. Known for delivering outstanding service and enhancing client satisfaction. Focused on fostering positive interactions and creating collaborative environment.

Education

Master of Science - Advanced Data Analytics

University of North Texas
Denton, TX
12-2024

Skills

  • C
  • C
  • Java
  • Python
  • Apace technologies
  • ETL
  • HTML/CSS
  • Unix
  • Linux
  • Salesforce
  • Springboot
  • GIT
  • MySQL
  • MangoDB
  • Snowflake
  • Tableau
  • Hadoop
  • Hive
  • Spark
  • GCP
  • Microsoft Office
  • Excel
  • Jupyter notebook
  • Google colab
  • Airflow
  • Airbyte
  • DBT
  • Spring tools
  • JDK
  • Eclipse
  • ETL development
  • Data warehousing
  • Data modeling
  • Data pipeline design
  • Big data processing
  • Spark framework
  • Scripting languages
  • SQL expertise
  • Machine learning
  • Real-time analytics
  • API development
  • Data quality assurance
  • Data pipeline control
  • SQL and databases
  • SQL programming
  • Data analysis
  • Backup and recovery
  • Big data technologies

Certification

  • AICTE (cyber security virtual internship)
  • AICTE (cloud computing virtual internship)
  • Data camp certifications: Intermediate python, Introduction to python, EDA with python, Introduction to functions in python, Visualizing time series data in python

Timeline

Data Engineer

International Suppliers Network
07.2024 - Current

Data Engineer

Cognizant
08.2021 - 07.2023

Master of Science - Advanced Data Analytics

University of North Texas
Yougender Y