Summary
Overview
Work History
Education
Skills
Certification
Timeline
Generic

Yamuna B

Summary

Resourceful professional in data architecture, known for high productivity and efficient task completion. Specialize in database design, data modeling, and information management systems. Excel in problem-solving, communication, and teamwork, ensuring successful project outcomes.

Overview

4
4
years of professional experience
1
1
Certification

Work History

Azure Data Engineer

Omni Data
Oregon, TX
08.2023 - Current
  • Optimized SQL queries, created indexes, and redesigned database schemas to improve query performance and data retrieval.
  • Streamlined data flow from various sources using ETL tools such as Talend, Informatica, and Airflow, and designed scalable data pipelines using Python and SQL.
  • Applied dimensional modeling techniques to design data warehousing solutions for efficient data retrieval.
  • Resolved complex technical issues, including reducing SSRS report runtime from 50 minutes to 3 minutes, and significantly cutting timeout issues.
  • Delivered high-impact reports and projects, contributing to a $125k increase in company billability, and recognized for exceptional customer service and problem-solving skills.
  • Designed and implemented feature engineering pipelines to extract, transform, and optimize raw data for machine learning model training.
  • Applied domain knowledge to create meaningful features, improving model accuracy and performance in predictive analytics.
  • Worked closely with data scientists and engineers to integrate engineered features into machine learning workflows for production models.
  • Automated feature engineering processes reduce manual effort and accelerate model development cycles.
  • Implemented techniques like one-hot encoding, normalization, and feature scaling to prepare data for optimal model performance.

Data Engineer

Accion Labs
Benguluru, Karnataka
07.2021 - 01.2022
  • Collaborated with application teams and product owners to design analytics solutions and successfully transitioned on-premises systems to the Azure platform.
  • Engineered queries using PySpark/SparkSQL in Azure Databricks and maintained Apache Airflow for efficient data processing and storage in Azure Blob and Data Lake.
  • Developed and implemented CDC logic for regular data processing and worked with Azure Data Factory to transform critical data into aggregated tables on Hive Cloud.
  • Involved in real-time streaming applications development using PySpark, Apache Flink, Kafka, and Hive, while also utilizing Snowflake data modeling techniques.

Education

Master of Science - Artificial Intelligence

University of North Texas
Denton, TX
05-2023

Skills

  • Big data processing
  • Performance tuning
  • Data modeling
  • Data pipeline design
  • Data migration
  • SQL expertise
  • ETL development
  • Machine Learning
  • Feature Engineering

Masters Knowledge

Deep Learning, Data Mining, Software Development for AI, NLP, Data Visualization, Statistics for Data science (Empirical Analysis)

Certification

« Microsoft Azure Fundamentals AZ900

« Microsoft Azure Fundamentals DP900

« Microsoft Azure Data Engineer DP203

« Azure Fabric DP600 (Fabric)

« Azure Fabric (DP700)

Timeline

Azure Data Engineer

Omni Data
08.2023 - Current

Data Engineer

Accion Labs
07.2021 - 01.2022

Master of Science - Artificial Intelligence

University of North Texas
Yamuna B