Supriya M

Cedar Rapids, USA

Summary

Data Engineer with expertise in Cloud Computing, Data Modeling, and Database Management. Proficient in Python, SQL, and optimizing ETL processes using Microsoft Azure Data Factory (ADF) and Azure Synapse. Experienced in developing scalable data pipelines, implementing machine learning models, and driving data-driven business decisions. Skilled in Azure Data Lake Storage (ADLS), Azure Functions, and Azure Databricks to support complex data workflows in cloud environments.

Overview

4 years of professional experience

Work History

Data Engineer

United Tek info Hub LLC
08.2023 - Current
  • Company Overview: Transamerica, Cedar Rapids, IA
  • Designed and implemented scalable ETL pipelines using Azure Data Factory and Azure Synapse, improving data load performance by 20%
  • Automated data ingestion with Azure Functions and Azure Logic Apps, enhancing data processing efficiency by reducing manual steps
  • Built Power BI dashboards backed by optimized SQL queries for real-time customer insights
  • Established robust data governance using Azure Purview, enabling automated lineage tracking and metadata management for SOC2 and HIPAA compliance across enterprise datasets
  • Built a Python-based predictive model to classify high-risk defaulters, increasing debt recovery rates by 15%
  • Collaborated closely with cross-functional teams to refine data requirements and ensure alignment with business objectives, reducing redundant processing steps by 30%
  • Conducted data validation and implemented data quality checks, leading to a 25% reduction in data errors across workflows
  • Deployed CI/CD pipelines with Azure DevOps to streamline ETL workflows, enhancing system scalability and reducing deployment time by 40%
  • Created detailed documentation for ETL processes, data models, and dashboards, which improved team efficiency by ensuring consistent knowledge sharing
  • Integrated real-time data streams using Azure Stream Analytics and Apache Kafka, providing timely insights for critical decision-making processes
  • Utilized big data processing tools like Apache Spark, Scala, and Airflow to process and analyze large-scale datasets efficiently
  • Integrated Snowflake into the data architecture to enhance query performance, streamline data warehousing, and support seamless analytics across Azure-based ETL workflows
  • Assisted in architecting cloud-native solutions on Azure for processing high-volume data using Databricks and Spark

Data Analyst/Engineer

United Tekinfo Hub
07.2021 - 10.2022
  • Company Overview: India
  • Gathered and cleaned datasets from multiple sources using Excel, SQL, and Python (Pandas), achieving a 95% data readiness accuracy for analysis
  • Designed and implemented efficient data workflows and ETL pipelines to streamline data extraction, transformation, and loading processes
  • Conducted exploratory data analysis (EDA) with Python (Matplotlib, Seaborn) to uncover trends and patterns, providing actionable business insights
  • Developed interactive and dynamic data models in Power BI to facilitate intuitive reporting and data exploration for end-users
  • Collaborated with client-facing teams to identify key performance indicators (KPIs), develop actionable metrics, and implement automated reports to measure success across product lines
  • Optimized Power BI data models for performance, ensuring fast queries and responsive visualizations for large datasets
  • Designed tailored data visualizations and reports using Power BI and Excel to support unique project requirements
  • Conducted advanced data analysis using Python, R, and SQL to uncover insights and inform strategic decisions
  • Designed and delivered intuitive dashboards using Tableau, Looker, and Power BI to present complex data insights to diverse stakeholders effectively
  • Used SQL extensively for querying large datasets, optimizing joins, and preparing data repositories for reporting and visualization layers
  • Extracted, transformed, and analyzed data from sources such as SQL, BigQuery, Hive, and Presto, ensuring relevance to specific business challenges
  • Designed and deployed ETL workflows using AWS Glue and Amazon S3 for structured data ingestion, supporting batch and real-time use cases alongside Azure services

Education

Master’s - Information Technology Management

Webster University
05.2024

Skills

  • Python
  • SQL
  • Scala
  • Java
  • Unix/Shell Scripting
  • Azure
  • AWS
  • Apache Spark
  • PySpark
  • Apache Kafka
  • Hadoop
  • Azure Stream Analytics
  • Apache Airflow
  • Scikit-learn
  • Pandas
  • NumPy
  • Predictive Modeling
  • Basic ML Algorithms
  • Power BI
  • Tableau
  • Looker
  • SSRS
  • SSAS
  • Excel
  • Azure Data Factory
  • Informatica
  • Azure SQL Database
  • Snowflake
  • NoSQL databases
  • Oracle
  • DB2
  • Data Validation
  • Data Cleansing
  • Azure Purview
  • Data Lineage
  • Governance Frameworks
  • SQL Scripting
  • Azure DevOps CI/CD
  • Git
  • Jenkins
  • Docker

Personal Information

Title: Data Engineer
