Seasoned Data Architect adept at understanding mandates, developing plans, and implementing enterprise-wide solutions. Complex problem-solver with an innovative approach. Ready to bring several years of progressive experience and take on a challenging new role with growth potential.
Overview
5 years of professional experience
1 Certification
Work History
Data Engineer
Optum
MN
02.2024 - Current
Architect and maintain scalable real-time and batch data pipelines using Azure Data Factory, Kafka, and Databricks. Implement reliable ETL/ELT workflows to ingest, transform, and store data across ADLS, Snowflake, and SQL Server.
Develop and optimize a Medallion Architecture (Bronze, Silver, Gold) using Databricks and Delta tables. Leverage Kafka and Databricks Structured Streaming for real-time data processing, with low-latency transformations and analytics.
Ensure data governance, security, and compliance with Unity Catalog for access control and lineage tracking. Manage Azure Key Vault for credential storage, encryption, and secure access.
Optimize Snowflake data modeling, query performance, and storage. Fine-tune Spark jobs and Delta Lake performance in Databricks for efficient, large-scale data processing.
Automate deployments with CI/CD pipelines using GitHub Actions and Azure DevOps. Collaborate with cross-functional teams to design scalable data solutions, document processes, and support data-driven decision-making.
Data Engineer
Datapro Information Technology
Pune, Maharashtra
08.2020 - 11.2022
Developed and managed ADF pipelines for ingesting data from Oracle and ADLS, implementing incremental loading, scheduling automation, and performance optimizations.
Designed and optimized PySpark/Scala scripts in Databricks for data cleansing, transformation, and performance tuning, improving processing efficiency and scalability.
Built and maintained fact and dimension tables in Azure Synapse, optimizing queries with indexing, partitioning, and materialized views for enhanced analytical performance.
Implemented Azure Key Vault and Managed Identities for secure credential storage and access control, while automating deployments through Azure DevOps CI/CD pipelines.
Monitored and troubleshot pipeline failures, job performance, and query workloads to maintain data integrity and governance, working closely with BI teams and stakeholders.
Education
Master of Science - Data Analytics
Indiana Wesleyan University
Marion, IN
08.2024
Skills
Programming languages: Python, SQL, Scala
Cloud Technologies: Azure Databricks, Azure Data Factory, Azure Data Lake Storage, Azure Key Vault
Big Data Tools: Apache Kafka, Apache Spark, Delta Lake
Databases and Warehouses: Snowflake, SQL Server
Packages and frameworks: PySpark, Delta tables
Productivity and DevOps: Git, GitHub, Azure DevOps, CI/CD Pipelines
Certification
Data Engineering on Microsoft Azure (DP-203), Microsoft