Summary
Overview
Work History
Education
Skills
Certification
Timeline
Generic

Venkatarao Kolluri

Jersey City,NJ

Summary

Senior Data Engineer with 11+ years of expertise in Azure, Snowflake, AWS, ETL and Data Warehouse as well as Quality Testing. Specialized in designing scalable data architectures, optimizing cloud-based data pipelines, and automating workflows using Airflow, DBT and had experience in Application Production Support. Expertise in AWS cloud services such as S3, IAM and Lambda. Skilled in Snowflake Data Warehouse, including Snow pipe, Streams, Time Travel, Cloning and Managed large-scale data processing. Experience in ETL Development and Data Integration across AWS and Snowflake. Built robust data transformation workflows. Designed and developed ADF pipelines to orchestrate data ingestion, transformation, and movement over ADLS for HDInsight and Synapse Analytics. Integrated ADF with Azure Databricks, Azure Data Lake, Synapse Analytics, Snowflake, Blob Storage and SQL Databases for seamless data flow. Enabled incremental loads and delta processing using watermark and control tables to optimize performance. Configured Linked Services, Datasets and Triggers to automate and schedule ETL processes efficiently. Deep understanding of modern data formats including Delta Lake and Apache Iceberg. Developed and managed notebooks, orchestrated workflows using Databricks Jobs, and automated deployments using CI/CD integration with Git. Implemented robust logging, error handling and alerting mechanisms using ADF activity outputs and Azure Monitor. Implemented parameterized pipelines and dynamic content expressions for scalable and reusable data workflows. Implemented Real-time data ingestion pipelines from event sources into Kusto using Functions and ADF for high-throughput analytics and for large volumes data in Azure Data Explorer (ADX). Strong knowledge of Lakehouse architecture, including structured streaming, data versioning, and ACID transactions with Delta tables. Implemented Unity Catalog for centralized data governance across multiple workspaces, enabling fine- grained access control for tables, views and files. Implemented Delta Lake features such as ACID transactions, Time Travel, and Schema Evolution to ensure data consistency and auditability. Led a team of data engineers in designing scalable data pipelines, mentoring junior members, conducting code reviews and aligning solutions with business objectives through stakeholder collaboration. Strong knowledge of SQL, Postgres SQL and database management. Designed and optimized relational SQL databases. Experience in Data Modelling (Star/Snowflake Schema) and knowledge on Data Governance. Ensured data integrity, security and compliance. Experience in Apache Airflow, DBT, basic Python for workflow automation. Automated end-to-end ETL processes. Hands-on experience with CI/CD pipelines and Git for deployment automation. Ensured smooth integration and code deployment. Expertise in database performance tuning and optimization for Snowflake. Improved query execution efficiency. Developed scalable Snowflake architecture with dynamic schema management. Ensured cost-effective and high-performance storage. Automated data validation and transformation using Snowflake Streams and Tasks. Enhanced data quality and consistency. Designed and implemented scalable Data Vault 2.0 architecture in Snowflake, including Hubs, Links, and Satellites to support auditability, traceability, and flexible schema evolution. Developed Business Vault and reporting views from Raw Vault, enabling self-service analytics and regulatory compliance through consistent, historical, and source-agnostic data models. Experience with Role-Based Access Control (RBAC) for data security in Snowflake. Managed secure access control for multiple users. Implemented data masking and encryption for compliance with security standards. Ensured protection of sensitive data. Created optimized Snowflake queries for high-performance data retrieval. Reduced query latency and improved data processing. Designed ETL pipelines using Snowflake and DBT for seamless data integration. Improved data transformation and analytics. Implemented zero-copy cloning for efficient data testing and versioning. Enabled faster data recovery and environment replication. Designed CDC (Change Data Capture) processes for real-time data sync. Ensured accurate and timely data updates. Strong understanding of Snowflake caching mechanisms for query optimization. Improved overall system responsiveness. Experienced in Snowflake Resource Monitors to control compute cost. Implemented cost-effective resource allocation strategies. Successfully managed concurrent involvement in two projects. Proficient in functional, regression and end-to-end testing in Agile and Waterfall methodologies. Ensured data quality through comprehensive testing. Demonstrated leadership by mentoring and training junior team members to maintain scripting standards within the project. Diagnose and troubleshoot application problems, including software bugs, performance issues, and configuration errors. Utilize log files, monitoring tools, and other diagnostic resources to identify root causes. Apply strong technical skills and good business knowledge together with investigative techniques and problem-solving skills to identify and resolve issues efficiently and in a timely manner. Create and maintain apps run book documentation and knowledge base articles. Participate in post-incident reviews and contribute to the development of preventive measures. Respond and resolve application-related issues reported by internal and external users. Work with various teams to resolve application-related problems and enhance user experience. Identifying system improvement opportunities based on tracking product support requests or repetitive issues and making recommendations to development and engineering on potential solutions. Work on initiatives and continuous improvement process around proactive application health monitoring, reporting, and technical support. Involved in Enhancement and resolving the bugs.

Overview

11
11
years of professional experience
1
1
Certification

Work History

Senior Data Engineer

T-Mobile, USA
USA
08.2025 - Current
  • Architected and owned an enterprise-scale data platform using Azure Data Lake Storage (ADLS Gen2) and Snowflake, supporting high-volume transactional and reference data from upstream systems.
  • Built end-to-end ingestion pipelines using Azure Data Factory (ADF) to extract data from Oracle, SQL Server, and MySQL, landing data into raw and curated zones in ADLS and Snowflake.
  • Implemented incremental and CDC-based ingestion using watermarking, control tables, and change tracking, enabling accurate and near real-time data availability in Snowflake and ADLS.
  • Engineered streaming and near–real-time pipelines using Azure Event Hubs and Structured Streaming, persisting processed data into ADLS and Snowflake for operational and analytical use cases.
  • Designed and maintained Delta Lake tables in Databricks with ACID transactions, schema enforcement, time travel, and versioning for consistency and auditability.
  • Designed and optimized Snowflake schemas, tables, and views to support scalable analytics and reporting, ensuring high query performance and concurrency.
  • Implemented data validation, reconciliation, and quality checks across ingestion and transformation layers to ensure data integrity for financial and regulatory reporting.
  • Built parameterized, reusable ADF pipelines, configuring linked services, datasets, and triggers to support automated, scalable ETL workflows.
  • Integrated Azure Databricks with ADF to orchestrate complex multi-step batch pipelines across development, QA, and production environments.
  • Implemented secure access controls using Azure RBAC, Managed Identities, Key Vault, and Snowflake RBAC, aligning with enterprise security and governance standards.
  • Supported production releases using Azure DevOps CI/CD pipelines, automating build, test, and deployment of data engineering artifacts.
  • Monitored pipelines and workloads using Azure Monitor, Log Analytics, Databricks metrics, and Snowflake resource monitoring, proactively resolving performance issues.
  • Performed schema evolution and data type validation to support changing business requirements without impacting downstream consumers.
  • Supported capacity planning and workload sizing for Databricks, Synapse, and Snowflake to handle peak financial processing volumes.
  • Developed and maintained robust data pipelines using DBT (data build tool) for transformation and modelling on Snowflake.
  • Designed, developed, and enhanced system capabilities as defined in functional specification documents and project requirements.
  • Built modular, reusable, and testable SQL models, macros, snapshots, and seeds to create scalable and maintainable transformation layers.
  • Tested transformation logic across control layers to ensure data accuracy and reliability.
  • Identified and resolved performance and scalability issues in the Snowflake environment.
  • Writing SQL Queries to business analysis & for reporting teams up on request.

Snowflake Data Engineer

DTCC, USA
USA
03.2017 - 07.2025
  • Interacted with BA team to understand the process flow and the business.
  • Worked on PUT, LIST, COPY and GET commands in Snow SQL to bulk load the data into Snowflake tables from internal/external stage.
  • Worked on snow pipe for continuous data load from the AWS S3 bucket.
  • Cloned production data for code modifications and testing.
  • Strong understanding of various data formats such as CSV, XML, JSON etc.
  • Batch loading from the AWS S3 to snowflake stages and then moved to tables by using the COPY command.
  • Implemented the entire flow of data ingestion in the snowflake data warehouse.
  • AWS Integration & ETL work is the major role in this project.
  • Involved in design and development of Snowflake Database Components.
  • Involved in monitoring the workflows and in optimizing the load times.
  • Writing and developing the SQL queries to Extract, Load, and Transform data.
  • Implement and Leverage Time Travel, Cloning, Streams(CDC), Tasks.
  • Collect Load starts of start time, end time, total records loaded and notify production team with load details.
  • Developed and optimized stored procedures using SQL for efficient data extraction, transformation, and loading across databases.
  • Working on stored procedure to check if the row count between source and target matches.
  • Working on stored procedure to trigger the pipeline only if the file is received before 10 AM.
  • Working on stored procedure to insert the records into multiple tables.
  • Design, develop and maintain scalable data models and transformations using DBT in conjunction with Snowflake.
  • Utilize DBT to convert raw data into structured datasets, enabling efficient analysis and reporting.
  • Write and optimize SQL queries within DBT to enhance data transformation processes and improve overall performance.
  • Practical understanding of the Data modelling concepts like star/snowflake schemas & Facts dimension tables.
  • Comprehensive knowledge and experience in normalization.
  • Working on setting up workflow using Airflow for managing & scheduling the tasks.
  • Executing the DAG's that has been created already using python to run all the Root, child & Final tasks.
  • Using python in VS code for Data cleansing, Data processing & transformations using pandas & NumPy packages in python.
  • Responsible for creating and managing the integrations between AWS & snowflake, DBT, DB objects, stages, file formats as a admin tasks.
  • Worked on importing data from marketplace as a admin tasks.
  • Created & manage the users, roles, privileges, data masking.
  • Created snow pipes to load streaming data from S3 bucket. Created SQS and implemented IAM policies.
  • Integrated datastage with AWS services (S3) and snowflake cloud platforms to support robust, cloud-native data workflows.
  • Collaborated with cross-functional teams including data analysts, data scientists, and business stakeholders to translate requirements into efficient datastage jobs and deliver actionable insights.
  • Developed custom plugins and transformations in Data cloud Fusion to meet complex data integration and cleansing requirements.

Software Engineer

DELL, India
India
03.2016 - 10.2016
  • Worked on UNIX scripting to extract dates of previous Quarter.
  • Writing SQL queries in Netezza database connector.
  • Created HPQC Test Cases and running in HPQC Test labs.
  • Created Unit Test case document, run book and release notes docs to support Testing teams.
  • Understand the business rules completely and implements the data transformation methodology.
  • Extracted Data from different source systems and transformed as per the requirements and loaded into Fixed Width files.
  • Involved in development using IBM info sphere Data stage V11.3
  • Involved in Code Deployment to different Regions and Prepared deployment documents for every code release.

Senior Software Engineer

UHG, India
India
10.2015 - 02.2016
  • Worked on UNIX scripting to do required operations on Landing Files.
  • Worked on Creating XML Canonical snippets using XML stages in data stage.
  • Understand the business rules completely and implements the data transformation methodology.
  • Extracted Data from different source systems and transformed as per the requirements and loaded into the staging area.
  • Involved in development using IBM info sphere Data stage V9.1
  • Extensively worked with Data Stage Shared Containers for Re-using the Business functionality.
  • Involved in Code Deployment to different Regions and Prepared deployment documents for every code release.
  • Involved in implementing common ETL framework process for all the jobs in projects.
  • Fixed the bugs generated in SIT, UAT and PROD.

Software Engineer

Ness Technologies, India
India
04.2015 - 09.2015
  • Understand the STM (Source to Target Mapping) documents for individual subject area.
  • Developed ELT and ETL Jobs in the project involved Complex transformations, performance enhancements.
  • Prepared OPER docs for External Dependencies and Change configuration items.
  • Involved in Daily, weekly status calls, issue resolution meetings and onsite code acceptance meetings.
  • Automation is done by using batch logic, scheduling jobs on a daily, on a weekly and yearly basis depending on the requirement using ESP Tool.
  • Involved in Code Deployment to different Regions and ETL 490 documents for every code release.
  • Create and execute unit test plans based on system and validation requirements.
  • Fixed the bugs generated in QAT, UAT and PROD.

Education

B.Tech - undefined

JNTU Kakinada
04.2012

Skills

Operating Systems: Windows, XP, MacOS

Management Tools: HP QC, JIRA

Databases: Snowflake, Green Plum, SQL Server, PostgreSQL, Oracle

ETL Tools: Data Stage, DBT

Cloud Technologies: Azure (ADF, Databricks, Synapse,, Kusto, Service Bus, HDInsight, Logic Apps) AWS S3, SNS, SQS, IAM, Snowflake

Reporting Tool: Weaver, Power BI

Methodologies: Agile, Scrum, Waterfall

Scheduling: Autosys, Control-M, Triggers, Airflow

Certification

  • Trained in IBM Data stage and Quality stage tool
  • Certified AWS cloud Practitioner
  • Trained in RPA tool Blue Prism.

Timeline

Senior Data Engineer

T-Mobile, USA
08.2025 - Current

Snowflake Data Engineer

DTCC, USA
03.2017 - 07.2025

Software Engineer

DELL, India
03.2016 - 10.2016

Senior Software Engineer

UHG, India
10.2015 - 02.2016

Software Engineer

Ness Technologies, India
04.2015 - 09.2015

B.Tech - undefined

JNTU Kakinada
Venkatarao Kolluri