Summary
Overview
Work History
Education
Skills
Timeline
Generic

Surya Sai Chilukoti

Plantsville,CT

Summary

Results-driven Azure Data Engineer with a proven track record at GDIT, specializing in big data processing and ETL pipeline development. Expert in SQL and Python, I successfully enhanced analytics capabilities, optimizing data ingestion and reporting processes. Strong analytical skills complemented by effective collaboration with cross-functional teams to deliver impactful data solutions.

Overview

6
6
years of professional experience

Work History

Azure Data Engineer

GDIT
Rensselaer, NY
02.2025 - Current
  • Developed robust data platform on Azure to enhance analytics and reporting for operational efficiency.
  • Streamlined data ingestion using Azure services, facilitating real-time customer behavior analysis.
  • Harnessed Apache Spark Scala APIs for predictive analytics and insights into usage patterns.
  • Estimated and monitored Spark Databricks cluster size for error-free operations.
  • Designed Snowflake Data Warehouse solutions for large-scale data management.
  • Executed end-to-end PySpark ETL pipelines with Azure Data Factory orchestration.
  • Analyzed user requirements, designed and developed ETL processes to load enterprise data into the Data Warehouse.
  • Created stored procedures for automating periodic tasks in SQL Server.
  • Streamlined data flow from diverse sources using ETL tools such as Talend, Informatica, and Airflow.
  • Managed version control and deployment of data applications using Git, Docker, and Jenkins.
  • Configured and maintained cloud-based data infrastructure on platforms like AWS, Azure, and Google Cloud to enhance data storage and computation capabilities.
  • Optimized SQL queries and database schemas for performance improvements in data retrieval operations.
  • Implemented and optimized big data storage solutions, including Hadoop and NoSQL databases, to improve data accessibility and efficiency.
  • Conducted data analysis using SQL and Python to derive insights and support decision-making processes.
  • Migrated Microsoft SQL Server databases to Azure SQL, including monitoring and restoration activities.
  • Created Kafka topics for seamless data ingestion into Spark applications.

Big Data Engineer

HDFC ERGO General Insurance
India
10.2021 - 08.2023
  • Modernized and centralized enterprise-wide data operations for HDFC ERGO General insurance client.
  • Built scalable, secure data lake architecture and real-time analytics platform across Azure and GCP ecosystems.
  • Developed ETL solutions using Spark SQL in Azure Databricks for data extraction and transformation.
  • Created data pipelines in Python to load API-based JSON responses into BigQuery for analytics reporting.
  • Designed database solutions in Azure SQL Data Warehouse and Azure SQL.
  • Executed data extraction, transformation, and loading from source systems using Azure Data Factory and T-SQL.
  • Implemented data ingestion techniques from various source systems into a cohesive architecture.
  • Utilized GitHub for version control and Jenkins for scheduling data pipeline executions.
  • Implemented ETL processes for efficient data extraction and transformation.
  • Developed Python scripts for extracting data from web services API's and loading into databases.

Data Engineer

Asian Payments Bank
India
06.2020 - 09.2021
  • Designed and implemented secure, scalable cloud-native data platform integrating multiple data sources.
  • Streamlined batch and streaming data pipelines, enhancing enterprise-level reporting and compliance.
  • Configured logic apps for email notifications to end users and key stakeholders via web services.
  • Developed Notebooks using Databricks, Scala, and Spark for Delta table data capture.
  • Standardized pipeline components to maximize data usability between Azure and Google Cloud Platform.
  • Executed tool-based data transfer from Azure MSSQL Server to Google Cloud BigQuery.
  • Created and managed Azure Data Factory policies while utilizing Blob storage for backup solutions.
  • Built frameworks for snapshot management in Azure Blob Storage, including lifecycle policy setup.

Data Analyst Intern

Globex Pvt Ltd
Hyd, India
11.2019 - 04.2020
  • Understand the data visualization requirements from the Business Users.
  • Writing SQL queries to extract data from the Sales data marts as per the requirements on Linux.
  • Developed Tableau data visualization using Scatter Plots, Geographic Map, Pie Charts and Bar Charts and Density Chart.
  • Developed SQL queries and Python scripts for data cleansing, validation, and transformation.
  • Created action filters, parameters and calculated sets for preparing dashboards and worksheets in Tableau.
  • Explored traffic data from databases connecting them with transaction data, and presenting as well as writing report for every campaign, providing suggestions for future promotions.
  • Managed source code and workflow changes using Git and followed Agile sprints with Jira.
  • Worked with Sqoop commands to import the data from different databases.
  • Designed and developed Map Reduce jobs to process data coming in different file formats like XML Built reports and report models using SSRS to enable end user report builder usage.
  • Implemented SQL functions to receive user information from front end C# GUIs and store it into database.
  • Worked on report writing using SQL Server Reporting Services (SSRS) and in creating various types of reports like table, matrix, and chart report, web reporting by customizing URL Access.
  • Environment: Python, Hadoop, SQL, SSRS, Jira, OLTP, PL/SQL, Oracle, Log4j, ANT, Clear-case, Windows, Tableau, Map Reduce.

Education

Master of Science - Computer Science

Rivier University
Nashua, NH
05-2025

Bachelor of Science - Civil Engineering

Newtons Institute of Science And Technology
INDIA
05-2020

Skills

  • SQL and NoSQL databases
  • Python, R, and Scala
  • Java and JavaScript
  • Windows, UNIX, and Linux
  • Oracle and MySQL
  • DB2 and SQL Server
  • MS Access and HBase
  • Data visualization with Tableau
  • QlikView and Qlik Sense
  • SSRS and SSIS reporting
  • AWS, Azure, and GCP cloud services
  • Big data processing with Spark
  • Kafka and PySpark integration
  • WebLogic and WebSphere administration
  • Apache Tomcat and JBOSS management
  • ETL processes and data pipelines
  • Data analysis and quality assurance
  • Agile methodologies in development
  • API development and integration

Timeline

Azure Data Engineer

GDIT
02.2025 - Current

Big Data Engineer

HDFC ERGO General Insurance
10.2021 - 08.2023

Data Engineer

Asian Payments Bank
06.2020 - 09.2021

Data Analyst Intern

Globex Pvt Ltd
11.2019 - 04.2020

Master of Science - Computer Science

Rivier University

Bachelor of Science - Civil Engineering

Newtons Institute of Science And Technology
Surya Sai Chilukoti