Data Engineer with expertise in AWS and Azure, led successful database migrations at Carelon. Skilled in SQL and Python, optimizing ETL pipelines and enhancing data processing efficiency. Collaborative team player driving solutions for improved system performance and scalability. Committed to customer satisfaction and delivering exceptional service.
Overview
8
8
years of professional experience
1
1
Certification
Work History
Data Engineer Support
Elevance Health
Dallas, USA
06.2024 - Current
Optimized over 1,000 SQL Server databases and SSIS platforms, achieving 99.99% uptime.
Led migration of SQL Server and Oracle databases to AWS, ensuring less than 30 minutes downtime.
Enhanced post-migration query performance by 40% through strategic cloud integration.
Automated large-scale database transfers with Azure Database Migration Service to minimize manual tasks.
Implemented High Availability and Disaster Recovery solutions using SQL Server Always On for uninterrupted operations.
Developed ETL pipelines with IBM DataStage, increasing job efficiency and reliability.
Automated batch processing via Control-M and Unix shell scripts to streamline workflows.
Software Engineer-Data
New Era Technology
Sunnyvale, USA
01.2023 - 06.2024
Implemented data models on AWS Redshift and RDS, facilitating faster analytics for business teams.
Optimized batch job processes in Teradata SQL, achieving a 30% reduction in processing time.
Built live analytics pipelines with Kafka, PySpark, Apache Flink, and Hive to enhance customer engagement.
Automated machine learning workflows within managed Databricks environments to ensure job reliability.
Migrated on-premise applications to AWS Cloud (EMR and EC2), increasing scalability and minimizing downtime by 40%.
Led code reviews to ensure quality and maintainability of software products.
Collaborated with cross-functional teams to develop and implement software projects.
Software Engineer III -Data
Freewheel
New York, USA
07.2022 - 12.2022
Implemented Continuous Integration and Continuous Delivery pipelines with Git, GitHub, and Terraform, automating deployment processes.
Troubleshot and resolved software issues, ensuring minimal disruption to client services.
Designed automated data workflows in Databricks using PySpark and Scala to enhance processing speed for large datasets.
Developed and optimized ETL pipelines in Databricks, enabling real-time data integration with Delta Lake.
Created robust data transformation layers using Python-based Spark applications to improve efficiency and scalability.
Engineered Spark Streaming applications for low-latency processing of real-time data streams from Kafka.
Developed backend modules utilizing Spring Boot within a microservices architecture deployed on AWS.
Authored SQL scripts and Lambda functions for log aggregation and customized ETL processes.
Supported cloud infrastructure setup including EC2, S3, IAM policies, and monitored services via CloudWatch.
Data Engineer
Albertsons
Dallas, USA
01.2022 - 06.2022
Optimized Hive queries utilizing best practices, Hadoop, YARN, Python, and PySpark to enhance performance.
Optimized ETL processes for efficient data integration from various sources.
Developed detailed project plan for data migration from legacy systems to Snowflake database.
Engineered scalable data pipelines for ingestion, aggregation, and consumption of AWS S3 data into Snowflake.
Designed SSIS packages for ETL processes, transferring existing data to SQL Server for SSAS cubes.
Led development of ETL/ELT pipelines using Cloud Data Fusion, ensuring efficient data flow for high-volume datasets.
Created custom SQL scripts to boost database efficiency and decrease load times.
Executed ETL testing activities, extracting and transforming data for upload into Data Warehouse servers.
Established real-time and batch processing workflows with GCP tools like Data Flow and Composer.
Software Engineer
Apex CoVantage – Em Raaga Informatics
Herndon, USA
05.2017 - 12.2020
Designed scalable backend systems using Snowflake, ensuring efficient data storage and retrieval.
Optimized ETL pipelines with AWS Glue and Python scripts to enhance data processing efficiency.
Created complex SQL queries for Snowflake, improving performance through indexing and restructuring.
Developed AWS Lambda functions for real-time data processing, enhancing system responsiveness.
Automated data integration workflows within Snowflake, maintaining continuous data consistency.
Collaborated on cloud-native solutions, ensuring high availability and scalability for real-time analytics.
Integrated software solutions with AWS services to support critical data applications efficiently.
Documented architectural designs for future enhancements and maintenance.
Education
Master's Degree - Computer & Information Science
Southern Arkansas University
AR
01.2022
Bachelor of Technology - Electronics, Communication & Computer Engineering
Kakatiya University
India
01.2017
Skills
Java and Python programming
Data analysis with NumPy and Pandas
Data visualization with Seaborn and Matplotlib
SQL and NoSQL databases
Cloud platforms: AWS and Azure
Big data technologies: Spark, Kafka, Airflow
Data processing frameworks: HDFS, Hive, Sqoop
Containerization: Docker and Kubernetes
CI/CD tools: Jenkins, GitLab, Bamboo
REST APIs and data formats: JSON, XML
Data structures and SQL optimization
Cloud migration strategies
ETL development
Certification
Microsoft Certified: GCP cloud Associate
Certified in AWS Academy Data Analytics by AWS Academy Graduate
Timeline
Data Engineer Support
Elevance Health
06.2024 - Current
Software Engineer-Data
New Era Technology
01.2023 - 06.2024
Software Engineer III -Data
Freewheel
07.2022 - 12.2022
Data Engineer
Albertsons
01.2022 - 06.2022
Software Engineer
Apex CoVantage – Em Raaga Informatics
05.2017 - 12.2020
Master's Degree - Computer & Information Science
Southern Arkansas University
Bachelor of Technology - Electronics, Communication & Computer Engineering