Summary
Overview
Work History
Education
Skills
Websites
Certification
Projects
Timeline
Generic

Navya Gorantla

Frisco,TX

Summary

Data Engineer with 4 years of experience following the desire to use data and technology to address issues in the Real world.

Overview

6
6
years of professional experience
1
1
Certification

Work History

Data Engineer

Advithri Technologies
09.2023 - Current
  • Implemented data quality checks and validation processes to ensure the accuracy and consistency of data loaded into Amazon Redshift during ETL processes
  • Implemented version control and CI/CD practices for ETL code, ensuring reliable and efficient updates and deployments
  • Worked on data encryption and security protocols to protect sensitive data during transit and at rest, ensuring compliance with data privacy regulations
  • Proficiently used data profiling and data lineage tools to track and analyze data movements and transformations within the ETL pipeline
  • Conducted thorough performance tuning and optimization of data pipelines, enhancing data throughput, and reducing latency in processing
  • Worked with cloud-based orchestration tools like AWS Step Functions to create complex workflows that integrate Lambda functions, S3 storage, and EC2 instances seamlessly
  • Developed custom data connectors and adapters to facilitate the integration of various data sources into the ETL process, enhancing data extraction capabilities.

Data Engineer

GCI, USA
12.2019 - 12.2021
  • Experience with migrating structured, and unstructured data to and from DynamoDB using AWS Data Pipeline or other ETL tools
  • Automated a previously manual ETL process to integrate data from multiple sources and load it into Amazon Redshift using Python
  • Experience integrating AWS Step Functions with other AWS services, such as Lambda, S3, and EC2
  • Hands-on experience building and deploying machine learning models using AWS Sage Maker
  • Creating and cloning the jobs and Job streams in the TWS tool and promoting them to higher environments
  • Design and develop data pipeline architectures using Hadoop, Spark, and related AWS Services
  • Strong understanding of Sage Maker features such as Jupiter notebooks, model hosting, and hyperparameter tuning
  • Extensive experience designing, developing, and implementing MongoDB database solutions that meet specific business requirements, and proficiency in using MongoDB to manage unstructured and semi-structured data
  • Experience with ETL pipelines in and out of a data warehouse using a combination of Python and Snowflakes Snow SQL
  • Built the Logical and Physical data models for snowflake as per the changes required
  • Proficient in using Talend's cloud-based data integration services such as Talend Cloud Data Integration, Talend Cloud Data Preparation, and Talend Cloud Data Quality.

Data Engineer

COGECO, CANADA
05.2018 - 11.2019
  • Designing and highly implementing performant data ingestion pipelines from multiple sources using Apache Spark and/or Azure Databricks
  • Increased the efficiency of the data fetching by approximately 30% using query optimization and indexing
  • Used Scala to store streaming data to HDFS and to implement Spark for faster processing of data (40% faster)
  • Familiar with using Databricks to build and deploy machine learning models using libraries such as MLlib and PySpark
  • Expertise in Microsoft Azure Cloud Services (PaaS and IaaS), Application Insights, Document DB, Internet of Things (IoT), Azure Monitoring, Key Vault, Visual Studio Online (VSO), and SQL Azure
  • Experience managing Azure Data Lakes (ADLs) and Data Lake Analytics and a good understanding of integrating ADLs with other Azure services Knowledge of USQL for data transformations, which is part of a cloud data integration strategy.

Education

M.S. Computer Science -

University of Missouri- Kansas City
Kansas City, MO

Skills

  • Python
  • SQL
  • R
  • Java
  • Scala
  • PySpark
  • Glue ETL
  • SAS
  • Shell Scripting
  • Power Shell
  • Bash
  • AWS
  • Azure
  • Cloud Security
  • S3
  • EC2
  • DataLakes
  • DataFactory
  • Data Bricks
  • CloudSQL
  • Tableau
  • Power BI
  • Matplotlib
  • Seaborn
  • Grafana

Certification

  • Google Cloud Computing Foundations, Google Cloud Skills, 2023
  • Cyber Security Tools & Cyber Attacks, IBM, 2023

Projects

Convolutional Neural Network-Based Facial Emotions Recognition System  01/2022-05/2022

UMKC, USA 

  • A CNN-based emotion recognition system, built with Scikit-Learn, Tensorflow, and OpenCV with a High degree of accuracy (86%) and confidence for real-time emotion recognition
  • Potential applications in - Behavioural analysis, real-time analysis of customer satisfaction in retail settings, and automatic photography.

Image Classification in Big Data                                                          08/2022-12/2022

UMKC, USA 

Accomplished in extracting valuable insights from vast image collection, enabling data-driven decision-making, and enhancing business intelligence in the big data domain. 

  • By leveraging advanced algorithms, scalable solutions, and distributed computing frameworks, this approach enables accurate and efficient image classification on massive datasets
  • The outcomes include improved data-driven decision-making, enhanced business intelligence, and streamlined processes for extracting valuable insights from large-scale image collections.

Timeline

Data Engineer

Advithri Technologies
09.2023 - Current

Data Engineer

GCI, USA
12.2019 - 12.2021

Data Engineer

COGECO, CANADA
05.2018 - 11.2019

M.S. Computer Science -

University of Missouri- Kansas City
Navya Gorantla