Summary
Overview
Work History
Education
Skills
Certification
Timeline
Generic

Nikhitha Nagirimadugu

Houston,Tx

Summary

Proven Data Engineer with 6+ years of professional experience working with AWS, python, Scala. A highly dedicated and approachable individual with an ability to multi-task and perform well in fast paced environment. Strong desire to move forward, face and overcome new challenges by expanding my skillset to strive and be successful in a modern day – data driven environment.

Overview

8
8
years of professional experience
1
1
Certification

Work History

Data Engineer

UNVW Inc.
02.2023 - Current
  • Provide data to the company analysts and decision makers by supporting and developing massive data pipelines using ETL Process
  • Collaborate with stakeholders to define data requirements and deliver solutions and communicate progress and challenges
  • Work closely with data scientists, analysts and software engineers
  • Support and evolve the underlying infrastructure of the company's data platform
  • Migrated the data from Hadoop to Snowflake by refactoring using Pyspark, AWS Glue and SQL
  • Developed Terraform and Python scripts to deploy AWS resources and run Glue jobs in AWS for Redshift Loads/Unloads
  • Performance tuning for longer running queries in snowflake to reduce runtime
  • Adding the AWS Glue jobs to Datadog to create the alarms and monitors the performance of the jobs
  • Creating the runbooks that require the schema of all processes and their dependencies and schedules
  • Perform ETL to necessary Braintree tables after extracting the data from SRE team to help finance team run the monthly validations
  • Implement Data quality checks and identify potential data reliability issues and providing timely corrective actions on the issues/defects

Data Engineer

HCL Global
11.2021 - 02.2023
  • Emphasized on retrieving the data from Cloud Hawk and scan for sensitive data since their last extracted date
  • Used DynamoDB to insert new resources that are to be scanned
  • Worked with AWS Lambda to schedule a trigger for the EMR where the job runs to identify whether the buckets are scanned before to create manifest file for the entire bucket or to identify what objects has been changed since last scan
  • Used Scala to develop codes for Jobs that run on EMR to identify the changes in the object stored in S3
  • Worked on Bogie files to define infrastructure for the AWS components
  • Worked on Config files where the data is hardcoded
  • Extensively used CloudWatch to create alarms on the graphs based on metrics collected from AWS services and to read logs
  • Worked on end-to-end pipeline using AWS services like EMR, S3, EC2, VPC, DynamoDB, IAM, Lambda, CloudWatch etc

Data Engineer

Larsen & Toubro Infotech
12.2016 - 12.2019
  • Design, Develop and Document the new architecture and development process to convert existing ETL pipeline by using Big Data tools like Spark, Python, Hive and Sqoop
  • Worked on replacing existing Hive scripts with Spark Data-Frame transformation and actions for faster analysis of the data
  • Expertise in all components of Big Data Ecosystem- Spark, Hive, Sqoop, Oozie, Impala, Kafka Presto and Hue
  • Extensively used hive, spark performance tuning for reducing the execution time of the scripts
  • Developed Oozie workflows and sub workflows to orchestrate the Spark scripts, hive queries and the Oozie workflows are scheduled through Autosys

Education

Master of Science - Data Science

University of New Haven
West Haven, CT
12.2021

Bachelor of Science - Computer Science

GSSSIETW
06.2016

Skills

  • AWS
  • Python
  • Scala
  • Spark
  • Git
  • Agile
  • Scrum
  • JIRA
  • Snowflake
  • SQL
  • NLP
  • Machine Learning
  • Data Interpretation
  • Artificial Intelligence
  • R
  • DBT
  • BitBucket
  • HDFS

Certification

  • Snowflake SnowPro Core Certification – Snowflake Computing Inc.
  • AWS Certified Developer – Associate - Amazon Web Services (AWS).

Timeline

Data Engineer

UNVW Inc.
02.2023 - Current

Data Engineer

HCL Global
11.2021 - 02.2023

Data Engineer

Larsen & Toubro Infotech
12.2016 - 12.2019

Bachelor of Science - Computer Science

GSSSIETW

Master of Science - Data Science

University of New Haven
Nikhitha Nagirimadugu