
PRANEETH VARMA PENMATSA

Charlotte, NC

Summary

With three years of experience, I'm an adept AWS Data Engineer and Cloudera specialist, skilled in MapReduce, Hive, Python, PySpark, Scala, Kafka, and Spark Streaming. Holding a Master's degree in IT, I integrate diverse technologies for robust data solutions. Accomplished engineer offering extensive cloud monitoring, deployment, and troubleshooting skills. Defined, built, and maintained infrastructure using vendor-neutral and platform-specific tools.

Overview

5 years of professional experience

Work History

AWS Data Engineer

Vertex Analytics
09.2023 - Current
  • Spearheaded the design and implementation of pivotal AWS data solutions, optimizing ETL processes and bolstering team productivity.
  • Seamlessly integrated data applications with AWS services, ensuring secure storage and access through IAM roles and key management.
  • Led the migration of legacy data systems to AWS, achieving substantial cost savings and performance improvements.
  • Implemented and optimized AWS databases for enhanced performance and scalability.
  • Configured AWS monitoring tools for comprehensive log collection and security measures, strengthening the data infrastructure's troubleshooting capabilities and overall security posture.
  • Utilized AWS profiling tools to identify and address performance bottlenecks in data processing, resulting in improved overall system performance.
  • Used SQL to query and manipulate data in relational databases, with a focus on AWS-supported databases such as Amazon RDS (Relational Database Service).

AWS Data Engineer

BYJU’S
12.2019 - 12.2021
  • Spearheaded the design and implementation of AWS-based data solutions to enhance the efficiency and scalability of data processing workflows.
  • Collaborated with cross-functional teams to gather data requirements and translated them into effective AWS data engineering solutions.
  • Managed metadata associated with ingested data, including schema definitions, lineage, and data dictionaries, using tools like Apache Atlas or Cloudera Navigator.
  • Developed and optimized ETL processes using AWS Glue, ensuring seamless extraction, transformation, and loading of data from diverse sources.
  • Implemented and managed AWS databases, including Amazon RDS and Amazon Redshift, for effective storage and retrieval of structured and unstructured data.
  • Utilized SQL queries for data analysis, reporting, and troubleshooting within AWS-supported databases.
  • Monitored cluster health, resource utilization, and security compliance using Cloudera Manager.
  • Engineered and maintained data pipelines, leveraging AWS services such as Lambda, S3, and Step Functions.
  • Established and maintained CI/CD pipelines for AWS-based data applications using tools like AWS CodePipeline and AWS DevOps.
  • Conducted performance tuning and optimization of SQL queries and database structures to improve overall system efficiency.
  • Developed ETL processes to extract, transform, and load data efficiently, leveraging tools like Apache NiFi, Sqoop, or custom scripts.

Jr. SQL Developer

DR.RAJU’S
06.2019 - 12.2019
  • Executed and optimized SQL queries to extract and manipulate data, contributing to improved query performance.
  • Implemented data validation processes, ensuring accuracy and reliability of datasets, and conducted quality assurance checks.
  • Contributed to documentation of database schemas and query logic, and generated basic reports to support decision-making processes.
  • Addressed issues related to query performance and data inconsistencies, showcasing problem-solving skills under supervision.
  • Demonstrated adaptability in handling multiple tasks and priorities within a fast-paced work environment.

Education

Master of Science - Information Technology and Management, Business

The University of North Carolina at Greensboro
Greensboro, NC
05.2023

Bachelor of Technology - Computer Science and Engineering

Sarvepalli Radhakrishnan University
2020

Skills

Languages: SQL, Python, MapReduce, Hive, PySpark, Scala, Kafka, Spark Streaming

Role Based: Data Engineering on AWS, ETL Process Optimization, Database Management, Data Analysis and Reporting, CI/CD Pipeline Setup and Management

Technical Tools: AWS Services (S3, Glue, Athena, Redshift, EMR, Lambda), SQL Profiling Tools, AWS Monitoring Tools, AWS CodePipeline, AWS DevOps

Key Strengths

  • Extensive knowledge and hands-on experience with AWS services such as S3, Glue, Athena, Redshift, EMR, and Lambda, alongside proficiency in Cloudera's ecosystem for comprehensive data management.
  • Strong skills in designing, developing, and optimizing ETL processes for efficient data extraction, transformation, and loading, utilizing both AWS and Cloudera technologies.
  • Proficient in Python, SQL, and other programming languages commonly used in data engineering for big data processing, integrating seamlessly with both AWS and Cloudera platforms.
  • Solid understanding of data modeling techniques for relational and NoSQL databases, ensuring effective and scalable data storage and retrieval across AWS and Cloudera environments.
  • Experience in managing and optimizing databases on both AWS and Cloudera platforms, implementing best practices for performance and scalability.
  • Familiarity with best practices for securing data in both AWS and Cloudera environments, including AWS IAM, encryption, and Cloudera's security features.
  • Proficient in setting up and managing CI/CD pipelines for data applications using tools like AWS CodePipeline or AWS DevOps, with the ability to incorporate Cloudera technologies as needed.
  • Adept at identifying and resolving data-related issues across AWS and Cloudera environments, leveraging analytical skills and attention to detail.
  • Ability to collaborate effectively with cross-functional teams, including data scientists, analysts, and stakeholders, to deliver comprehensive data solutions spanning both AWS and Cloudera ecosystems.
  • Demonstrated commitment to continuous learning and professional development, staying updated on the latest AWS services, Cloudera technologies, and data engineering best practices.
