Maniteja Paritala

Sandy Springs, GA

Summary

Data Engineer with 7 years of programming experience across all phases of the Software Development Life Cycle (SDLC). Proficient in Big Data development, specializing in Hadoop ecosystem components including Spark, Hive, and Sqoop. Demonstrated expertise in developing and debugging Spark jobs for large datasets, along with strong skills in SQL and data analytics using HiveQL and Snowflake. Experienced in cloud migration, job scheduling, and optimizing complex SQL queries to enhance data processing performance.

Overview

8 years of professional experience

Work History

Data Engineer

Elevance Health
Atlanta, GA
01.2019 - Current
  • Developed Bigdata Fabric Project, managing membership, claims, and drug details from various medical sources.
  • Implemented scalable data pipelines using Apache Spark and Hive on Cloudera Hadoop to process over 500 GB of daily claims and member data.
  • Executed data cleaning operations in RAWZ with Unix and Python scripts for raw data storage.
  • Transformed datasets in APPZ according to client requirements using PySpark, exporting data for downstream processing.
  • Created Hive views that mask PHI columns to restrict unauthorized access to sensitive data.
  • Automated weekly business report generation with Python scripts, enhancing reporting efficiency.
  • Optimized Hive queries and partitioning strategies, improving performance on large HDFS datasets.
  • Maintained Cloudera Manager for performance tuning, cluster health checks, and service alerts.

Spark/Hadoop Developer

American Family Insurance
Madison, WI
08.2017 - 11.2018
  • Loaded client property and auto insurance data from multiple sources into a Hadoop data lake.
  • Transformed unstructured data into structured format using Apache Spark with Python.
  • Stored transformed data in Hive and HBase tables for business client access.
  • Applied various PySpark APIs to execute necessary data transformations on diverse file types.
  • Implemented data security measures to protect sensitive information in the system.
  • Ingested data into HDFS using Sqoop while performing transformations via Python scripts.
  • Developed Hive tables, implementing partitioning and dynamic partitions for optimized querying.
  • Deployed large datasets in Hive and HBase, using Spark SQL for efficient querying.

Education

Master of Science - Computer Science

Colorado Technical University
Denver, CO
01.2017

Bachelor's - Computer Science

Jawaharlal Nehru Technological University, Kakinada
India
05.2013

Skills

  • Big data technologies: HDFS, Hive, Spark, Sqoop, Kafka
  • Programming languages: Python, SQL
  • Development tools: Control-M, PyCharm, VS Code
  • Scripting languages: Python, Unix
  • Cloud services: AWS (S3, Lambda, EMR, Glue)
  • Databases: PostgreSQL, MySQL, Snowflake

Timeline

Data Engineer

Elevance Health
01.2019 - Current

Spark/Hadoop Developer

American Family Insurance
08.2017 - 11.2018

Master of Science - Computer Science

Colorado Technical University

Bachelor's - Computer Science

Jawaharlal Nehru Technological University, Kakinada