MANOJ KUMAR THIMAPURAM

SOFTWARE DEVELOPER
Austin, TX

Summary

Software Developer with five years of experience designing and implementing scalable, cloud-native data pipelines and ETL workflows. Built real-time and batch data ingestion systems using MySQL, Apache Kafka, and AWS Glue with PySpark, reducing data latency and improving pipeline performance. Orchestrated complex workflows using Apache Airflow, integrating Amazon S3, PySpark, and Pandas to automate data processing and reduce manual effort. Optimized SQL queries and performed schema migrations to enhance database efficiency and support analytics. Developed Python scripts to analyze application and pipeline log files for error detection and anomaly identification. Created monitoring dashboards using Splunk and Mosaic to maintain data quality and provide operational transparency. Experienced in containerizing applications with Docker, managing Kubernetes clusters, and automating CI/CD pipelines with Jenkins to ensure reliable and maintainable data infrastructure.

Overview

5 years of professional experience
2 Certifications

Work History

Software Developer

Apple Inc., Tata Consultancy Services Limited
Austin, TX
11.2020 - Current
  • Built a real-time ETL pipeline using SQL Server, Apache Kafka, and AWS Glue to ingest, transform, and store data in Amazon S3, reducing data availability latency from 5 minutes to approximately 2 minutes.
  • Implemented incremental and partitioned data processing in AWS Glue, improving query performance by approximately 35% and lowering storage costs by approximately 15%.
  • Orchestrated batch and streaming ETL workflows in Apache Airflow, integrating S3, PySpark, and Pandas, cutting manual intervention by 20% through automated scheduling.
  • Designed and optimized complex SQL queries, stored procedures, and functions, reducing query execution times by approximately 10% and improving overall data pipeline throughput.
  • Created data models and transformations to support SSRS dashboards and analytics, enabling more accurate KPI tracking and reducing report generation time by approximately 15%.
  • Developed a Slack chatbot data ingestion pipeline integrated with AWS services, automating the capture and storage of operational metrics into S3, saving approximately 4 hours per week of manual reporting effort.
  • Performed DDL and schema migrations across multiple environments, ensuring zero-downtime deployments and compliance with enterprise security standards.
  • Extracted and transformed sensitive data from encrypted and non-encrypted databases, leveraging Spring Boot and AWS SDK to securely store datasets in S3.
  • Implemented Git-based version control for data pipelines to enable collaborative development, automated testing, and rapid rollback when needed.
  • Developed Python/Pandas scripts to parse and analyze pipeline and application log files, identifying error patterns and data anomalies, reducing investigation time by approximately 35%.
  • Built Splunk dashboards to monitor ETL job performance, S3 ingestion metrics, and data pipeline health, enabling faster detection and resolution of failures.
  • Created Mosaic dashboards to visualize data quality metrics, processing throughput, and SLA compliance, improving transparency for engineering and analytics teams.

Software Developer

Tata Consultancy Services Limited
Dallas, TX
08.2020 - 11.2020
  • Developed and deployed RESTful services using Spring Boot to facilitate reliable data ingestion and communication between microservices, enhancing pipeline scalability and fault tolerance.
  • Integrated Spring Boot applications with AWS services, securely storing ingested data in S3 buckets and leveraging modular design for maintainable data workflows.
  • Automated data extraction, transformation, and loading (ETL) processes with Python scripts to improve pipeline efficiency and reduce manual intervention.
  • Utilized Pandas for data cleansing, aggregation, and validation, accelerating data preparation and improving data quality for downstream analytics.
  • Optimized SQL queries and database performance by creating indexes and analyzing execution plans, reducing query runtimes and enhancing data retrieval speed.
  • Designed interactive dashboards using SSRS and crafted optimized SQL queries in SQL Server Management Studio (SSMS) to provide clear data insights and monitor pipeline health.
  • Managed source code and data pipeline versions using Git, enabling collaborative development, automated testing, and smooth deployment cycles.

Education

Master’s - Computer Science

University of North Texas
Denton, TX
05.2020

Bachelor of Science - Computer Science

Jawaharlal Nehru Technological University
India
05.2018

Skills

AWS Glue, Apache Kafka, Apache Airflow, Amazon S3, Python, Pandas, PySpark, Java, SQL, MySQL, SQL Server Management Studio (SSMS), Redis, Docker, Kubernetes, Jenkins, Git, Selenium, Splunk, Mosaic, SSRS, Postman, Spring Boot, ETL Development, Data Warehousing, CI/CD Pipelines, REST API Development, Cloud Computing, Big Data Processing, Data Modeling

Certification

AWS Database Specialist

Timeline

AWS Database Specialist

10-2022

AWS Certified Developer Associate

01-2021