Experienced Software Engineer skilled in Java, Python, Apache Kafka, and AWS, with a proven track record in delivering efficient and scalable data solutions. Expertise in data migration, optimization, ETL automation, containerization, and cloud technologies. Proven ability to improve application performance, reduce operational costs, and lead cross-functional teams.
Overview
4
4
years of professional experience
Work History
Software Engineer
Comcast
02.2021 - Current
Migrated more than 80TB of data from MySQL, PostgreSQL, and Teradata databases to MinIO and AWS S3 using Python and Apache Spark, improving application performance by 40%, scalability by 50%, and cost by 40%.
Reduced analyst workload by 70% by developing and implementing comprehensive data documentation standards for SQL datasets, making it easier for analysts to find and understand data they need, and reducing time they spend on data discovery and preparation.
Developed and optimized Spark scripts using PySpark, SparkSQL, and Python libraries such as Pandas and NumPy, and deployed them to Kubernetes-driven Spark environment using containers, achieving 90% performance improvement.
Created and managed containerized infrastructure using Docker and Kubernetes for microservices architecture, enabling scalability, reliability, and efficiency.
Automated ETL processes using Bash environments to extract, transform, and load data from various sources to cloud storage, streamlining data workflows and freeing up time for more complex tasks.
Implemented daily job flows using X-gears and Airflow to orchestrate data pipelines and automate data-driven tasks, boosting productivity and efficiency gains by up to 40%.
Proficient in working with variety of file formats, including Parquet, Sequence, JSON, ORC, and CSV, for efficient data extraction, transformation, and loading.
Leveraged Apache Spark and Hadoop to transform and process data at scale, increasing processing speed by up to 50%.
Collaborated with and led cross-functional and offshore teams in development, analysis, and understanding of data scripts and insights.
Built and managed Kafka pub/sub systems to collect real-time usage data from Xfinity, enabling real-time monitoring and analysis.
Developed and implemented ETL pipelines using Java and Spring Boot to load incremental data from Kafka to cloud storage, ensuring up-to-date data warehouses.
Optimized Kafka performance and scalability to handle high volumes of real-time data, ensuring reliable processing and delivery even during peak periods.
Troubleshooted and resolved Kafka production issues, minimizing downtime and ensuring smooth data pipeline operation.
Tech stack:
Databases: PostgreSQL, Teradata, Oracle, SQL Server
Other tools and technologies: Hadoop, Apache Spark, Apache Kafka, Bash & Shell, Airflow, X-gears
Software Engineer
Orrba Systems
Rock Hill, SC
09.2020 - 01.2021
Built and optimized data pipelines to extract, transform, and load data from diverse sources into data warehouses, improving data quality and accessibility by 20%.
Developed and maintained scalable and reliable batch and real-time data processing systems using Big Data technologies including Hadoop, Spark, and Kafka, enabling real-time data-driven decision making.
Derived valuable insights from websites and analyzed data using NumPy, Pandas, and scikit-learn Python modules, supporting data-driven product development and business optimization.
Collaborated closely with data scientists and stakeholders to ensure data pipelines met business requirements and delivered actionable insights, fostering collaborative and data-driven culture.
Software Engineer
Truphone
Cleveland, OH
07.2020 - 09.2020
Spearheaded backend performance enhancements, boosting data accuracy by 20% and slashing system response times by 30%.
Designed Angular-based UI components, ensuring 99% on-time and precise data delivery.
Efficiently employed Docker and Kubernetes for backend service containerization and workflow orchestration, achieving 50% reduction in deployment time and 99.9% increase in system uptime.
Assistant Engineer
Cleveland State University
Cleveland, OH
08.2019 - 03.2020
Architected and orchestrated cutting-edge workforce management application through Spring MVC, ensuring seamless connection between front-end and back-end, highlighting ability to form genuine connections.
Revolutionized university databases by capitalizing on established storage frameworks, and further enhancing it with flawless data API integration, culminating in significant 30% storage efficiency.
Adopted and implemented advanced algorithms to innovate equipment lending protocols, realizing 50% uptick in equipment processing velocity, reflecting self-starter attitude.
Led and supervised comprehensive testing initiatives and resolved complex software issues, propelling application towards user experience that received top-tier reviews, showcasing leadership attributes, and keen eye for detail.
Education
Master of Science - Computer Engineering
Cleveland State University
Cleveland
05.2020
Bachelor of Science - Electrical and Electronics Engineering
Osmania University
Hyderabad, INDIA
04.2018
Skills
Proficient in Programming Languages: Java, C#, JavaScript, TypeScript, Python
Proven expertise in Stream Processing: Apache Kafka
Excellent documentation skills in API Development & Automation:
Postman, Swagger
Demonstrated experience in Containerization & Orchestration: Docker, Kubernetes
Practical experience in Cloud Platforms: AWS (EC2, RDS, S3, Lambda)
Professional communication skills aided with Version Control and Collaboration Tools: Git, GitHub, JIRA, Confluence
Solid background in Web Development: HTML, CSS, Angular, Spring Boot, Spring MVC, Maven, jQuery, JSP Servlets, Nodejs, REST API
Timeline
Software Engineer
Comcast
02.2021 - Current
Software Engineer
Orrba Systems
09.2020 - 01.2021
Software Engineer
Truphone
07.2020 - 09.2020
Assistant Engineer
Cleveland State University
08.2019 - 03.2020
Master of Science - Computer Engineering
Cleveland State University
Bachelor of Science - Electrical and Electronics Engineering