Summary
Overview
Work History
Education
Skills
Websites
Portfolio Projects
Timeline
Generic

RAM CHARAN SHASHANK JANGAM

Denver,CO

Summary

Experienced Data Engineer with 3+ years of expertise in designing, building, and implementing scalable ETL pipelines. Strong command of Big Data technologies, including Apache Spark, Kafka, Databricks, and HBase, to optimize data processing and storage. Extensive knowledge of cloud systems such as AWS (S3, Glue, EC2, Redshift, CloudWatch, Lambda) and Azure (ADF, Blob Storage, AKS). Proficient in presenting data and providing insights through Tableau, Power BI, and personalized dashboards for stakeholders.

Overview

4
4
years of professional experience

Work History

Data Engineer

C&S Wholesale Grocers
07.2024 - Current
  • Company Overview: C&S Wholesale Grocers is one of the largest wholesale grocery supply companies in the United States, serving supermarkets, independent retailers, and institutions with a vast range of food and non-food products
  • C&S plays a critical role in the food supply chain, ensuring efficient inventory management and delivery services for its customers
  • Deployed ETL pipelines using AWS Glue and Spark to process transactional, customer data on daily basis
  • Optimized data storage and retrieval in Amazon S3, enhancing query performance by 20% and ensuring reliable access for regulatory reporting and Business Intelligence
  • Containerized ETL jobs using Docker and deployed them to Amazon ECS, increasing the scalability of data pipelines and reducing deployment time for reporting applications
  • Implemented error handling, logging, and monitoring for Lambda functions using AWS CloudWatch Logs and SNS alerts
  • Built PySpark-based batch processing jobs for data cleansing, deduplication, and aggregation, improving pipeline efficiency by 30%
  • Ensured seamless ingestion and transformation of over 500 GB of sales data daily
  • C&S Wholesale Grocers is one of the largest wholesale grocery supply companies in the United States, serving supermarkets, independent retailers, and institutions with a vast range of food and non-food products
  • C&S plays a critical role in the food supply chain, ensuring efficient inventory management and delivery services for its customers

Jr. Data Engineer

Cisco Systems
06.2023 - 05.2024
  • Company Overview: Cisco Systems is a global technology leader specializing in networking hardware, software, and telecommunications equipment
  • The company plays a key role in cloud computing, IoT, AI-driven networking, and security innovations
  • Developed and optimized ETL workflows using AWS Glue, reducing processing time of sales data by 30%
  • Designed and implemented serverless ETL pipelines using AWS Lambda, processing large-scale data ingestions from Amazon S3 to Redshift
  • Designed and implemented data integration pipelines using Informatica PowerCenter, increasing processing efficiency by 25%
  • Developed and integrated RESTful APIs to facilitate secure and efficient data exchange between systems and downstream applications
  • Optimized data storage and retrieval for financial datasets using DynamoDB, achieving sub-50ms query latency for real-time applications
  • Cisco Systems is a global technology leader specializing in networking hardware, software, and telecommunications equipment
  • The company plays a key role in cloud computing, IoT, AI-driven networking, and security innovations

IT Analyst

Office of Information Technology – UC Denver
06.2022 - 12.2022
  • .Created and maintained databases to facilitate efforts to enhance performance
  • Using tools like PowerBI, visualized the data into a variety of charts to comprehend trends, patterns, and connections
  • Involved in activities involving data inquiry, extraction, compilation, and reporting
  • Skilled in creating SQL queries for data transformation and retrieval adept at using a variety of Excel functions, including pivot tables and VLOOKUP, to collect data

Intern Data Analyst

Farmova Global
01.2021 - 05.2021
  • Transformed sales data into insightful knowledge, enabling the business to make better data-driven decisions
  • Developed budget forecasting and analysis using Apache Nifi and Unix, creating management ad hoc reports that improved evaluation processes
  • Increased throughput by 20% using business intelligence methods like forecasting and regression analysis
  • Used Tableau dashboards to visualize the sales data into tables, charts, and narratives to gain a deeper comprehension of the market and rivals

Education

Master of Science - Electrical Engineering

University of Colorado
Denver, CO
05.2023

Bachelor of Technology - ECE

JAIN University
Bengaluru, IN
06.2021

Skills

  • Apache Spark
  • Hadoop
  • Informatica PowerCenter
  • Databricks
  • Kafka
  • AWS
  • S3
  • Glue
  • EC2
  • Lambda
  • Redshift
  • RDS
  • VPC
  • CloudWatch
  • Microsoft Azure
  • MySQL
  • MSSQL Server
  • PostgreSQL
  • Python
  • SQL
  • PySpark
  • R programming
  • Shell Scripting
  • Apache Airflow
  • Amazon EventBridge
  • Kubernetes
  • Docker
  • Jenkins
  • CI/CD
  • Git
  • GitHub
  • Tableau
  • Power BI
  • Matplotlib
  • MS Excel
  • ETL development
  • Data warehousing
  • Data modeling
  • Data pipeline design
  • Data migration
  • Performance tuning
  • Big data processing
  • Spark framework
  • SQL expertise
  • Machine learning

Portfolio Projects

  • Batch Processing with PySpark on AWS EMR, Designed a scalable ETL pipeline using PySpark on AWS EMR to process large datasets, leveraging S3, Glue, and Redshift for data storage and transformation.
  • Databricks Real-Time Streaming, Implemented a real-time data ingestion pipeline using Apache Spark Structured Streaming and Kafka on Databricks, integrating with Azure Blob Storage and Power BI for analysis.
  • Azure Data Factory and Databricks Project, Developed an end-to-end data pipeline using Azure Data Factory to orchestrate Databricks-based ETL workflows, utilizing Delta Lake for optimized storage and processing.
  • Big Data Project using Hadoop, Built a distributed data processing solution with Hadoop, Hive, and HBase, implementing MapReduce and Sqoop for batch processing and data ingestion from RDBMS.
  • SQL Project for Data Analysis, Designed and optimized complex SQL queries for data analysis and reporting, leveraging Stored Procedures in MySQL to automate data transformations, improve performance, and streamline report generation from structured datasets.

Timeline

Data Engineer

C&S Wholesale Grocers
07.2024 - Current

Jr. Data Engineer

Cisco Systems
06.2023 - 05.2024

IT Analyst

Office of Information Technology – UC Denver
06.2022 - 12.2022

Intern Data Analyst

Farmova Global
01.2021 - 05.2021

Bachelor of Technology - ECE

JAIN University

Master of Science - Electrical Engineering

University of Colorado
RAM CHARAN SHASHANK JANGAM