Summary
Overview
Work History
Education
Skills
Timeline
Generic

Saran Prasad Balasubramaniam

Manchester ,CT

Summary

Experienced Data Engineer with a strong foundation in building efficient data pipelines and integrating cutting-edge AI technologies to enhance data analytics and automation. Proficient in SQL, Python, and cloud technologies, with a proven track record of translating complex data into straightforward business insights. Adept at collaborating with teams to innovate and streamline data processes for optimal business outcomes.

Overview

10
10
years of professional experience

Work History

Senior Data Engineer

CGI Technologies
Hartford, CT
12.2019 - Current
  • Collaborated in developing an AI tool that simplifies writing SQL queries, making data access easier for non-technical users.
  • Performed extensive research on embedding data schemas for optimal vector storage performance, and stored the embedding in the Redis Vector Database.
  • Designed POC models to categorize various types of claim documents with OpenAI.
  • Designed a POC model for a chatbot, leveraging Streamlit to respond based on data from the Confluence pages.
  • Developed a report by determining KPIs, like claim status and turnaround time, in partnership with business teams.
  • Developed Databricks notebooks to create reports to analyze the claims life cycle, and used MWAA DAGs to schedule the notebook.
  • Enhanced Pyspark workflows by incorporating partitioning and caching, boosting execution speed by 30%
  • Worked alongside data and infrastructure teams to design an AWS application for enhanced reporting and analytical capabilities.
  • Implemented infrastructure as code with Terraform to manage AWS resources such as Lambda, Glue, SNS and SQS.
  • Engineered, implemented, and sustained numerous AWS reports with Glue and Athena.
  • Utilized Amazon RDS and DynamoDB for database management, ensuring high availability and performance.

Associate Data Engineer

Axtria Limited
Boston, MA
11.2018 - 11.2019
  • Assisted in providing architecture design for an end-to-end data migration from the Veeva database to the AWS Datalake, and was also responsible for documenting the design specification.
  • Worked on creating a AWS cloud-based data solution with scalable infrastructure leveraging Terraform for the Data Science team
  • Used Data Migration Services in AWS to migrate data from SQL Server to S3.
  • Designed and implemented ETL workflows within the AWS Glue environment.
  • Involved in the code review of my fellow teammates, and I provided solutions for code optimization.

Research Assistant - Data Analyst

Department of Human Science -TTU
Lubbock, TX
11.2017 - 06.2018
  • Accumulated structured and unstructured data, cleansed, and transformed the data for behavioral statistical analysis.
  • Manipulated large datasets using SPSS for effective statistical analysis by hypothesis testing to validate data.
  • Performed cluster analysis in R to identify the students' behavioral patterns to improve the designated coursework.

Software Engineer

Mindtree Limited
07.2015 - 08.2017
  • Integrated data from multiple data sources and developed interactive dashboards in Tableau to illustrate the progress of new customer acquisition KPIs for non-American operational regions of AmEx.
  • Running Support operations to resolve issues through JIRA tickets and find RCA for the issues.
  • Optimized the performance of long-running SQL queries by creating indexes and table partitions in the database.
  • Coordinated disaster recovery server validation globally to ensure 100% production system stability.
  • Created an automation tool to collect and trigger the load run-time reports to Business Analysts of AmEx.

Education

Master of Science - Data Science

Texas Tech University
Lubbock, Texas
08.2018

Bachelor of Engineering - Electrical and Electronics Engineering

Anna University
Chennai, India
04.2015

Skills

  • Databricks
  • Cloud computing
  • Programming (Python, PySpark, SQL)
  • Collaborative Problem Solving
  • Generative AI
  • ETL development
  • Project Management
  • Data analysis

Timeline

Senior Data Engineer

CGI Technologies
12.2019 - Current

Associate Data Engineer

Axtria Limited
11.2018 - 11.2019

Research Assistant - Data Analyst

Department of Human Science -TTU
11.2017 - 06.2018

Software Engineer

Mindtree Limited
07.2015 - 08.2017

Master of Science - Data Science

Texas Tech University

Bachelor of Engineering - Electrical and Electronics Engineering

Anna University
Saran Prasad Balasubramaniam