Summary
Overview
Work History
Education
Skills
Related Experience
Certification
Timeline
Generic

Kavi Bharathi Chinnathambi

Senior Data Engineer
Richardson

Summary

Experienced data engineer with 8 years of hands-on experience in designing, developing, and maintaining data pipelines. Skilled in leveraging various tools and technologies to extract, transform, and load data from diverse sources into analytical platforms. Proficient in optimizing data workflows to enhance performance and efficiency. Demonstrated ability to collaborate with cross-functional teams to deliver data-driven solutions aligned with business objectives.

Overview

9
9
years of professional experience
1
1
Certification

Work History

Senior Data Engineer

Eviden
01.2021 - Current
  • Developed adaptable, resilient, and dynamic data pipelines leveraging BigQuery's INFORMATION_SCHEMA.
  • Consolidated a complex architecture consisting of over 40 pipelines into a unified dynamic pipeline across four distinct domains. • Enhanced pipeline performance and code organization through the implementation of a configuration-based approach.
  • Spearheaded a triumphant Proof of Concept (POC) venture to assess migrating the current Composer/SQL-based native solution to the cloud-native No Code/Low Code ETL Data Fusion tool.
  • Developed Airflow/Cloud Composer Directed Acyclic Graphs (DAGs) to orchestrate dataflow tasks, facilitating the extraction and transformation of data from diverse origins, and scheduling Dataflow pipeline executions.
  • Conducted thorough data validation processes to ensure consistency and accuracy between source and target datasets, employing robust techniques and tools.
  • Orchestrated the comprehensive design and implementation of ETL (Extract, Transform, Load) flow in the transition project from Composer-SQL-based pipelines to Google Cloud Data Fusion.
  • Developed detailed technical flowcharts to document the intricacies of GO pipelines, enabling a comprehensive understanding of the data processes and workflow.
  • Implemented and deployed a near real-time CDC approach to capture and replicate changes from a Postgres database hosted on Google Cloud SQL to Big Query using DataStream

Data Engineer

Visual BI solutions
05.2019 - 12.2020
  • Redesigned 15+ low performing data models to improve performance and readability
  • Developed and implemented DBT models to transform and load data from SQL Server into Snowflake, ensuring data integrity and accuracy.
  • Developed dynamic SQL scripts templates for data validation resulting in quick and easy data validation process

SQL developer

TATA Consultancy Services
11.2014 - 08.2018
  • Developed SQL based reports displaying the invoice, sales and material data based on customer, date, etc., COE & Internal projects.

COE & Internal Projects

Eviden
01.2021 - Current

GCP pricing & Big query Performance optimization:

· Collaborated with project managers and the pre-sales team to organize a session focusing on the recent changes in Google Cloud Platform (GCP) pricing and strategies for optimizing Big Query performance

· I addressed queries from multiple project managers regarding effective techniques for optimizing Big Query and adapting to the new GCP pricing models

· Additionally, I seized the opportunity to share my recommendations on setting up cost optimization measures for clients, aligning them with the updated GCP pricing structure

Streamlit - snowflake cost overview dashboard:

· Led the internal project focused on creating a Streamlit-based Snowflake cost overview dashboard

· Spearheaded project planning and coordination with the team, overseeing setup, development, and task allocation

· Identified key performance indicators (KPIs) and stories, contributing to both backend and frontend development

· Took charge of wireframe design for the dashboard and collaborated with the stakeholders for inputs & reviews

Education

Master of Science - Information Technology Project Management

The University of Texas At Dallas
Richardson, TX
05.2001 -

Skills

Cloud Computing Platforms: GCP, Snowflake

undefined

Related Experience

  • Eviden, Senior Data Engineer, 01/21, Present, Developed detailed technical flowcharts to document the intricacies of GO pipelines, enabling a comprehensive understanding of the data processes and workflow., Implemented and deployed a near real-time CDC approach to capture and replicate changes from a Postgres database hosted on Google Cloud SQL to Big Query using DataStream
  • Home Depot, Senior Data Engineer, Developed detailed technical flowcharts to document the intricacies of GO pipelines, enabling a comprehensive understanding of the data processes and workflow., Implemented and deployed a near real-time CDC approach to capture and replicate changes from a Postgres database hosted on Google Cloud SQL to Big Query using DataStream
  • AAA, Senior Data Engineer, Led a successful Proof of Concept (POC) initiative to evaluate the migration of the existing Composer/SQL-based native solution to the cloud-native No Code/Low Code ETL Data Fusion tool., Conceptualized and executed the end-to-end design and development of ETL (Extract, Transform, Load) flow as part of the project to transition from Composer-SQL-based pipelines to Google Cloud Data Fusion
  • UPS, Data Engineer, Built flexible, robust, and dynamic data pipelines utilizing big query’s INFORMATION_SCHEMA, Converted a multi-pipeline architecture over 40 pipelines into a single dynamic pipeline for four different domains, Improved performance and code management of pipelines by using the configuration approach
  • Visual BI solutions, Data Analyst, 05/19, 12/20, Efficient in gathering business details & data from the business & IT team to improve and solve business issues, Extracted millions of planned and actual data using complex SQL & stored procedures from multiple systems, Analyzed insights using data mining techniques on sales data using python’s data analysis & visualization packages to increase sales of a region leading to increase in sales by 7%, Designed & published 7+ stories/dashboards on tableau depicting business insights for decision making & analysis
  • Visual BI solutions, Business Intelligence, Communicated & coordinated with end-users & IT team in gathering requirements, issues & business improvements, Designed 15+ data models for report analysis & to increase the efficiency between the dashboards & stories by 4%, Migrated data from MySQL server to snowflake, replicating & performance tuning the data model logics., Transformed data using DBT & applied T-SQL for effective data transformation improving the performance by 5%
  • TATA Consultancy Services, SQL developer, 11/14, 08/18, Developed SQL based reports displaying the invoice, sales and material data based on customer, date, etc.,, COE & Internal projects, Big Query Omni:, Demonstrated a working theory of BQ Omni with a real time example connecting GCP & S3 buckets, GCP pricing & Big query Performance optimization:, Collaborated with project managers and the pre-sales team to organize a session focusing on the recent changes in Google Cloud Platform (GCP) pricing and strategies for optimizing Big Query performance, I addressed queries from multiple project managers regarding effective techniques for optimizing Big Query and adapting to the new GCP pricing models, Additionally, I seized the opportunity to share my recommendations on setting up cost optimization measures for clients, aligning them with the updated GCP pricing structure, Streamlit - snowflake cost overview dashboard:, Led the internal project focused on creating a Streamlit-based Snowflake cost overview dashboard, Spearheaded project planning and coordination with the team, overseeing setup, development, and task allocation, Identified key performance indicators (KPIs) and stories, contributing to both backend and frontend development, Took charge of wireframe design for the dashboard and collaborated with the stakeholders for inputs & reviews

Certification

GCP Certified Professional Data Engineer – Google Cloud Platform

Timeline

GCP Certified Professional Data Engineer – Google Cloud Platform

01-2023

Senior Data Engineer

Eviden
01.2021 - Current

COE & Internal Projects

Eviden
01.2021 - Current

Data Engineer

Visual BI solutions
05.2019 - 12.2020

SQL developer

TATA Consultancy Services
11.2014 - 08.2018

Master of Science - Information Technology Project Management

The University of Texas At Dallas
05.2001 -
Kavi Bharathi ChinnathambiSenior Data Engineer