Summary
Overview
Work History
Education
Skills
Timeline
Generic

Karthick Raja Selvam

Cary,NC

Summary

Senior Data Engineer with expertise in developing, testing, and maintaining data architectures. Proficient in database management systems, Big Data processing frameworks, and data modeling. Proven track record of leading teams to create innovative data solutions that optimize system efficiency and inform business decision-making. Success in improving data availability and accuracy in previous roles.

Overview

11
11
years of professional experience

Work History

Senior Data Engineer

Capgemini Technology Services
Cary, NC
09.2022 - Current
  • Utilized snowflake metadata from 50 multiple accounts to construct an efficient datamart/data warehouse.
  • Ensured accurate reconciliation of snowflake usage statement with metadata in data warehouse.
  • Engaged in regular communication with the Snowflake support team, effectively addressing issues and seeking clarification.
  • Utilized Snowpark, Pandas, and other relevant package utilities to create a highly effective end-to-end dynamic pipeline in Python.
  • Implemented streamlined data modeling techniques to enhance efficiency of metadata pipeline.
  • Employed data warehouse metadata to design and launch Power BI dashboards, enabling analysis of monthly snowflake consumption across a wide range of service types within the organization.
  • Successfully implemented DBT proof of concept, facilitating the creation of dimension and fact tables within the Transformation and Loading layer.
  • Utilized different techniques to optimize costs on the Snowflake platform, leading to a noticeable decrease in expenses.
  • Generated Power BI reports to depict the analysis of cost optimization techniques.
  • Established connections with other BU/Teams to access required data from datamart, ensuring adherence to data governance protocols.

Senior Data Engineer

Capgemini Technology Services
Cary, NC
12.2019 - 08.2022
  • Examined source file and formulated relevant workflow
  • Developed a versatile shell script to transfer files from S3 to Snowflake for various sources.
  • Established a streamlined execution Workflow through implementation of the control -m tool.
  • Implemented the use of Stash to enhance data organization and efficiency.
  • Effectively utilized XML, JSON, CSV, and TAB file formats to import data into the snowflake staging layer.

Data Engineer

MathCo (TheMathCompany)
Cary, NC
04.2019 - 11.2019
  • Utilized a Dataproc Cluster to effectively transform, process, and store billions of data in BigQuery.
  • Developed data model diagram and shell scripts utilized in the execution of Hive and Spark scripts on a Dataproc cluster.
  • Managed job scheduling with Airflow.
  • Developed Python and PySpark scripts for data transformation and loading onto GCP Cloud Platform

Data Engineer

Sri mookambika info solutions
Cary, NC
10.2013 - 03.2019
  • Performed requirement analysis, estimated project duration, and developed mapping documents for data modeling purposes.
  • Utilized strong proficiency in creating PL/SQL packages, procedures, functions, triggers, and views.
  • Efficiently performed data transfers from Oracle, Postgres, MySQL, and Flat files through the creation of effective ETL jobs for successful loading onto S3.
  • Provided the Product team with data analysis capabilities through the creation and accessibility of a S3 Data Lake.
  • Performed production deployment tasks and assisted with support responsibilities.
  • Extensively used AWS services including S3, EC2, Redshift, Athena, Glue and Database Migration Services.
  • Engaged in responsibilities encompassing data backup/restore procedures, database construction activities, schema organization efforts for efficient access control measures with associated users and roles.
  • Collaborated with Application team to generate tables for KPI dashboards and Statement Reports.

Education

Master of Science - Computer And Information Sciences

Kalasalingam University
Cary, NC
04-2012

Bachelor of Science - Computer And Information Sciences

Yadava College
Cary, NC
04-2009

Skills

  • Snowflake
  • Pyspark
  • Python Programming
  • Redshift
  • Data Visualization
  • Analytical Skills
  • Data Modeling
  • Performance Tuning
  • Data Warehousing
  • Jenkins CI-CD

Timeline

Senior Data Engineer

Capgemini Technology Services
09.2022 - Current

Senior Data Engineer

Capgemini Technology Services
12.2019 - 08.2022

Data Engineer

MathCo (TheMathCompany)
04.2019 - 11.2019

Data Engineer

Sri mookambika info solutions
10.2013 - 03.2019

Master of Science - Computer And Information Sciences

Kalasalingam University

Bachelor of Science - Computer And Information Sciences

Yadava College
Karthick Raja Selvam