Rakesh Sahu

Woodbridge

Summary

Senior Data Engineer with 14 years of experience in Data Engineering, Architecture, and Analysis. Expertise in managing software development lifecycles, ensuring smooth transitions from requirements to production. Specialized in financial products, including Treasury management and Deposit Products, with a focus on optimizing data solutions. Dedicated to continuous learning and implementing innovative technologies to improve production efficiency.

Overview

14 years of professional experience

Work History

Sr Data Engineer

City National Bank
New York
01.2023 - Current
  • Led engineering and data architecture for data lake and hub environment, ensuring compliance with MRA and regulatory requirements.
  • Designed and implemented ETL/ELT pipelines using Informatica Cloud, Python, Snowpark, and SQL Views for enterprise data warehouse in Snowflake.
  • Developed logical and physical data models, entity flow diagrams, and source-to-target mappings for banking products including deposits, loans, and derivatives.
  • Managed migration from Oracle to Snowflake, overseeing 40 systems of record and 80 processes utilizing Snowflake, IICS, and Python.
  • Created frameworks for Audit Balance Control, Data Quality, Data Reconciliation, and Data Obfuscation using Snowflake and Python.
  • Automated data lineage tracking through IICS Metadata API, Snowflake Lineage, and SAP DS lineages using Python.

Data Architect/Data Engineer

Lending Club Bank (Vendor: KForce Inc)
Stamford
08.2021 - 01.2023
  • Worked with the bank operations team to gather requirements for Deposit and Personal Loan products and for Fraud and Risk Management.
  • Performed data analysis, data profiling, and data discovery across the bank's source systems, focusing on deposit, loan, and KYC data.
  • Designed and developed data models and data mapping documents to build a data mart for the Deposit Operation Analytics team.
  • Developed complex SQL and ETL infrastructure on the Hadoop framework (Hive and Presto) with Spark SQL and Python to build the analytics data mart.
  • Created ETL pipelines using the Hive Workflow Framework to automate manual, Excel-based reporting processes, replacing them with SQL-driven solutions, Tableau reporting, and Python programs to keep processes compliant with regulatory and InfoSec requirements.
  • Created an analytics layer to support KPI metrics for deposit data (ACH, Wires, Checks, ATM/Debit card, and Book Transfer datasets) and loan servicing (loan origination, disbursement, and collection) datasets.
  • Wrote complex Python scripts to analyze data from different file formats and sources, apply transformation logic, and load the results into the database.

Data Engineer/Data warehouse Analyst

GE Capital (Vendor: Tata Consultancy Services)
Norwalk
08.2011 - 07.2021
  • Led requirement-gathering and brainstorming sessions with business users to understand requirements and benefits.
  • Performed and documented gap, inventory, impact, and feasibility analyses of existing systems and jobs to be migrated from Oracle to PostgreSQL.
  • Performed data analysis, data profiling, data quality checks, and data exploration on large datasets.
  • Created conceptual, logical, and physical data models for the enterprise data warehouse using the Erwin modeling tool, applying dimensional modeling, SCD Type I and SCD Type II concepts, and snapshot tables.
  • Created an ELT framework using PostgreSQL functions, Informatica PowerCenter, and Python.
  • Developed an audit control and error handling framework in PostgreSQL functions for ELT jobs.
  • Wrote complex SQL queries for data consumption layer views, data validation rules, data comparison, and ad hoc analysis.
  • Developed a data virtualization layer using Denodo, creating base views, complex derived views, and data caching for reporting.
  • Led a team to refactor and optimize ETL processes, stored procedures, ad hoc scripts, and SQL utilities for data extraction and analysis.
  • Developed Spotfire and Tableau data visualizations using Cross Tab reports, Summary tables, Line charts, Bar charts, Cross Map, Scatter Plots, Geographic Map, and Pie Charts.

Education

Bachelor of Computer Applications

Utkal University
01.2011

Skills

  • SQL
  • PL/SQL
  • Python
  • Shell Script
  • ETL
  • ELT
  • PySpark
  • Snowpark
  • Oracle
  • Snowflake
  • SQL Server
  • PostgreSQL
  • Hive
  • Presto
  • Informatica PowerCenter
  • IICS
  • SAP Data Services
  • Denodo
  • AWS
  • Azure
  • Erwin
  • ER/Studio
  • Data warehousing
  • Data Analytics
  • Modeling
  • CI/CD
  • Cognos BI
  • Spotfire
  • Tableau
