Summary
Overview
Work History
Education
Skills
Certification
PIMCO USA
BARCLAYS SOUTH AFRICA
Santerder UK
American Express USA
Westpac Australia
Timeline
Generic

PRATEEK SAMAL

Newport Beach,CA

Summary

Data Engineer with 11+ Year experience and a proven track record of designing, developing, and maintaining robust data infrastructure. Armed with a solid foundation in data engineering principles, I bring expertise in ETL processes, data modeling, and database management. Proficient in utilizing cutting-edge technologies, I have successfully implemented scalable solutions for data integration, transformation, and analysis. My hands-on experience with cloud platforms, such as AWS and Azure, coupled with a deep understanding of big data technologies, positions me to drive innovation and optimize data workflows. With a meticulous approach to data quality and a commitment to continuous improvement, I excel in collaborating with cross-functional teams to deliver data-driven insights that empower informed decision-making. Seeking to leverage my technical skills and strategic mindset to contribute to dynamic projects in a challenging Data Engineer role.

Overview

11
11
years of professional experience
1
1
Certification

Work History

Principal Consultant

GENPACT
Newport Beach, CA
04.2018 - Current
  • Designed and implemented data migration workflows using DBT, ensuring efficient extraction, loading and transformation (ELT) processes tailored to Snowflake's architecture.
  • Optimized ETL processes for enhanced performance, leveraging best practices and adopting techniques to streamline data processing.
  • Engaged in code reviews, meticulously analyzing code written by team members, offering insights on code quality, readability, and compliance with coding standards.
  • Integrated Data Quality automation into the continuous integration and delivery (CI/CD) pipeline, automating the validation of data quality as an integral part of the development lifecycle.

Lead Consultant

ITC INFOTECH INDIA LTD.
09.2015 - 03.2018
  • Led the end-to-end design and implementation of Extract, Transform, Load (ETL) mappings in alignment with precise business requirements.
  • Documented performance tuning methodologies and best practices, providing guidelines for the team to follow in future development and optimization efforts.
  • Played a key role in communicating data model designs to cross-functional teams, ensuring a shared understanding of the database structures and contributing to enhanced data management and database efficiency.

Senior System Engineer

INFOSYS
09.2012 - 08.2015
  • Translated complex business requirements into functional ETL mappings, demonstrating a deep understanding of data structures, source systems, and destination data models.
  • Engineered and implemented robust Shell Scripts to automate end-to-end workflow execution, optimizing operational processes and reducing manual intervention.
  • Proactively managed and monitored all Daily, Weekly, Monthly, and Quarterly jobs in the scheduler, ensuring seamless workflow execution and adherence to timelines.
  • Spearheaded the design and implementation of an automated system for generating comprehensive status reports on alerts, streamlining communication and enhancing operational efficiency

Education

Bachelor of Engineering (Computer Science) -

Veer Surendra Sai University of Technology
05.2012

Skills

  • Data Warehousing
  • Data Mapping
  • Informatica Developer (powercenter/powerexchange/IDQ/MDM/IICS)
  • Data Quality
  • Data Analysis
  • Data Modeling
  • ETL Developer (Talend/Abinitio)
  • Cloud Skills (Niagara framework, AWS, S3, Snowflake, DBT and Airflow)
  • Database (Oracle, Sql Server,Teradata,Sybase,Snowflake)
  • Programming Languages (Oracle SQL, PL/SQL, UNIX Shell Scripting)
  • Scheduling Tools (Control M, Event Engine Scheduler,Autosys)
  • Incident Management (Service Now, BMC Remedy,JIRA)

Certification

  • Informatica Certified
  • Snowflake Certified
  • AWS Certified

PIMCO USA

Enterprise Data Platform :Using DBT transform data from Oracle and land over Snowflake

Enterprise Data Warehouse :EDW is the centralized repository of disparate data source that is conformed into a dimensional model design used for the core purpose of business intelligence and analytics. There is a unified approach for the representation of data that allows for analytical queries without impacting operational system using a Robust Data warehouse.

BARCLAYS SOUTH AFRICA

Rest Of Africa : Design the Data Model for the entire request related to the Project Scott which ensures the reference data to be picked from EDW and load into to Adratic using ETL.

Santerder UK

RDA : The term Risk Data Aggregation means defining, gathering and processing risk data according to the bank’s risk reporting requirement to enable the bank to measure its performance against its risk tolerance/ appetite. This includes sorting, merging or breaking down sets of data.

Santander Group has requested a number of metrics and parameters that must be automatically generated by Santander UK, these metrics will be generated for local and Group purposes.

American Express USA

CMIT : The project involved migration of the entire CMIT platform from AIX/Abinitio/EventEngineScheduler/Sybase to Linux/Informatica/Control-M/Teradata based platform in the Customer Marketing Capabilities portfolio. This project involved migration of multiple Abinitio ETL jobs to Informatica jobs with re-engineering wherever applicable. Informatica PowerCenter and PowerExchange tools were extensively used in this project

Westpac Australia

MAMA : The project involved migration of the entire MAMA (Multiple accounts multiple access) to GCM (Global corporation marketing. This project involved Enhancement and production support.

Timeline

Principal Consultant

GENPACT
04.2018 - Current

Lead Consultant

ITC INFOTECH INDIA LTD.
09.2015 - 03.2018

Senior System Engineer

INFOSYS
09.2012 - 08.2015

Bachelor of Engineering (Computer Science) -

Veer Surendra Sai University of Technology
PRATEEK SAMAL