Summary

Overview

Work History

Education

Skills

Certification

Timeline

Saravanan Mani

Morrisville,NC

Summary

Having 14+ years of DW/BI and Analytics experience Using Amazon Web Services (AWS), worked on application migration from On-Prem Server to Cloud which involves end-to-end data transformation using S3, EC2, EMR, Lambda, SNS, Lake, RDS, RedShift, Snowflake, Apache AirFlow Workflow Extensive knowledge in migrating on-primes to AWS cloud, worked on Ab-Initio to PySpark and DataStage cloud migration projects. Expertise in developing, implementing, optimizing, and troubleshooting complex data warehouse databases on snowflake Migrated AWS RedShift & Teradata objects into Snowflake environment Good knowledge of Snowflake database, Schema and Table Structures Good Knowledge in snowflake data modeling, ETL using Snowflake SQL and standard ETL concepts. Development on Big Data on Cloud with EMR/EC2 Instances involving Spark, Hadoop, Pig & Hive Played DevOps Engineer role and managed all the cloud Servers & services, fixing vulnerabilities and make them compliant in audit reports. Hands on experience in creating new servers with proper IAM roles, security groups and S3 bucket policies Extensive Knowledge on SQL databases like Amazon RedShift, Snowflakes, Oracle, DB2 and Teradata Support experience with monitoring built on Splunk Dashboards and CloudWatch logs and alerts through PagerDuty and Slack channels Proficient in ETL development using IBM Web Sphere DataStage, Strong in ETL Architecture design and DWH & writing SQL Queries Completed AWS Solutions Architect Associate Course with Udemy Development and Implementation through SDLC Methodology (Agile, Scrum/Iterative Development) Obtained detailed understanding of data sources, EBCDIC, Flat, Parquet, COBOL Schema files and its variations (redefines, Arrays), complex data schemas I have proved to be an astute individual in Banking and Retail services where I had an environment to showcase my extensive knowledge in the aspects of Banking & Retail domain. Responsive expert experienced in monitoring database performance, troubleshooting issues and optimizing database environment. Possesses strong analytical skills, excellent problem-solving abilities, and deep understanding of database technologies and systems. Equally confident working independently and collaboratively as needed and utilizing excellent communication skills. Detail-oriented team player with strong organizational skills. Ability to handle multiple projects simultaneously with a high degree of accuracy.

Overview

years of professional experience

Certification

Work History

Senior Data Engineer

Fidelity Investments

Durham, NC

05.2021 - Current

Description:

Product MTW(Managing The Work) uses different sources of record to track People, Work, Finance and objective data. People data comes from Workforce Connect, Work data comes from Jira/Jira Align, and Objective data comes from DOMO Goals. Financial data is then costed and generated based on combination of data from these tools. In order to make any sense of data going into these separate tools, there would be hefty learning curve, multiple ACR's - and frankly some skilled analysts to put it all together, this is where MTW data model comes into play.
DataSavvy squad has managed to consolidate all this data into One model that is easy to consume, lightweight, and provides business insights it needs out of these tools for accurate planning, data, costing, and management

Responsibilities:

• Create Data Models by using dimension modeling.
• As technical consultant, provide guidance and support for team members to complete their stories/tasks.

• Collaborated on ETL (Extract, Transform, Load) tasks, maintaining data integrity and verifying pipeline stability.

• Contributed to internal activities for overall process improvements, efficiencies and innovation.

• When business needs, have created Lens, Attribute, Hierarchy, Matrix materialized views.
• Build/Add Alerts exceptions/Super Scores for Transparency Reporting (Enterprise Level).
• Extract data from Work Force Connect Oracle and Stage it into SnowFlake and feed data to reporting.
• Extract data from AWS S3 Buckets by using snowpipe and stage it to snowflake internal stage and load into their corresponding target tables/views.
• Implemented and maintaining Checksum logic for Data Quality and Data Lineage Purpose.
• Responsible for production batch SLA/job Failures.
• Addressing User Queries raised.
• Maintaining C2C boxes all environments, responsible for applying patches and make sure servers are in complaint
• Working on AtScale Analytic tool, Proof of Concept to build aggregates as service layer
• Planned to Convert On-Prem C2C boxes into AWS EC2 Instances
• Add/update MTW objects metadata into Alation tool.
• Applied loss functions and variance explanation techniques to compare performance metrics.

Tools : Snowflake, AWS, AtScale, Alation, DOMO, Python, WFC, Oracle, Power BI, Tableau, Control-M, C2C

Technical Lead /DevOps

Capital One

Chicago, IL

05.2020 - 05.2021

Description:

As part of tech modernization, migrate existing Ab-Initio ETL jobs to PySpark ETL jobs and load data into OneLake and Snowflake Database
Extract EBCDIC data from sources like First Data and Zoot, read files by using COBOL copy books, create schema datasets in metadata management tools, convert EBCDIC format data to ASCII format by using Cobrix tool, then do tokenization for security fields and write data into S3 then move in to OneLake S3 and write it into Snowflake database.

Responsibilities:

Register source file schema details in Metadata management internal tools Exchange.
Read Source EBCDIC file from S3 by using COBOL Copy Book rearranging occurs and formats.
Convert EBCDIC data into ASCII format by using Cobrix tool
Encrypt security fields by Turing process (Tokenization/Detokenization).
Write data into OneLake S3 buckets.
Using PySpark Snowflake connector, read data from S3 and write it into Snowflake database.
Use Enterprise GIT repositories for CI/CD Pipeline
Apache Airflow for scheduling jobs by creating DAG's.
Monitor existing batch jobs using Splunk dashboard and Cloudwatch logs
Support jobs through PagerDuty and Slack channel notifications
Created and implemented contingency plans to address potential risks.
Developed and introduced IT strategies to improve operational efficiency.
Coordinated with external vendors to deliver IT project components.

Tools : AWS Web Services, Big Data, Spark Master, DataStage, Python, Pig, Jenkins, Snowflake, RedShift, Oracle 12c, Teradata

Technical Lead/ Architect

Capital One

Chicago, IL

09.2019 - 05.2020

Description:

As part of this Know Your Customer (KYC) Program there were reports sent out to Federal team on a monthly basis. This report had multiple issues as the downstream analytics team were facing and it was required to re-factor the existing KYC jobs to address the concerns and issues related to reconciliation, validation were addressed in the new design by using true sourced files from Mainframe and First data source teams.

Responsibilities:

Managed project planning, resource allocation, scope, schedule, status and documentation.
Organized system operating procedures to strengthen controls
Analyze existing jobs which were written on AWS - Lambda function.
Update Python code to address security related compliance issues by replacing LOB IAM Roles to Enterprise IAM Role accessed via TALOS API - designed internally to secure access to One Lake S3 buckets.
Use Enterprise GIT repositories for CI/CD Pipeline.
Deploy lambda code through Terraform scripts.
Monitor existing batch jobs using Splunk dashboard and Cloud Watch logs.
Support batch jobs through PagerDuty and Slack channel notifications.
Timely updates to KYC consumers team, which shouldn’t interrupt downstream team Service Level Agreements.
Managed project scope, schedule, status and documentation.
Conducted research to evaluate systems design and process efficiency.
Provided educational expertise and mentoring to junior team members
Automated monitoring and security measures to reduce required employee attention

Technical Lead /Architect

Capital One

Chicago, IL

08.2018 - 09.2019

Description:

Fill The Lake is a Horizontal Bank wide initiative for Capital One which also includes partners within DTS Enterprise wide. The key goal of this initiative is to align with the Target Bank Data Platform (Hadoop Data Lake). There are multiple teams working towards getting the data to the lake for various subject areas. Migrate all the legacy system to Cloud Platform.

Responsibilities:

Responsible for interacting with business team & IT stakeholders for requirement gathering, risk assessment, finalization of technical specifications
Managing code Build activities & helping team members in case of troubleshooting
Integral part of Technical team providing support for development as Technical Lead
Responsible for evaluating alternate solutions and prepare detail design and present to technical panel and ensure that design is fully approved
Responsible for data delivery within SLA.
Managed project planning, resource allocation, scope, schedule, status and documentation.

Tools : AWS Web Services, Big Data, Spark Master, DataStage, Python, Pig, Jenkins, Snowflake, RedShift, Oracle 12c, Teradata

Technical Lead

Capital One

Chicago, IL

03.2018 - 08.2018

Description:

Data Center Exit team will decommission the Whirl and migrate the data into Cloud Such as AWS S3 Buckets &One Lake. For User Consumption, the data will be registered in Nebula and provided the data access through Cerebro Views.

Responsibilities:

Responsible for interacting with business team & IT stakeholders for requirement gathering, risk assessment, finalization of technical specifications
Managing code Build activities & helping team members in case of troubleshooting
Act as a single point of contact for the project and coordinating with external teams such as Database Administrators, Release Management, and Operation Team & Infrastructure teams
Responsible for evaluating alternate solutions and prepare detail design and present to technical panel and ensure that design is fully approved
Responsible for data delivery within SLA.
Designed support structure needed to confirm scalable application rollouts.

Tools : AWS Web Services, Spark Master, IBM InfoSphere DataStage, Python, Pig, Jenkins, Cerebro, Redshift

Technical Delivery Lead

Monsanto

Saint Louis, MI

09.2015 - 03.2018

Description:

Crop Data Warehouse (CDW) is independent data warehouse application that supports many Monsanto U.S. Row Crops business sectors. Its data content is focused around domestic sales history but also includes agronomic data from external sources. CDW has supported the analytic and reporting.

Responsibilities:

Responsible for interacting with business team & IT stakeholders for requirement gathering, risk assessment, finalization of technical specifications
Managing code Build activities & helping team members in case of troubleshooting
Monitored and tracked project progress to support timely completion.
Act as a single point of contact for project and coordinating with external teams such as Database Administrators, Release Management, and Operation Team & Infrastructure teams.
Updated customers and senior leaders on progress and roadblocks.
Created and implemented contingency plans to address potential risks.

Tools : PL/SQL, LINUX, Oracle 12c, Teradata, Informatica, Business Objects (BOBJ) .

SME / Project Delivery

Walmart Stores INC

Chennai, India

03.2015 - 09.2015

Description:

Migrate current Logistics ETL code from “ETL Manager Tool” Runs in Perl Scripts and Teradata BTEQ to DataStage v9.1 which is Wal-Mart enterprise standard for data. As there is No alert mechanism to highlight ETL job performance issues, Additional effort in issue resolution or new script development and needed manual effort and multi-skill support towards ETL maintenance, hence Business wanted to move it from Perl to DataStage.

Responsibilities:

Responsible for interacting with customer/business team for requirement gathering, risk assessment, finalization of technical specifications
Liaise with Business Analyst to align project requirements and define scope of warehouse into low level design documents
Managing code Build activities & helping team members in case of troubleshooting
Responsible for preparing Unit Test plan & test cases perform Unit Testing, regression testing
Partnered with project team members to identify and quickly address problems.
Resolved staff conflicts and identified potential areas of improvement.

Tools : IBM AIX, IBM InfoSphere DataStage v9.1, LINUX, Oracle 12c, Teradata, Informatica Power Center

DataStage Developer

Walmart Stores INC

Chennai, India

12.2014 - 03.2015

Description:

Work with the International Compliance team to ensure that the software platform is compatible with the risk assessment methodology being developed by International Compliance Monitoring team.

Responsibilities:

Responsible for conducting system analysis and finalizing technical / functional specifications
Data Mapping and understanding data modeling of new tables
Code tuning of under performing DataStage jobs and fixing production fixes as per business requirement.
Created and maintained database processes to support key business systems.
Built and maintained enterprise ETL processes IBM InfoSphere DataStage.

Tools : IBM InfoSphere DataStage v9.1, LINUX, Oracle 11g, SOAP UI.

DataStage Developer / Team Lead

Walmart Stores INC

Chennai, India

07.2014 - 10.2014

Description:

The Anti-Money Laundering system is responsible for retrieving information from the Market Basket

System, generating Alerts and allowing Case Managers to handle the alerts appropriately. It is used to identify the higher amount transactions via card/cash in each facility across globe except US.

Responsibilities:

Responsible for preparing high level and low level design documents from Business functional specifications
Responsible for maintaining coding standards and code optimization for any new code delivered.
Responsible for preparing test cases and performing Unit Testing, regression testing, building test data
Evaluated, designed, implemented and modified databases and database applications.
Aided with development of presentation layer for custom reporting and business intelligence.

Tools : IBM DataStage v9.1, korn Shell Scripting, Oracle 11g

ETL Developer

Marks & Spencer

London, United Kingdom

07.2012 - 03.2014

Description:

Altitude Program is a centralized data warehouse which is critical to the UK Retail company Sales, Waste, Availability and retail operations. It captures data from their transactional sources. Its initial goal is to pool data from their all the locations at the given interval, integrate and create business reports.

Responsibilities:

Handling DataStage production job failure, Ad-hoc request of ETL jobs, low performance jobs, File System/permission issues
Modify Business Object web reports on request from Business Objects web reports on request from Business Objects web users
Analyzed technical feasibility and suggests solution based on requirements, Specify Report design, resolving user queries, responding tickets and maintaining SLAs, Testing and fixing bugs
Involved in production deployment, Performance job tuning activities.
Wrote and optimized in-application SQL statements.
Managed data quality issues during ETL processes, directing qualitative failures to team lead for amelioration.

Tools : IBM InfoSphere DataStage v8.1, DB2, Control -M, Oracle 11g, Business Objects.

DataStage Developer & Administrator

Brakes Food Service, Kent, UK

Chennai, India

03.2011 - 05.2012

Description:

The client is a food services retailer having its sales outlets across UK. As a part of its business reporting purposes its data warehouse is loaded on daily basis which involves the technical integration of product, customer, billing, cost & budget, inventory, orders and deliveries information from diverse source system. The data feeds are extracted from ERP SAP/R3 after transformation data loaded to its CBM.

Responsibilities:

Systematic analysis of jobs performed on long running jobs to raise RFC to CAB for better performance of job.
Involved in scheduling DataStage jobs& Involved in Migration of DataStage 7.5.2 to 8.01.
Involved in performance tuning of DS jobs and Adhoc historic data push.
Co-ordinate with Onsite team primarily during downtime of different systems like SAP R/3, BW and SQL Server.
Created and maintained reporting and auditing to support sales, client success, professional services, data production and accounting teams.
Delivered complex business requirements without any issues by leveraging strong troubleshooting skills and extensive oversight.
Identified issues within databases, performed troubleshooting and implemented effective solutions.

Tools : IBM InfoSphere DataStage v8.1, HP UNIX, Oracle 10g

DataStage Developer / Support Analyst

Brakes Food Service, Kent, UK

Chennai, India

03.2010 - 02.2011

Description:

The objective of this project is to integrate three ERP’s (Minster, Concerto and SAP) which is used in an organization. Message Integration (MI) is the DataStage Message Integration service that sits between the three current ERP’s. The four main processes within MI are MIP, MTP, MOP and SRM. MIP processes I/B messages, CSV files and Idoc’s.

Responsibilities:

Worked with FTP stage, send mail activity stage, terminator activity, sequencers and job control as part of the development assignment
Involved in preparing Unit test cases
Involved in Coding using Source to Target Mapping.
Identified issues within databases, performed troubleshooting and implemented effective solutions.
Developed scripts and processes for data integration and maintenance.

Tools : IBM InfoSphere DataStage v8.1, HP UNIX, Oracle 10g

Education

Bachelor of Engineering - Electronics & Communication

Anna University

Chennai, India

2008

Skills

Active Listening

Teambuilding

Data analysis

Self Motivated

Decision-making

Training and Development

Cloud Tech:

AWS Services - EC2, EMR, S3, Lambda, SNS, Terraform, Cloud Watch
Azure : Data Bricks, Data Factory
DevOps: Jenkins, Artifact, Con-Course
IaaS and PaaS

Databases :

Oracle, IBM DB2, Teradata, Netezza, AWS RedShift, Aurora, Snowflake, PostgreSQL

Extraction Transformation and Loading (ETL) :

AtScale, NiFi, PySpark, DataStage 75x, 81, 85 & 91, 117, Informatica Power Center, DMExpress DMX-H (SyncSort) 931,

Big Data Ecosystems :

Hadoop, HDFS, Hive, Presto, Pig, Yarn, Spark

Programming Languages :

PL/SQl, Python, Scala

Scheduling Tools:

Apache Airflow, Automic Cloud based - Arow, Control -M, Crontab

Agile Methodologies:

Waterfall, Kanban

Data Visualization & Reporting :

SAP Business Objects, Power BI, Tableau

Certification

➢ Completed IBM Certified Solution Developer on InfoSphere DataStage v8.5 & v9.1

➢ Completed IBM Netezza Technical Mastery Professional v1

Timeline

Senior Data Engineer

Fidelity Investments

05.2021 - Current

Technical Lead /DevOps

Capital One

05.2020 - 05.2021

Technical Lead/ Architect

Capital One

09.2019 - 05.2020

Technical Lead /Architect

Capital One

08.2018 - 09.2019

Technical Lead

Capital One

03.2018 - 08.2018

Technical Delivery Lead

Monsanto

09.2015 - 03.2018

SME / Project Delivery

Walmart Stores INC

03.2015 - 09.2015

DataStage Developer

Walmart Stores INC

12.2014 - 03.2015

DataStage Developer / Team Lead

Walmart Stores INC

07.2014 - 10.2014

ETL Developer

Marks & Spencer

07.2012 - 03.2014

DataStage Developer & Administrator

Brakes Food Service, Kent, UK

03.2011 - 05.2012

DataStage Developer / Support Analyst

Brakes Food Service, Kent, UK

03.2010 - 02.2011

Bachelor of Engineering - Electronics & Communication

Anna University