
• 14+ years of DW/BI and analytics experience.
• Using Amazon Web Services (AWS), worked on application migration from on-prem servers to the cloud, involving end-to-end data transformation using S3, EC2, EMR, Lambda, SNS, Lake, RDS, Redshift, Snowflake, and Apache Airflow workflows.
• Extensive knowledge of migrating on-premises systems to the AWS cloud; worked on Ab Initio to PySpark and DataStage cloud migration projects.
• Expertise in developing, implementing, optimizing, and troubleshooting complex data warehouse databases on Snowflake.
• Migrated AWS Redshift and Teradata objects into the Snowflake environment.
• Good knowledge of Snowflake database, schema, and table structures.
• Good knowledge of Snowflake data modeling, ETL using Snowflake SQL, and standard ETL concepts.
• Development on Big Data in the cloud with EMR/EC2 instances involving Spark, Hadoop, Pig, and Hive.
• Played a DevOps Engineer role and managed all cloud servers and services, fixing vulnerabilities and making them compliant in audit reports.
• Hands-on experience creating new servers with proper IAM roles, security groups, and S3 bucket policies.
• Extensive knowledge of SQL databases such as Amazon Redshift, Snowflake, Oracle, DB2, and Teradata.
• Support experience with monitoring built on Splunk dashboards and CloudWatch logs, with alerts through PagerDuty and Slack channels.
• Proficient in ETL development using IBM WebSphere DataStage; strong in ETL architecture design, DWH, and writing SQL queries.
• Completed the AWS Solutions Architect Associate course with Udemy.
• Development and implementation through SDLC methodologies (Agile, Scrum/iterative development).
• Obtained a detailed understanding of data sources, including EBCDIC, flat, Parquet, and COBOL schema files and their variations (redefines, arrays), and complex data schemas.
• I have proven to be an astute professional in Banking and Retail services, where I had the opportunity to apply my extensive knowledge of both domains.
Responsive expert experienced in monitoring database performance, troubleshooting issues, and optimizing database environments. Possesses strong analytical skills, excellent problem-solving abilities, and a deep understanding of database technologies and systems. Equally confident working independently or collaboratively as needed, with excellent communication skills. Detail-oriented team player with strong organizational skills. Able to handle multiple projects simultaneously with a high degree of accuracy.
Description:
The MTW (Managing The Work) product uses different sources of record to track People, Work, Finance, and Objective data. People data comes from Workforce Connect, Work data comes from Jira/Jira Align, and Objective data comes from DOMO Goals. Financial data is then costed and generated based on a combination of data from these tools. Making sense of the data going into these separate tools would require a hefty learning curve, multiple ACRs, and frankly some skilled analysts to put it all together; this is where the MTW data model comes into play.
The DataSavvy squad has consolidated all this data into one model that is easy to consume, lightweight, and provides the insights the business needs from these tools for accurate planning, data, costing, and management.
Responsibilities:
• Create data models using dimensional modeling.
• As a technical consultant, provide guidance and support for team members to complete their stories/tasks.
• Collaborated on ETL (Extract, Transform, Load) tasks, maintaining data integrity and verifying pipeline stability.
• Contributed to internal activities for overall process improvements, efficiencies and innovation.
• Created Lens, Attribute, Hierarchy, and Matrix materialized views as business needs arose.
• Build/Add Alerts exceptions/Super Scores for Transparency Reporting (Enterprise Level).
• Extract data from the Workforce Connect Oracle database, stage it in Snowflake, and feed it to reporting.
• Extract data from AWS S3 buckets using Snowpipe, stage it in a Snowflake internal stage, and load it into the corresponding target tables/views.
• Implemented and maintain checksum logic for data quality and data lineage purposes.
• Responsible for production batch SLAs and job failures.
• Address user queries as they are raised.
• Maintain C2C boxes in all environments; responsible for applying patches and making sure servers are compliant.
• Working on a proof of concept with the AtScale analytics tool to build aggregates as a service layer.
• Planned the conversion of on-prem C2C boxes into AWS EC2 instances.
• Add/update MTW object metadata in the Alation tool.
• Applied loss functions and variance explanation techniques to compare performance metrics.
Tools : Snowflake, AWS, AtScale, Alation, DOMO, Python, WFC, Oracle, Power BI, Tableau, Control-M, C2C
Description:
Responsibilities:
Tools : Amazon Web Services (AWS), Big Data, Spark Master, DataStage, Python, Pig, Jenkins, Snowflake, Redshift, Oracle 12c, Teradata
Description:
As part of the Know Your Customer (KYC) program, reports were sent to the Federal team on a monthly basis. The downstream analytics team was facing multiple issues with this report, so the existing KYC jobs had to be refactored. The new design addresses the concerns related to reconciliation and validation by using true source files from the Mainframe and First data source teams.
Responsibilities:
Description:
Fill The Lake is a horizontal, bank-wide initiative for Capital One that also includes partners within DTS enterprise-wide. The key goal of this initiative is to align with the Target Bank Data Platform (Hadoop Data Lake). Multiple teams are working to bring data into the lake for various subject areas and to migrate all legacy systems to the cloud platform.
Responsibilities:
Tools : Amazon Web Services (AWS), Big Data, Spark Master, DataStage, Python, Pig, Jenkins, Snowflake, Redshift, Oracle 12c, Teradata
Description:
The Data Center Exit team will decommission the Whirl and migrate the data into cloud storage such as AWS S3 buckets and One Lake. For user consumption, the data will be registered in Nebula, and data access will be provided through Cerebro views.
Responsibilities:
Tools : Amazon Web Services (AWS), Spark Master, IBM InfoSphere DataStage, Python, Pig, Jenkins, Cerebro, Redshift
Description:
Crop Data Warehouse (CDW) is an independent data warehouse application that supports many Monsanto U.S. Row Crops business sectors. Its data content is focused on domestic sales history but also includes agronomic data from external sources. CDW supports analytics and reporting.
Responsibilities:
Tools : PL/SQL, LINUX, Oracle 12c, Teradata, Informatica, Business Objects (BOBJ)
Description:
Migrate the current Logistics ETL code from the “ETL Manager Tool”, which runs Perl scripts and Teradata BTEQ, to DataStage v9.1, the Wal-Mart enterprise standard for data. The existing setup had no alert mechanism to highlight ETL job performance issues, and issue resolution, new script development, and ETL maintenance required additional manual effort and multi-skill support, so the business wanted to move from Perl to DataStage.
Responsibilities:
Tools : IBM AIX, IBM InfoSphere DataStage v9.1, LINUX, Oracle 12c, Teradata, Informatica Power Center
Description:
Work with the International Compliance team to ensure that the software platform is compatible with the risk assessment methodology being developed by International Compliance Monitoring team.
Responsibilities:
Tools : IBM InfoSphere DataStage v9.1, LINUX, Oracle 11g, SOAP UI.
Description:
The Anti-Money Laundering system is responsible for retrieving information from the Market Basket System, generating alerts, and allowing Case Managers to handle the alerts appropriately. It is used to identify high-value card/cash transactions in each facility across the globe except the US.
Responsibilities:
Tools : IBM DataStage v9.1, Korn Shell Scripting, Oracle 11g
Description:
The Altitude Program is a centralized data warehouse that is critical to the UK retail company's Sales, Waste, Availability, and retail operations. It captures data from the company's transactional sources. Its initial goal is to pool data from all locations at a given interval, integrate it, and create business reports.
Responsibilities:
Tools : IBM InfoSphere DataStage v8.1, DB2, Control-M, Oracle 11g, Business Objects.
Description:
The client is a food services retailer with sales outlets across the UK. For business reporting purposes, its data warehouse is loaded on a daily basis, which involves the technical integration of product, customer, billing, cost & budget, inventory, orders, and deliveries information from diverse source systems. The data feeds are extracted from the SAP R/3 ERP and, after transformation, loaded into its CBM.
Responsibilities:
Tools : IBM InfoSphere DataStage v8.1, HP UNIX, Oracle 10g
Description:
The objective of this project is to integrate the three ERPs (Minster, Concerto and SAP) used in the organization. Message Integration (MI) is the DataStage message integration service that sits between the three current ERPs. The four main processes within MI are MIP, MTP, MOP and SRM. MIP processes I/B messages, CSV files and IDocs.
Responsibilities:
Tools : IBM InfoSphere DataStage v8.1, HP UNIX, Oracle 10g
Active Listening
Teambuilding
Data analysis
Self-motivated
Decision-making
Training and Development
Cloud Tech:
Databases :
Extraction Transformation and Loading (ETL) :
Big Data Ecosystems :
Programming Languages :
Scheduling Tools:
Agile Methodologies:
Data Visualization & Reporting :
➢ Completed IBM Certified Solution Developer on InfoSphere DataStage v8.5 & v9.1
➢ Completed IBM Netezza Technical Mastery Professional v1