Vibhor is an accomplished data leader with over 16 years of technology leadership, delivering data and analytics solutions for Fortune 100 corporations. He currently works at Citibank NA within the Global Spread Products (GSP) data team as lead architect for various cloud-based and in-house big data solutions, and as service owner for OLAP systems such as Apache Pinot and Trino. He is also responsible for delivering multiple real-time risk projects, called Sparta, with the GSP Risk team.
Project #23: Service Architect and Technical Anchor – MBS Products Snowflake Data Warehouse
Duration: October 2022 – Till date
Project Description– Technical Anchor for onboarding Mortgage-Backed Securities datasets to Snowflake on AWS. Create the data lake ecosystem joining on-premise and cloud-based data warehouses and expose it through Jupyter Hub, Tableau, Qlik and a variety of utilities. The scope is the same as Project #20, but on a different platform.
Tech Stack: AWS, Kafka, Confluent, Snowflake, Java, Jupyter Hub, Starburst (aka Trino).
Project #22: Service Architect and Service Leader – Apache Pinot as a Service – Sparta – Spread Products Real-Time Risk System
Duration: October 2021 – Till date
Project Description– Leader and Technical Anchor to create an ultra-low-latency real-time risk warehouse that supports real-time upsert use cases and scales to the petabyte level.
Tech Stack: Kafka, Apache Pinot Platform, Trino/Starburst.
Project #21: Service Architect and Technical Anchor – Starburst/Trino as a Service – Spread Products End-of-Day Risk System
Duration: October 2021 – Till date
Project Description– Leader and Technical Anchor to create an end-of-day risk warehouse that integrates multiple systems such as Kafka, KDB, Pinot, S3, Oracle, SQL, etc. and runs big data analytics without much ETL effort.
Tech Stack: Kafka, Apache Pinot Platform, Trino.
Project #20: Service Architect and Service Leader – MBS Products BigQuery Data Warehouse (Discontinued)
Duration: October 2018 – October 2020
Project Description– Technical Anchor for onboarding Mortgage-Backed Securities datasets to BigQuery on Google Cloud. Created the data lake ecosystem joining on-premise and cloud-based data warehouses and exposed it through Jupyter Hub, Tableau, Qlik and a variety of utilities.
Tech Stack: Kafka, Confluent, BigQuery, Google Cloud Storage, Java.
Project #19: Service Architect and Service Leader – Flink Platform as a Service
Duration: October 2018 – October 2020
Tech Stack: Apache Flink
Project #18: Service Architect and Service Leader – Jupyter Hub as a Service
Duration: October 2018 – October 2021
Project Description– The existing securitized markets group had a huge system built on top of Hadoop. Traders run their data analytics on Impala and Spark using Jupyter notebooks. Shared cluster usage led to performance issues and concerns. Analyzed and reworked the system to make it more robust and fault tolerant.
Tech Stack: Spark, Jupyter Hub, Impala, Livy, Hive, Presto.
Project #17: Pfizer – Service Architect and Service Leader – Spark as a Service on AWS EC2
Duration: January 2018 – September 2018
Project Description– Designed and developed Spark as a service from scratch as a non-HDFS service on NAS and AWS S3. Integrated Sentry, Jupyter, Hive, Spark and Apache Livy to provide a scalable, fault-tolerant service to Pfizer stakeholders.
Tech Stack: Sentry, Jupyter, Hive, Spark, Apache Livy.
Project #15: Pfizer – AWS/Big Data Architect – PGS Data Lake
Duration: January 2018 – September 2018
Project Description- Worked with Pfizer as a Platform/Application Architect, Consultant, and Big Data and Data Warehousing Expert to architect the PGS Data Lake project. Built the UDH data lake as a single source for accessing data from 10 source systems, with an estimated size of 170 TB.
Tech Stack: Core Java, Python, AWS Lambda, S3, EC2, AWS SES, SNS, DynamoDB, Redshift, RDS, Service Catalog, EMR, Hive, Spark, Talend, Snowflake, Tez, AWS, Impala, API Gateway, custom-developed Work Bench layer for presentation.
Project #14: Pfizer – AWS/Big Data Architect – GBI Platform and Analytics Team
Duration: June 2017 – April 2018
Project Description- Worked with Pfizer as a Solution Architect, Consultant, and Big Data and Data Warehousing Expert, serving different internal Pfizer teams on AWS and big data. Served as the Service Owner for Redshift, RDS and NoSQL engines. Provided consulting and support mainly to the PGS, CSAT and SDC projects at Pfizer.
Tech Stack: Core Java, Python, AWS Lambda, S3, EC2, AWS SES, SNS, DynamoDB, Redshift, RDS, Service Catalog, EMR, Hive, Spark, Snowflake, Tez, AWS, Impala.
Project #13: Dun & Bradstreet – AWS/Big Data Architect – Web Visitor ID
Duration: August 2016 – May 2017
Project Description- Dun & Bradstreet wanted a digital platform to venture into a new line of business called Audience Intelligence as part of their sales and marketing team. This required a digital streaming platform to be developed using lean principles. The platform is required to serve an unlimited number of customers, with real-time stream processing as the cornerstone of the business model.
Tech Stack: Core Java, Spring, Python, AWS Lambda, Kinesis Firehose, S3, Glacier, EC2, AWS SES, SNS, DynamoDB, Redshift, AWS API Gateway, custom-developed Work Bench layer for presentation.
Project #12: Walgreens – Architect – Tokenization and Encryption
Duration: February 2016 – August 2016
Project Description- The objective of this project is to enhance the existing data security measures for credit card data during capture, in transit, or in storage within any Walgreens system. In other words, a patient’s credit card number entered via Walgreens’ point of sale, web applications, mobile app, etc. will be encrypted.
Tech Stack: Core Java, JSP, Ajax, JavaScript, Spring, Spring Boot, Enterprise Architect, HP Voltage.
Project #11: Walgreens – Architect – Patient Monograph
Duration: Oct 2015 – February 2016
Project Description- The Patient Education Monograph project seeks to replace the existing drug monograph file received from the vendor (Wolters Kluwer/Medi-Span) with a more up-to-date file. The new file supports more granular information based on NDC codes and is available in 19 languages, although for now Walgreens will support nine additional languages.
Tech Stack: Core Java, JSP, Ajax, JavaScript, Spring, Web Services, SOA, Hibernate, Spring Boot.
Project #10: Walgreens – Architect – Single EAR
Duration: May 2014 – September 2015
Project Description- This project was initiated by the Run team to have a single EAR file for all environments. This allows the Run team (non-production) to add or modify environment configurations such as arch version, service broker endpoint, data sources, authenticator configuration, and SSL certificates without requiring a rebuild of the application.
Tech Stack: Core Java, JSP, Ajax, JavaScript, Spring, Hibernate, Spring Boot.
Project #9: Walgreens – Architect – WePre WAS Migration
Duration: Jan 2014 - May 2014
Project Description- The Well Experience Pharmacy Rollout Expansion (WE-PRE) program enables the expansion of the Well Experience Bridge model in all regulatory environments, with the long-term vision of supporting chain-wide volumes. We were assigned to migrate the whole infrastructure from Tomcat to WebSphere.
Tech Stack: Core Java, JSP, Ajax, JavaScript, Spring, Hibernate.
Project #8: Dun and Bradstreet – Architect – Integration Manager
Duration: June 2013-December 2013
Project Description- Current limitations in traditional D&B Entity Matching and Hoover’s name and address search capabilities prevent some customers from being able to use D&B data. Some customers (e.g., Intelligence Community analysts) need a way to rapidly search D&B data using partial information (e.g., a nickname) in an unspecified field. The traditional Match engine requires a specific country and/or state, can’t use industry categories, and doesn’t handle nicknames. Hoover’s search API does not perform these functions either. SolrCloud emerged as the solution to this problem.
Tech Stack: Core Java, JSP, Ajax, JavaScript, Spring, SolrCloud.
Project #7: DNB – Golf Match System Reengineering
Duration: April 2013 – June 2013
Project Description- The Golf Reengineering task was assigned to ensure a better match system. I developed a parallel match system that provides better results than the existing MDM, using the Solr search engine and J2EE. We ran multiple cycles of the Diagonal Analysis, which ensured a much better match rate than MDM can provide.
Tech Stack: Core Java, JSP, Ajax, JavaScript, Spring, Solr.
Project #6: Toyota, Social Listening Solution
Duration: July 2011-March 2013.
Project Description- The Social Prism solution provides the necessary infrastructure for data mining, processing and brand monitoring over large amounts of social media data. The product was capable of sentiment analysis and topic extraction. My role was to design, architect and code the data processing flow and the search layer.
Tech Stack: Solr, HDFS, Hadoop, AWS EC2, CloudWatch, EMR.
Project #4,5: Amex – Developer – NGP-FILE DISTRIBUTOR – Atlas
Duration: March 2009-July 2011.
Project Description-
· The Next Generation Profile system is a global repository of traveler profile data. This repository will be used to provide profile data to downstream systems and allow a traveler to manage their profile directly.
· American Express clients enter travel reservation data into the CRS, either through a travel agent or via an American Express automated product. The data are entered into the CRS pseudo city (PCC) queue in the form of Passenger Name Records (PNRs).
Tech Stack: Core Java, J2EE, EJB, Spring JDBC, JAXB, JXL, WebSphere ND Server 6.1, MQ 6.1, MS SQL 2005, SQL Server Integration Services 2005 (SSIS).
Project #1,2,3: John Hancock – Developer – Common Log Fixes, Annual Contract Review, Contract Number Expansion
Duration: Jan 2007-Feb 2009.
Project Description-
· Common Log Fixes addressed existing production issues on the EComm side.
· The Annual Contract Review (ACR) is an existing retention tool used by business people and sales office personnel. We had to fix the existing issues in the application.
· John Hancock Retirement Plan Services offers qualified retirement plan products and services for corporate and small to mid-sized businesses. The scope of the project was to expand the contract number from 5 digits to 7 digits wherever it is used.
Tech Stack: Java, J2EE, Struts, ELB, DAOs, Oracle, etc.
Programming Languages: Java, PySpark, Python