Summary
Overview
Work History
Education
Skills
Certification
Projects
Timeline
Generic

IPSITA DANDAPATH

Winter Garden,US

Summary

12+ years of experience in analyzing large volumes of data for efficient business solution. Experience in health insurance and banking domain. Expertise in data modelling and data warehousing concepts. Expertise in ingestion of data from legacy systems to AWS s3 buckets and refreshing Elasticsearch. Proficient knowledge in statistics, data visualization and predictive analytics. Skilled ETL professional with expertise in extracting data from various source systems, transforming and loading into target systems, ensuring data quality and integrity. Extensively worked on data preparation, proficient in data mining activities. Good knowledge of programming languages like Python, R and Julia. Worked on SQL and NoSQL databases. Expertise in big data processing using framework such as Hadoop and Spark. Worked on Profiling, Classification and Masking of sensitive data. Expertise in Data provisioning, Data masking, Reviews and Validations. Worked on projects involving privatization of sensitive data in test environment using TDM tools like Delphix, MaskIT, IBM Optim and Informatica TDM. Collaborating with data analytics and business stake holders to understand data requirements and develop ETL solutions. Worked on synthetic data creation using Facets. Prior experience of 5 yrs. in mainframe development. Masked data in mainframe environment using JCL scripts and DB2 queries. Proficient with all stages of Software Development Life cycle (SDLC). Experience in IT service management tool Servicenow. Self-motivated individual with good communication, interpersonal skills and problem-solving capabilities.

Overview

17
17
years of professional experience
1
1
Certification

Work History

Lead Data Developer -Master of Data

Elevance Health
Virginia
02.2023 - Current
  • Ingestion of large datasets from legacy systems to AWS S3 buckets, ensuring minimal downtime and data loss
  • Extracted data from various sources and transformed and loaded data into SQL databases using informatica workflows
  • Loaded data into Oracle RDS from MySQL databases using airflow dags
  • Used AWS EMR service to run big data processing applications in cloud
  • Loaded data into Amazon S3 for analytics and Reporting
  • Refreshed Elasticsearch indexes for provider finder application, improving search performance and accuracy
  • Extracted data from Amazon S3 for various applications
  • Transformed data using AWS Glue transforms and python scripts
  • Collaborated with cross functional teams to ensure seamless data integration and testing
  • Proficient in ensuring data integrity, scalability and efficiency
  • Optimized ETL workflows for performance and scalability
  • Worked in production support ensuring smooth running of the application
  • Prepared extensive documentation for application flow and logging production issues
  • Collaborated with development team to identify and resolve root causes
  • Provided handover to offshore team members and ensured smooth transition to avoid any discrepancy in data loads and meeting the SLAs for production loads.

Lead Data Developer -TDM (GBD facets)

Elevance Health
Virginia
01.2022 - 02.2023
  • As a data analyst in test data management, I was involved in analyzing Business requirements tracked through Jira epics and stories and understanding the data dependencies from various scrum teams
  • Working with teams to understand the requirement and scope to ensure complete test data availability within given timeline
  • Working on synthetic data creation including membership, claims data using facets application
  • Identify automation opportunities and create automated solutions to improve the data management process
  • Wrote complex SQL queries using tools like Oracle SQL developer to identify data from various systems for analysis, reporting and data warehousing
  • Used SOAP UI and Postman to validate the data setup
  • Coordinate with other dependent systems to ensure seamless navigation for members between various portals and applications.

Project Lead - ServiceNow

Sidel-Delaval
India
03.2015 - 12.2016
  • Understanding gaps between core model ADM and requirements detailed in qualification phase of 'Delaval/Sidel'
  • Understanding the different processes identified during the qualification phase with Delaval and Sidel
  • Requirement gathering and testing for different ServiceNow modules like 'Change Management', 'IncidentManagement',' Problem Management' and 'Knowledge management' in service now
  • Working with team lead on design documents for custom attributes on managed instances, processes to be implemented and creating test case document changes done on managed instances
  • Working, preparing and testing cases on SAP Solman and AD integration.

Project Lead - Test Data Management

Assurant
India
08.2014 - 02.2015
  • Gathering and analyzing requirements for IBM Optim assessment, working on the assessment methodology
  • Designed and developed ETL workflows using IBM optim
  • Understanding the current TDM setup in various applications of AEB, listing the key findings and limitations of existing data masking processes using traditional techniques
  • Listing approaches for optim implementation with their advantages and disadvantages and providing the criteria to prioritize the implementation of 'IBM Infosphere Optim' on various applications using bucket classification/prioritization
  • Providing a high-level implementation roadmap; define and document project processes and best practices as a part of the team transition to using agile methodologies for software development.

Test Analyst - Test Data Management

Westpac
India
07.2013 - 08.2014
  • Analysis of data for sensitive fields, deciding the masking rule and getting approval from clients
  • Coding JCL scripts, DB2 queries for masking and code review; masking flat files and VSAM files using JCL scripts and masking DB2 tables using queries
  • Client coordination for issue/query resolution and maintaining application trackers and delivering masked data to application teams
  • Training of less experienced team members.

Test Analyst - Test Data Management

US Capital Groups
California
09.2012 - 06.2013
  • Proficient handling of data masking requests from application teams and of Delphix profiler to determine sensitive columns in the database
  • Creation of custom algorithms to handle masking of specific columns
  • Classification of data based on sensitive information like PII, PHI, Entity/transactional data
  • Deciding masking rule for database, procuring client approval on fields to be masked
  • Creation of ruleset, inventory and jobs in Delphix tool, execution of masking jobs
  • Analyzing, fixing issues and maintaining application trackers, delivering masked data to applications the team
  • Training/orientation of less experienced project team members.

Programmer Analyst - Test Data Management

Anthem
Ohio
08.2011 - 07.2012
  • Data provisioning as per testing team requirements (SIT and UAT), capturing and prioritizing client requests
  • Creation of members and claims data for various applications like NASCO, WDS, WMDS, etc
  • Coordinating with different stakeholders of the project, offshore team members working on different modules and with application teams for fixing issues
  • Data creation and provisioning on a mainframe environment, usage of macros for large volumes, creating data for performance testing and resolution of issues raised by offshore team
  • Maintenance/delivery of daily and weekly reports to team, data to testing teams and monitoring data progress via application tracker.

Senior Software Engineer

Anthem
India
09.2009 - 07.2011
  • Requirement analysis for clients, production of detailed design documents and proficient in upstream and downstream applications of PPR systems
  • Coding on mainframe applications, development and maintenance of COBOL DB2 programs
  • Experienced in peer code reviews and issue fix, was responsible for managing defect tracker and defects of the application, loading data from mainframe DB2 to Teradata using informatica, unit and system testing
  • Implementation planning/support activities, mentoring new team members.

Software

American Express
India
12.2007 - 08.2009
  • Worked on AMEX, 'Customer Privacy Capability' system between Nov 2008 - Aug 2009; Responsible for working on 'Credit Card Fraud' system between Dec 2007 - Oct 2008
  • Requirement analysis for clients, production of detailed design documents
  • Coding on mainframe applications, development and maintenance of COBOL DB2 programs
  • Experienced in peer code reviews and issue fix, responsible for managing defect tracker and application defects, unit integration testing of mainframe applications and implementation of planning/support activities.

Education

MS in Data Analytics -

University of Central Florida
12.2022

B.E. in Biotechnology -

BVB College of Engineering and Technology
12.2007

Skills

  • Python
  • R
  • SQL
  • JavaScript
  • COBOL
  • COBOL-DB2
  • JCL
  • Amazon EMR
  • AWS glue
  • Amazon S3
  • Jupyter Notebook
  • Observable Notebook
  • R studio
  • Google Colab
  • Power BI
  • IBM optim
  • Delphix
  • Mask IT
  • Informatica
  • ServiceNow
  • BO reporting
  • Jira
  • Clarity
  • Excel-vlookup
  • Linear Regression
  • Logistic Regression
  • XGBoost
  • Decision Trees
  • Random Forest
  • SVM
  • Time-series Analysis (ARIMA)
  • Predictive Modeling
  • K-means clustering
  • Hierarchical clustering
  • PCA analysis
  • Bayesian Networks
  • Natural Language Processing
  • MySQL
  • Oracle DB
  • DB2
  • Sybase
  • MongoDB
  • Machine Learning
  • Statistical analysis
  • Network science
  • Parallel and Distributed Databases
  • Parallel and Cloud Computation
  • Data Mining Methodology
  • Interactive Data Visualization

Certification

  • Code Clinic: R
  • Data Wrangling in R
  • Introduction to Programming Using Python
  • Essential Math for Machine Learning: Python Edition
  • Python Statistics Essential Training
  • Python for Data Science Essential Training Part 1
  • Python for Data Science Essential Training Part 2
  • IBM Certified Specialist Infosphere Optim for Distributed Systems Fundamentals (March 2020)
  • Mask IT
  • ITIL concepts
  • Delphix
  • Mainframes

Projects

Southern State Toyota Lift, 2022-01-01, 2022-05-31, The purpose of this project is to bring value to the end-user which would be resulting in improved customer loyalty and opening new revenue sources for the Southern States Toyota lift (SST). The results of this project would help SST customers in understanding about their forklift economic life cycle which would eventually save them a lot of time and money resulting in improved customer satisfaction., Preprocess data based on statistical analysis of maintenance work orders and domain knowledge., Develop a multivariate regression model which will accurately predict the repair cost of a forklift., Determine the optimal replacement point for the forklifts (intersection of acquisition cost curve and regressed repair cost curve). This visualization will be effective for making decisions related to forklift replacement., Used Python programming language to build the required machine learning model. Student Performance Data Analysis, 2021-05-01, 2021-08-31, Study of the student performance data in an exam was done to determine the optimal number of clusters using unsupervised learning techniques like Partitioning and Hierarchical clustering using R, Logistic regression model was built to determine if a student is going to pass or fail based on the categorical predictors in the data Super Market Sales Data Analysis, 2021-01-01, 2021-05-31, Overall predictive analysis of the sales volume, profit and market layout to gage the future sustainability of the supermarket based on its sales., Preliminary machine learning models were built such as Linear Regression, Random Forest, Gradient boosting Regression, Support Vector Regression model. Their performance was calculated using MSE and R squared value., Time Series analysis of Gross income data was done for a period of 3 months., ARIMA model was implemented for the time series forecasting. Life Expectancy & Its Influencing Factors, 2021-01-01, 2021-05-31, This is a Data Visualization project where we provided an overview on the life expectancy of people living across the world and influence of mortality rates & GDP using javascript on observable notebook., Observed the influence of health care workforce and road accidents over life expectancy of humans, Observed the influence of diseases and efficient management of essential services (like drinking water & sanitization) on life expectancy.

Timeline

Lead Data Developer -Master of Data

Elevance Health
02.2023 - Current

Lead Data Developer -TDM (GBD facets)

Elevance Health
01.2022 - 02.2023

Project Lead - ServiceNow

Sidel-Delaval
03.2015 - 12.2016

Project Lead - Test Data Management

Assurant
08.2014 - 02.2015

Test Analyst - Test Data Management

Westpac
07.2013 - 08.2014

Test Analyst - Test Data Management

US Capital Groups
09.2012 - 06.2013

Programmer Analyst - Test Data Management

Anthem
08.2011 - 07.2012

Senior Software Engineer

Anthem
09.2009 - 07.2011

Software

American Express
12.2007 - 08.2009

MS in Data Analytics -

University of Central Florida

B.E. in Biotechnology -

BVB College of Engineering and Technology
IPSITA DANDAPATH