Srinath Mandadi

Summary

Highly motivated, certified professional with 11+ years of experience, specializing in Enterprise Data Warehousing, Cloud Computing, Business Intelligence, and Database Administration.

Expertise in analysis, design, development, testing, implementation, enhancement, and support of BI applications, including strong experience in data warehousing (ETL and OLAP) environments as a data warehouse consultant.

Experienced in working end-to-end on downstream applications, handling the engineering and administrative responsibilities of the ETL, ELT, BI, and database applications.

Proficient in data modeling, database design, tuning databases, identifying ETL bottlenecks, and providing quick workarounds to achieve real-time analytics.

Proven ability to collaborate with stakeholders and implement data governance standards, driving efficiency and data integrity across projects.

Overview

12 years of professional experience
2 Certifications

Work History

Data Engineer II

Memorial Sloan Kettering Cancer Center
New York, NY
09.2020 - Current

MSK is building MODE, a first-of-its-kind centralized data platform aimed at expanding access to sophisticated analytics for MSKCC's clinicians, researchers, and administrators at scale, translating institutional information into actionable insights and improving patient care, hospital operations, and clinical research.

Responsibilities:

  • Responsible for the end-to-end delivery of data pipelines using a variety of technologies (AWS Glue Blueprints, Databricks, dbt, Palantir, PySpark, Kubernetes, Informatica Data Integration, etc.) to process a high volume of data on a daily, or more frequent, basis.
  • Design, develop, test, and deploy efficient, reusable data pipelines in Databricks that drive complex applications to support high-quality backend systems, leveraging several programming languages, with a focus on Python and PySpark.
  • Develop an Enterprise Data Lake and Delta Lake to support various use cases, including analytics, processing, storage, and reporting of rapidly changing, high volumes of data.
  • Ensure consistent data quality, establish data standards and governance, and integrate data between on-premises and cloud, both operationally and analytically, using cloud services.
  • Work with business teams to understand business reporting and analytic requirements.
  • Participate in daily stand-up meetings to analyze new requirements and propose technical solutions.
  • Responsible for ingesting the data quickly from a variety of data sources in a scalable way, with optimization techniques and parallel processing, to meet the near-real-time data delivery use cases.
  • Participate in the full development life cycle (Agile Methodology) to support the MODE data platform, requiring integration with other systems, including analysis, design, programming, implementation, and support.
  • Responsible for automating permissions for objects in Unity Catalog by developing a process to assign tags and permissions using an ABAC (attribute-based access control) approach.
  • Participate in the development work with the replatforming team to migrate the data ingestion processes from Palantir to Databricks, and design the workflow to automate the access management on Databricks.
  • Work closely with the Release and Automation team to automate the deployment of the ETL code to higher environments.
  • Implement the reusable data pipelines using AWS Glue Blueprints, which source the data from APIs and various relational databases for building the S3 data lakes and Delta lakes, leveraging the AWS Glue workflows/jobs to load data in S3 and Redshift.
  • Participated in designing the agnostic PySpark data pipelines using AWS Glue to manage and automate the operational processes. Worked with AWS Data Pipeline to configure data loads from S3 to Redshift.
  • Develop data pipelines in Palantir to synchronize the data from Cloud Foundry to Databricks, leveraging the data connector.
  • Collaborate with Data Stewards and develop standards and workflows to handle the Data Governance activities (data discovery, data profiling, data quality, classifications, etc.).

Environment:
Databricks, dbt, Palantir, Cloud Foundry, Informatica DI, MDM, RDM, IBM Cloud Services - Cloud Pak for Data 4.0.4, DataStage 11.3, Cloud Object Storage, BigSQL, Data Virtualization, DB2 database, Kubernetes Cluster, AWS Cloud Services - S3, EC2, AWS Glue 4.0, Lake Formation, Athena, AWS EMR Cluster, Redshift Cluster, REST API, Python 3.7.5, Spark, Tableau 10.5, Cognos, Linux, git, Azure Repos, Docker Images, Helm charts, Argo CD, Rclone, Terraform, etc.

Senior ETL/BI Engineer

REMEDY BPCI PARTNERS LLC.
New York, NY
04.2017 - 08.2020

Remedy Partners is an innovative healthcare services and technology company specializing in 'bundled payment' programs. Bundled payments are an innovative new payment model that incorporates both financial and performance accountability within episodes of care. Episode payment programs represent an important advance in the organization and financing of health care services in both the public and private sectors.

Responsibilities:

  • Participated in the development lifecycle process by developing ETL code and reports through Pentaho, and collaborated with Product Managers and the QA team.
  • Worked with business teams to understand business reporting and analytic requirements. Involved in the daily stand-up meetings to analyze the new requirements and proposed technical solutions.
  • Converted the legacy JavaScript code, which performs the ETL operations from the application MongoDB to the MySQL database, to Pentaho Data Integration ETL code.
  • Identified the bottlenecks and optimized the ETL code through techniques like parallel partitioning, multi-threading, etc.
  • Implemented the slowly changing dimensions (SCD) type 1 and type 2 to maintain current information and historical information in the warehouse tables.
  • Designed and developed highly interactive reports and dashboards through Pentaho and Tableau.
  • Worked with dependent teams, such as DBA and Admin, on tasks that required another team's assistance.
  • Collaborated with the DevOps team for the successful deployment of code to higher environments. Worked with the QA team to resolve the identified bugs and implement the code fix.
  • Created the technical design documents to define and document the code changes.

Environment:

MySQL 5.7, MongoDB 2.6, Pentaho Data Integration 6.1, Pentaho Reports Designer 6.1, Pentaho Schema Workbench 6.1, Pentaho Dashboard Designer, Tableau 2019.4, etc.

Programmer Analyst

Deloitte (XpertTech Inc)
Mechanicsburg, Pennsylvania
05.2014 - 04.2017

Worked for the three initiatives below:
1. DHS Interactive, Commonwealth of Pennsylvania [Department of Human Services]. Feb 2016 - Apr 2017
DHS Interactive is a new visualization solution to report and track a set of strategic performance indicators across the DHS programs. At a high level, the DHS Interactive solution will contain a set of visualizations to provide stakeholders the ability to view and analyze information across the Secretary's five core strategic priorities. The strategic priorities included as part of the DHS Interactive Data Visualization initiative are as follows:
· Improve Customer Service
· Provide Access to High Quality Services
· Serve More People in the Community
· Increase Employment Opportunities
· Modernize Program Integrity.

2. IRS Form 1095-B project, Commonwealth of Pennsylvania [Department of Human Services]. Oct 2015 - Feb 2016
The Affordable Care Act provides that individuals must either have health insurance coverage throughout the year, qualify for an exemption, or make an individual shared responsibility payment when filing their taxes. IRS Form 1095-B reports certain information about individuals who are covered by minimum essential health coverage, and therefore are not required to make a payment.

3. Healthy PA, Commonwealth of Pennsylvania [Department of Human Services]. May 2014 - Sep 2015
Under Healthy PA, Pennsylvania will extend health care coverage to adults ages 21 through 64 with incomes up to 133 percent of the federal poverty level (FPL) who do not currently qualify for Medicaid. Rather than simply enrolling these individuals into the existing Medicaid program, the state intends to use Medicaid dollars to purchase them coverage through a Private Coverage Option (PCO).

Responsibilities:

  • Participated in requirement gathering, business analysis, user meetings, discussing the issues to be resolved, and translating user inputs into ETL design documents.
  • Created an ER diagram of the data model using Erwin Data Modeler to transform business rules into a logical model.
  • Involved in the extraction, transformation, and loading of data from source flat files and RDBMS tables to target tables. Created reusable transformations and mapplets, and used them in mappings.
  • Used Informatica PowerCenter 9.1/9.0.1/8.6.1 for extraction, transformation, and loading (ETL) of data in the data warehouse.
  • Created complex mappings in PowerCenter Designer using Aggregate, Expression, Filter, Sequence Generator, Update Strategy, SQL, Union, Lookup, Joiner, XML Source Qualifier, and unconnected lookup transformations.
  • Implemented the slowly changing dimensions (SCD) type 1 and type 2 to maintain current information and historical information in the dimension tables.
  • Involved in client interaction, analyzing issues with the existing requirements, proposing solutions, and implementing the same.
  • Optimized the performance of the mappings by various tests on sources, targets, and transformations. Identified the bottlenecks, removed them, and implemented performance tuning logic on targets, sources, mappings, and sessions to provide maximum efficiency and performance.
  • Tuned the performance of Informatica sessions for large data files by implementing pipeline partitioning, increasing block size, data cache size, sequence buffer length, and target-based commit interval, and resolved bottlenecks.
  • Debugged code, tested, and validated data after processes are run in development/testing according to business rules.
  • Prepared unit test plans and maintained defect logs to resolve issues. Worked with the QA team to determine the data validation, and performed the data validation at the source and target database levels.
  • Served as an Informatica administrator, using Repository Manager to create repositories, user groups, and folders, and migrating code from Dev to Test and from Test to Prod environments.
  • Worked on data request tickets and assisted business users (non-technical) to understand the quality of the data.

Environment:
Informatica PowerCenter 8.6.1/9.1, Informatica PowerExchange, Oracle 11g, SQL Server 2005/2008, T-SQL, MS Excel, Windows XP/2003/2008, CA Scheduler.

Education

Master's - Computer Information Systems

Wilmington University
New Castle, DE

Skills

  • ETL tools: Databricks, Palantir, Informatica 9.1/9.0/8.6.1, Pentaho Data Integration 6.0.1, AWS Glue, DataStage 11.3
  • Business intelligence reporting tools: Pentaho Reports Designer 6.1, Pentaho Schema Workbench 6.1, IBM Cognos 10.2.1, Pentaho Dashboard Designer
  • Data visualization tools: Tableau 10.5, Tableau 2019.3, Pentaho Dashboard Designer, Pentaho CTools, QlikView 9.0
  • Databases: Oracle (9i/10g/11g), SQL Server 2005/2008, MS Access, MySQL 5.7, MongoDB 2.6, Postgres, Redshift, DB2 11.5
  • OS: Unix, Linux, Windows
  • Data modeling tools: Erwin, Visio
  • Cloud services: AWS - S3, EC2, Glue, Lambda, EMR cluster, Redshift cluster, SNS, SQS, RDS, Kinesis, Athena, IAM, Secrets Manager, Data Pipeline, etc.; IBM Cloud Services - Cloud Pak for Data 4.0.4, Cloud Object Storage, BigSQL, Data Virtualization, Container Registry, Kubernetes Cluster
  • Programming languages: Python 3.7.5, PySpark, Unix Shell Scripting, and JavaScript
  • Utilities: Bamboo, Bitbucket, Jira, git, Docker images, Helm charts, Azure Repos, Argo CD, Rclone, Terraform, etc.

Certification

Pentaho Data Integration Certified Specialist.
