Summary
Overview
Work History
Education
Skills
Certification
Languages
Timeline
Generic

Sayantrini Saha

San Mateo,CA

Summary

Data Engineer possessing in-depth knowledge of ETL and SAS/Python programming paired with expertise in integrating and implementing new data pipelines. Offering 5+ years background managing various aspects of development, design and delivery of reliable data.

Overview

11
11
years of professional experience
1
1
Certification

Work History

Data Engineer III

Acumen LLC
Burlingame , CA
2019.06 - 2024.05
  • Developed and implemented database designs, data access, and in-house tables to optimize performance and quick retrieval, and also for maintenance of codes.
  • Optimized query performance through indexing, partitioning, and by creating skinny tables.
  • Responsible for developing, maintaining and optimizing data pipeline using PySpark and assist with integrating new data sources or data designs into the company's data management systems.
  • Created stored procedures for automating weekly data pull tasks from Snowflake.
  • Performed ETL on big data and verified data integrity by performing data validation checks across multiple data sources.
  • Documented all changes made to the database structure during maintenance activities on Confluence.
  • Trained new hires in SAS and other non-technical users and answered technical support questions.

Accomplishments:

1. Developed Final Action delta algorithm instead of pulling 100% data, which reduced the query time and also memory and resource utilization. Query time reduced by 8 hours and data processing time reduced by 6 hours.

2. Automated data pipeline using PySpark that replaced SAS code. Validations were performed on Medicaid Statistical Information data and finally generated the National Summary report. This automation reduced the time of report generation from 8 hours to 30 mins per report.

Data Quality Analyst

Revenue Management Systems LLC
Oklahoma City , Oklahoma
2018.04 - 2024.06
  • Worked on process automation and automated parts of the regression testing framework for error reduction in the RMS core application process.
  • Conducted root-cause analysis, defect prediction, defect prevention reporting, and recommendations for mitigations.
  • Worked in an Agile methodology and participated in daily SCRUM rituals such as daily stand-up meetings, sprint planning, sprint retrospective and sprint review.
  • Designed complex SQL and PostgreSQL queries to validate the integrity of database objects such as tables, views and indexes.

Accomplishments: Developed tool in Java and PowerShell scripts to automate the process of moving data from production to staging database which saved time and effort of 12 hours/week.

Systems Engineer

Infosys Ltd.
Bhubaneswar , Orrisa, India
2013.03 - 2016.07
  • Conducted all stages of the software testing lifecycle, starting from the project initiation phase to the post-deployment stage.
  • Processed files from CMS and developed specific test cases for validating the data integrity of these files before sending them to the downstream processes and to other third-party vendors.
  • Worked in Waterfall, V-model, and Agile delivery models.

Accomplishments: Developed tool in VBA and Excel macros that performed compliance checks for Aetna. Aetna achieved CMMI Level 3 for the year 2015. Reduced manual effort and resulted in savings of $150/year.

Education

Master of Science - Industrial Engineering

University of Houston
Houston, TX
2017-12

Skills

  • Big data technologies
  • Database Design
  • Data Acquisitions
  • Data Migration
  • SQL and Databases
  • NoSQL Databases
  • Python Programming
  • Spark in Hadoop environment
  • SAS Programmer
  • RDBMS

Certification

  • SAS Certified Professional: Advanced Programming using SAS 9.4 02/2020
  • SAS Certified Base Programmer for SAS 9 01/2019

Languages

English
Full Professional

Timeline

Data Engineer III

Acumen LLC
2019.06 - 2024.05

Data Quality Analyst

Revenue Management Systems LLC
2018.04 - 2024.06

Systems Engineer

Infosys Ltd.
2013.03 - 2016.07

Master of Science - Industrial Engineering

University of Houston
  • SAS Certified Professional: Advanced Programming using SAS 9.4 02/2020
  • SAS Certified Base Programmer for SAS 9 01/2019
Sayantrini Saha