Reshma Shaik

Bayonne, USA

Summary

10 years of experience as a Data Engineer and Software Development Engineer, with expertise in the extraction, transformation, and analysis of data using PySpark and Hadoop ecosystem technologies. In-depth knowledge of the Spark and Hadoop architecture and its components, such as HDFS, Hive, SparkContext, Spark SQL, and Spark Streaming. Good knowledge of the P&C insurance and CPG domains, gained from working with various clients.

Overview

10 years of professional experience

Work History

Data Engineer

Tata Consultancy
07.2018 - Current

Overview: Tata Consultancy, New York, USA

  • Built a data ingestion tool in PySpark that ingests data from sources such as APIs, Teradata, files, and Kafka
  • Addressed challenges during the proofs of concept that preceded the tool: updating the structure of existing tables in Azure Data Lake/Hive to match incoming data for incremental loads, converting data types, finding error records in file ingestions, and extracting data from complex APIs
  • Made ingestions 20 times faster than the existing vendor tool (StreamSets) by using multiple cores based on the size of the data each pipeline handles
  • Optimized SQL queries to read tables in partitions, achieving better performance through parallel reads
  • Solved data manipulation challenges on data stored as Parquet and Avro files
  • Developed PySpark and SQL scripts in Azure Databricks to create state-specific sales reports, which the business uses to analyze overall sales and to serve customers based on their report history
  • Worked on an Azure data pipeline to configure data loads from SAP S4
  • Imported data from various sources, performed transformations using Spark, and stored the results in Hive and S3 buckets
  • Scheduled TWS/Airflow jobs that run multiple Spark jobs through shell scripts, each running independently based on time and resource availability
  • Worked flexible hours across night, weekend, and holiday shifts
  • Participated in team projects, demonstrating an ability to work collaboratively and effectively
  • Handled critical issues across the portfolio
  • Consulted regularly with the customer on project status, proposals, and technical issues
  • Sound knowledge of the ITIL framework, including change and incident management
  • Monitored and supported jobs running through the ESP/Control-M/Autosys schedulers
  • Created weekly reports for business meetings

Mainframe Developer

NTT DATA Services
01.2015 - 07.2018
  • Company Overview: NTT DATA, Hyderabad, India
  • Participated in the full Software Development Lifecycle (SDLC), from requirement analysis and documentation (functional specifications, technical design) through coding and testing (preparing and executing test cases) to maintenance of applications.
  • Handled requests from onsite SMEs (subject matter experts), received in the form of Request Module Project Organizer (RMPO).
  • Executed work requests involving analysis, coding, testing, implementation, and documentation of rate changes for the current effective date for each state.
  • Issued policies in UAT and other test environments to achieve the desired rates for a particular LOB.
  • Evaluated and implemented enhancement design solutions to improve cost, quality, and performance of software applications.
  • Developed and executed complex system test plans to support quality.
  • Collaborated with business lines to understand business requirements and participated in developing SDLC documentation.
  • Tested and deployed scalable and highly available software products.
  • Collaborated effectively with cross-functional teams to deliver high-quality mainframe software projects on time.
  • Conducted comprehensive user training sessions on new mainframe features and functionalities; streamlined adoption process for end users.
  • Troubleshot business logic and performance issues in existing mainframe applications.
  • Implemented CICS transaction management strategies to optimize performance within high-traffic environments.

Education

Bachelor of Technology - Electronics and Communication Engineering

Jawaharlal Nehru Technological University Kakinada
Kakinada, Andhra Pradesh
05.2014

Skills

  • Python
  • PySpark
  • COBOL
  • Shell/Perl scripting
  • Java
  • C
  • Azure Data Lake
  • Azure Databricks
  • Agile
  • Data Integration
  • Spark
  • Teradata Studio
  • Oracle Database
  • SQL
  • Hive
  • MongoDB

Accomplishments

  • Received numerous expressions of appreciation from various clients for outstanding performance.
  • Participated in the TCS-HACKATHON and was awarded first place.
