Proven expertise as a Data Engineer with 2 years in data integration and pipeline development, coupled with 3 years of experience in HR/Recruitment Analytics and 5 years in Research and Development. Adept at leveraging advanced technologies to streamline data processing and enable real-time, data-driven decision-making. Skilled in managing end-to-end project lifecycles for both on-premises and cloud-based applications, utilizing traditional and agile methodologies. Renowned for solving complex data challenges, fostering collaboration with stakeholders and vendors, and consistently delivering projects on time in high-pressure, deadline-driven environments.
Overview
16
16
years of professional experience
1
1
Certification
Work History
Data Engineer
RWJBarnabas Health
USA, Mar 2023-present
03.2023 - Current
Migrated 10TB of data from Microsoft Azure to Snowflake database using Snowpark
Worked on Snowpark and developed a custom python script to automatically load thousands of orc and parquet files present in the snowflake internal stage into thousands of tables
Successfully managed and executed end-end project from initial project planning to production deployment within the timeline in the Galen Project
Worked on data migration from Inter Systems Cache Database to Snowflake using custom Python script that loads all the schema tables
Successfully managed and ensured the deliverables of the project delivery and interacted with the customer teams on resolving issues
Used ETL tools such as Talend and Informatica IICS to build data pipelines and to carry out data transformations
Used HVR, a replication software to replicate the data and capture CDC changes
Monitoring and creating dashboards on patient data helped in understanding the overall patient experience and helped to streamline the process and improve patient satisfaction
Designed, developed, and maintained daily and monthly summary, trending, and benchmark reports in Tableau Desktop
Good hands-on experience using COPY command, bulk loading from the internal stage and external stage (AWS S3) to the snowflake cloud
Using Snowsql to load data from the internal stage into snowflake tables
The internal stage files were validated using the COPY, LIST, PUT, and GET commands
Created ETL (Extract, Transform, Load) pipelines and data migration scripts in IICS and Talend to move data from local to Snowflake
Performed Data Analysis tasks like Profiling, Validating and Cleansing data
Ensured code quality and maintainability by implementing best practices for version control, testing, and documentation throughout IICS & Snowflake projects
Prepared detailed functional/technical specifications, and ETL Design documents.
Data Science Extern
Rutgers Externship Exchange Program
Scotch Plains, NJ
09.2022 - 12.2022
Managed the entire project individually, starting from data gathering to the modeling phase.
Involved in collecting the pedestrian/bicyclist incidents dataset and merging other datasets, considering factors such as weather and road conditions, for the analysis using the Pandas library.
Worked on data cleaning and ensured data quality, consistency, and integrity using Pandas and NumPy. Dealt with missing values by removing unnecessary features and replacing missing data with statistical methods.
Performed exploratory data analysis (EDA) to identify possible data cleaning and patterns on the dataset and visualize them.
Explored and visualized the data to get descriptive and inferential statistics for a better understanding of the dataset. Data analysis was carried out with the help of graphs using Matplotlib and the Seaborn library.
Performed feature engineering process by converting categorical and numerical variables using encoding methods. Completed negative sampling technique to balance the dataset, as no samples represented non-accidents.
Built predictive models, including Logistic Regression, Random Forest, Decision trees, AdaBoost, and XGBoost, to predict the occurrence of an accident using Python Scikit-learn.
Evaluated the performance of the different machine learning algorithms based on various performance metrics.
Enforced F-Score, ROC, Confusion Matrix, Precision, and Recall are used to evaluate the performance of different models. Collected feedback and retrained the model to improve performance.
Junior Manager
Syngene International Ltd
India, Jun 2014 - Aug 2017
06.2014 - 08.2017
Led on-boarding of 50 new hires and organized induction sessions with different business functions
Created reports, data visualizations and presented analysis and interpretation for operational and business review and planning
Worked alongside teams internally within the business team in a professional manner to establish business needs
Created reports, data visualizations and presented analysis and interpretation for operational and business review and planning
Worked alongside teams internally within the business team in a professional manner to establish business needs
Developed HR recruitment reports to organize and present information using different data sources and presented data in a visually effective format using data visualization techniques
Collaborated with HR leaders, business stakeholders on designing and building metrics and reports and provided internal customer service
Performed quantitative analysis using questionaries and surveys to improve recruitment process by increasing candidate experience by 10%
Reduced recruitment costs by 20% by effectively negotiating pricing and fees with vendors
Streamlined the hiring process utilizing the gathered data and reduced recruitment costs by 20%
Accelerated recruitment process and accomplished the goal of hiring 200+ candidates within one year
Worked creating Aggregations, calculated Fields, table calculations, Totals, percentages using Key Performance Indicators (KPI) and Measures
Collaborated with key stakeholders including C-level executives, clients, and vendors, to strategically plan to retain resources and fulfilling hiring needs, enhancing stakeholder satisfaction
Facilitated regular stakeholder meetings, ensuring alignment on program goals and fostering transparent communication across all levels of the organization
Led end-to-end recruitment cycle for departments, closing 250 positions across departments.
Senior Research Associate
Syngene Intl Ltd
India, Oct 2008 - Jun 2013
10.2008 - 06.2013
Managed Merck Sereno and Evotec projects end-end from the initial planning phase of the project to final delivery of the medicinal compounds
Maintained databases of the compounds and their specifications/details to keep a track of the synthesized compounds
Involved in report writing and protocols and designed shipment templates for various targets and library products
Involved in preparing and shipping medicinal compounds
Achieved a target goal of synthesizing 1000 compounds within 4 months
Worked with product development teams to improve product quality.