Summary
Overview
Work History
Education
Skills
Websites
Certification
Timeline
Generic

NIKHIL NATESH

Parsippany,NJ

Summary

Highly motivated and accomplished Senior Data Engineer with 4+ years of hands-on experience in implementing cutting-edge data-driven solutions to drive business growth and efficiency. Skilled in data modeling, data warehousing, data mining, and machine learning techniques to build robust analytical models to deliver actionable insights and solutions for complex business challenges.

Overview

4
4
years of professional experience
1
1
Certification

Work History

Senior Data Engineer

Globex IT Solutions
09.2022 - Current
  • Transitioned the ETL processes from Stored Procedures to Databricks notebooks, leveraging Apache Parquet for enhanced distributed processing
  • Applied in domestic operations and expanding to international regions, enhancing processing efficiency and scalability
  • Developed a comprehensive Data Warehouse architecture, incorporating a Star Schema with meticulously designed Dimensions and Fact tables, and data mapping in Parquet files on ADLS
  • This design supports scalable and efficient data management
  • Engineered an end-to-end Kyligence model in the Kyligence UI, facilitating rapid access to data cubes in Excel for business users
  • This included loading data, creating snapshots, indexing, and formulating complex aggregation logic in Calculated Columns and MDX expressions for hierarchical data representation
  • Processed and transformed 5-7 terabytes of data using Databricks notebooks, employing PySpark and SparkSQL
  • This data was then efficiently loaded into ADLS via Azure Data Factory (ADF) pipelines
  • Further, I optimized existing Databricks code, achieving substantial reductions in runtime, thereby delivering faster insights to the business
  • Conducted root cause analysis of production defects, reported by business stakeholders
  • Developed and executed strategic plans with the team to address these issues, significantly enhancing user experience and system reliability
  • Automated the integration of Actuals with Projection processes, significantly reducing manual effort and time for the business
  • This automation allows direct access from the UI, eliminating dependency on the IT team and streamlining business operations
  • Mastered collaborative skills by working closely with business and QRM teams under an agile methodology
  • This involved active participation in decision-making and fostering effective partnerships with product owners and third-party service providers.

Data Engineer

BMW
01.2022 - 07.2022
  • Designed, implemented, and maintained ETL jobs using the IBM DataStage tool
  • Involved in data extraction and preparation, mapping between source and destination, automation, job scheduling, and streamlining data exchange across the organization for distinct data sources like relational databases, flat files, and data lakes
  • Developed a new feature on the banking website that allowed customers to view their spending breakdown by category through a comprehensive dashboard and included quarterly reports and comparisons to the previous year using Python and SQL for data processing, extraction, and visualization
  • This helped customers to gain insights into their spending habits and to make informed financial decisions
  • Interacted with potential clients to gather requirements and to document them; conducted gap and fitment analysis (at a functional level) to effectively address clients' concerns
  • Worked on an effective customer segmentation project for a bank using clustering techniques such as K-Means++, Hierarchical, and DBSCAN, to accurately segment customers and provide personalized offers, improved customer retention by 20%, and reduced credit risk by 15%.

Software Engineer

Vertex Enterprises
09.2019 - 12.2020
  • Worked on a Fiber network management project to analyze network failure using Python's data wrangling and visualization approach, to identify potential issues and prevent downtime
  • This proactive approach helped the team to investigate future network failures and improve network management
  • Assisted the team in transferring data from on-premises to AWS cloud using services like AWS S3, DataSync, & Direct Connect
  • Implemented AWS IAM policies for access control & security and monitored the data in the AWS environment to guarantee a smooth migration process.

Education

Master's in Information Systems & Technology -

Wilmington University
New Castle, DE
08.2022

Bachelor's in Mechanical Engineering -

Geetanjali College of Engineering & Technology
Hyderabad, India
09.2019

Skills

  • Languages : Python (Pandas, NumPy, SciPy, Scikit-learn, NLTK, Matplotlib, Seaborn, Plotly, etc), R, SQL, PL/SQL, Java, HTML, CSS
  • Big Data Stacks : ETL, Spark (PySpark, SparkSQL), Apache Hadoop (HDFS, MapReduce, YARN), IBM DataStage, Informatica PowerCenter, Airflow, Kafka, Alteryx, Tableau, Power BI, MS Excel etc
  • Databases : MySQL, MS SQL Server (SSMS), PostgreSQL, Oracle DB, Mongo DB, Hive, Cassandra DB, etc
  • Cloud : Kyligence, Azure Databricks, ADF, ADLS, AWS EC2, IAM, VPC, S3, RDS, Athena, Redshift, EMR, SageMaker, etc

Websites

Certification

AWS Certified Cloud Practitioner,

Business Analyst using Power BI.

MS SQL Database Administrator.

Timeline

Senior Data Engineer

Globex IT Solutions
09.2022 - Current

Data Engineer

BMW
01.2022 - 07.2022

Software Engineer

Vertex Enterprises
09.2019 - 12.2020

Master's in Information Systems & Technology -

Wilmington University

Bachelor's in Mechanical Engineering -

Geetanjali College of Engineering & Technology

AWS Certified Cloud Practitioner,

Business Analyst using Power BI.

MS SQL Database Administrator.

NIKHIL NATESH