Neha Satish Sawant

Fremont, CA

Summary

Data Engineer with over five years of experience designing, implementing, and maintaining robust data infrastructures and pipelines. Proficient in Python, SQL, Scala, and machine learning, with a specialization in efficient ETL processes that ensure data accuracy and contribute to impactful business insights. Skilled in distributed data processing with tools such as Apache Spark and Hadoop. Recognized for strong collaboration and adaptability to dynamic project requirements, consistently delivering reliable solutions that enhance data integrity and support informed decision-making.

Overview

8
years of professional experience

Work History

Data Engineer

S&P Global
USA
05.2024 - Current
  • Identified and curated diverse data sources, including databases, APIs, and streaming platforms, for predictive analytics initiatives, using Python, NumPy, and pandas for data exploration and preliminary analysis.
  • Implemented robust ETL pipelines with Apache Airflow DAGs, integrating data connectors and scheduling regular ingestion jobs for seamless data flow, and used SQL for data querying and manipulation.
  • Established and optimized Snowflake data warehouse infrastructure, tailoring data schemas and implementing versioning mechanisms to ensure efficient data querying and governance, using SQL for database management.
  • Performed meticulous data cleaning and preprocessing with Python libraries including pandas, NumPy, and SciPy to uphold data validation and accuracy standards. Used MS Excel for exploratory data analysis.
  • Collaborated with cross-functional teams to gather requirements and deliver data solutions aligned with business objectives.

Software Engineer

REON Technologies Inc
Lowell, MA
05.2023 - 04.2024
  • Engineered and implemented a Python-based graphical data display application using Matplotlib and Tcl/Tk, optimizing laboratory data visualization.
  • Developed seamless data transfer solutions for sensor data to web API clients, significantly reducing manual effort from 3-4 hours to 30 minutes.
  • Leveraged Python data analysis tools (Matplotlib, Plotly, Pandas, NumPy) to extract, manipulate, and visualize collected data.
  • Mentored junior developers, providing guidance on best practices and coding standards.
  • Led code reviews, fostering a culture of continuous feedback and knowledge sharing among peers.

Data Analyst

Phoenix Technologies
Mumbai, India
05.2020 - 08.2022
  • Implemented robust data ingestion pipelines utilizing AWS services, Lambda functions, and Python scripting, leveraging libraries such as NumPy and pandas for advanced data manipulation and analysis.
  • Collaborated with interdisciplinary data teams to address complex queries related to customer management, employing SQL for efficient data retrieval and analysis.
  • Played a pivotal role in training and onboarding 30+ new employees, ensuring proficiency in Python, SQL, and data analytics tools for seamless project integration.
  • Designed and developed intuitive reporting dashboards and visualizations using Power BI and Excel, enabling stakeholders to derive actionable insights on a daily and weekly basis.
  • Managed a 125-member team to achieve a 20% reduction in save/load time and a 15% improvement in overall operation time through strategic resource allocation and process enhancements.

Spatial Data Specialist

HERE Technologies Pvt Ltd
Mumbai, India
06.2018 - 09.2019
  • Spearheaded the conversion of physical maps into digital formats, optimizing computer usage and streamlining project workflows.
  • Applied advanced spatial data analysis using ArcGIS and MapInfo, developing structured data models for precise insights and informed decision-making.
  • Led and mentored a 25-member team, ensuring timely project execution and issue resolution, fostering a collaborative and high-performing environment.
  • Implemented daily checks and feedback mechanisms, reducing labor costs by 30% while maintaining superior project deliverables through stringent quality assurance measures.

Education

Master of Science - Information Technology

University of Massachusetts
Boston, USA
12.2023

Bachelor of Engineering - Electronics and Telecommunications Engineering

University of Mumbai
Mumbai, India
05.2017

Skills

  • Programming Languages: (Python, R, Java, SQL, Unix/Shell Scripting)
  • Big Data Tools: (Hadoop architecture, HDFS, MapReduce, Hive, Sqoop, Oozie, EMR, Spark, Pig, Lambda functions)
  • Frameworks & Tools: (NumPy, Scikit-learn, Pandas, Seaborn, Power BI, Tableau, MS Excel, MS Access, PySpark, Airflow, ETL, Jira)
  • Datastores and Cloud: (Amazon Web Services (AWS), Microsoft Azure, MySQL, Oracle, MongoDB, PostgreSQL, Apache Kafka, Snowflake, Databricks)

Timeline

Data Engineer

S&P Global
05.2024 - Current

Software Engineer

REON Technologies Inc
05.2023 - 04.2024

Data Analyst

Phoenix Technologies
05.2020 - 08.2022

Spatial Data Specialist

HERE Technologies Pvt Ltd
06.2018 - 09.2019

Bachelor of Engineering - Electronics and Telecommunications Engineering

University of Mumbai

Master of Science - Information Technology

University of Massachusetts