Anushka Sawant

Brandon, FL

Summary

Detail-oriented and dedicated data engineer with almost 2 years of experience architecting robust data ecosystems and optimizing pipelines. Proficient in ETL processes, data warehousing, and cloud technologies, and aiming to apply that expertise to developing efficient data solutions. Seeking a role within a data-centric team to drive innovation and apply data-driven strategies that support organizational success and growth.

Overview

2 years of professional experience
1 Certification

Work History

Data Engineer

LTIMindtree Pvt Ltd
02.2023 - Current
  • Engineered and maintained critical components of the Anti-Money Laundering (AML) application for a leading financial institution, ensuring robust data processing, analysis, and compliance functionalities.
  • Developed and optimized ETL pipelines to handle large volumes of transactional data, enhancing the application's efficiency by 30%.
  • Developed simple to complex MapReduce streaming jobs in Python, implemented using Hive.
  • Used Apache Spark with Python for data cleaning and reshaping, and generated segmented subsets using Spark DataFrames and pandas.
  • Collaborated closely with Data Deletion and Regulatory (DDAR) compliance analysts and regulatory teams to integrate new data sources and ensure GDPR compliance.
  • Offered comprehensive technical support and solutions to the DDAR team across the North America (NAM) region, ensuring seamless data operations and analytics.
  • Implemented Agile methodologies within cross-functional teams, facilitating daily stand-ups, sprint planning, and retrospectives to enhance collaboration, improve project visibility, and ensure alignment with stakeholder expectations.

Data Engineer Intern

LTIMindtree Pvt Ltd
06.2022 - 12.2022
  • Developed and optimized SQL queries, improving database performance by 40% and enabling efficient data retrieval and analysis.
  • Enhanced data processing speed by optimizing ETL pipelines for efficient data ingestion and transformation.
  • Facilitated seamless integration between disparate systems by developing robust APIs for efficient data exchange between applications.
  • Prioritized scalability in all developed solutions, anticipating future growth and accommodating it through modular design principles.
  • Spearheaded data ingestion and transformation initiatives, achieving a 22% reduction in processing time through streamlined ETL processes and enhanced data cleansing techniques.
  • Used AutoSys to schedule and manage complex data tasks, ensuring seamless automation and execution of data processes.
  • Collaborated in implementing and optimizing data workflows using Talend, enhancing ETL processes for efficient data integration and transformation.
  • Automated manual processes by designing SQL logic and developing solutions to enhance functionality of big data platforms.
  • Utilized Agile frameworks to adaptively manage project scopes, prioritize tasks, and deliver iterative solutions, resulting in increased team efficiency, reduced time-to-market, and enhanced responsiveness to evolving data requirements.

Education

MS in Computer Science

Central Michigan University
Mount Pleasant, MI
12.2022

Bachelor of Science

Thakur College of Engineering and Technology
Mumbai, India
05.2020

Skills

  • Apache Spark
  • Apache Hive
  • Data Curating
  • Data Warehousing
  • SQL Expertise
  • Data Security
  • Data Pipeline Design
  • Data Migration
  • Data Modeling
  • ETL Development
  • Data Integration
  • Hadoop Ecosystem
  • API Development
  • Agile Methodologies
  • Big Data Processing
  • JIRA
  • Release Lifecycle Management
  • Python for data analysis

Certification

Databricks Certified Associate Developer for Apache Spark 3.0

Projects

Sonar Bird Recognition using GCP
08.2022 - 12.2022
  • Developed an ETL pipeline utilizing Google Cloud Platform (GCP) products to extract specific bird sounds from diverse bird recordings.
  • Implemented a conversion process, transforming WAV files into spectrograms to extract unique bird sounds efficiently.
  • Achieved successful differentiation of target bird sounds from varied recordings with an accuracy rate of 90% using spectrogram analysis.
  • Leveraged GCP products for seamless data extraction, transformation, and loading, streamlining the process of isolating distinct bird sounds from multiple sources.
