Himani Tawade

Austin, TX

Summary

Senior engineering professional with deep expertise in data architecture, pipeline development, and big data technologies. Proven track record in optimizing data workflows, enhancing system efficiency, and driving business intelligence initiatives. Strong collaborator, adaptable to evolving project demands, with a focus on delivering impactful results through teamwork and innovation. Skilled in SQL, Python, Spark, and cloud platforms, with a strategic approach to data management and problem-solving.

Overview

10 years of professional experience

Work History

Sr. Data Engineer

CGI Technologies
06.2025 - Current
  • Collaborated with cross-functional teams to define data requirements and architecture.
  • Established standard procedures for version control, code review, deployment, and documentation to ensure consistency across the team's work products.
  • Evaluated new technologies to improve data processing efficiency and effectiveness.
  • Reengineered existing ETL workflows to improve performance by identifying bottlenecks and optimizing code accordingly.
  • Acted as a trusted advisor for clients by providing thought leadership on best practices in data engineering, ensuring their systems were optimized for performance and scalability.
  • Designed and implemented scalable data pipelines to support business intelligence initiatives.

Data Engineer

Visual Consultants
03.2025 - 06.2025
  • Developed scalable data pipelines to process large datasets efficiently.
  • Collaborated with cross-functional teams to design and implement data architecture solutions.
  • Optimized ETL processes to enhance data retrieval and reporting accuracy.
  • Mentored junior engineers on best practices for data modeling and database management.

Data Engineer

SID Global Solutions
04.2024 - 02.2025
  • Project 1: Amazon MSP Doozer
  • Configured AWS VPC and IAM roles for secure access control.
  • Developed data backup jobs to transfer data from PostgreSQL to Redshift.
  • Automated ETL workflows using AWS Glue for efficient data migration.
  • Optimized Redshift for analytical processing and reporting.
  • Integrated CloudWatch for job monitoring and failure alerts.
  • Ensured data consistency with validation and reconciliation checks.
  • Project 2: SIDGS
  • Developed data pipeline on GCP to support POC for company-wide resource utilization tracking.
  • Extracted data from Jira and SharePoint, performing transformations using event-based Cloud Functions and orchestration with Airflow.
  • Developed an LLM-based chatbot with text-to-SQL capabilities for the multi-connector bot “SQL Agent”.
  • Implemented Python-based components for a CODY bot to run generated code on BigQuery.

Data Engineer

MetLife
12.2022 - 04.2024
  • Project 1: MetLife Data Streaming
  • Developed an end-to-end ETL solution for internal API data, moving data from S3 to Redshift via AWS Glue.
  • Implemented ad-hoc analysis with Athena and streamlined pipeline management with Airflow.
  • Facilitated data visualization using QuickSight.
  • Mapped diverse data sources to ensure seamless integration into a standardized format for analysis.
  • Project 2: LLM Analysis for Reviews
  • Designed a comprehensive data engineering pipeline for the Yelp reviews dataset.
  • Integrated Kafka for streaming, OpenAI LLM for sentiment analysis, and Elasticsearch for data indexing.
  • Connected Kafka topics to Kibana for advanced visualization.

Senior Data Engineer

Tata Consultancy Services
07.2018 - 07.2021
  • Project 1: UIDAI
  • Transformed conventional DBMS solutions into a real-time batch-processing pipeline.
  • Collected data from multiple sources, transforming it using Spark jobs and automating with cron jobs.
  • Pushed final output into Kafka topics and visualized data using Druid, Elasticsearch, and Kibana.
  • Improved reporting efficiency by 47% through automation.
  • Project 2: Nielsen
  • Created a data pipeline for web applications, extracting data using Python modules and transforming it with PySpark.
  • Documented processes using Confluence and followed agile development practices.
  • Resolved critical issues in PySpark modules, improving data processing time by 30%.
  • Developed solutions for various ad-hoc requests involving data transformations and migrations.

Data Engineer

CGI Group Inc
11.2015 - 06.2018
  • Project 1: Shell Smart Connect
  • Created an anomaly detection system to monitor petroleum sites in real-time.
  • Ingested IoT sensor data into Cassandra, using statistical methods for anomaly detection and visualization.
  • Developed ETL pipelines to move IoT sensor data into Azure Data Lake for analysis.
  • Project 2: Shell Cards
  • Engineered file-splitting mechanisms to identify failed transactions in text files.
  • Generated reports on transaction data and developed a data mart for robust reporting.
  • Automated data refresh processes using cron jobs and shell scripts, enhancing efficiency.

Education

Master of Science - Computer Science

California State University
Fullerton, CA
06.2023

Skills

  • Programming Languages and Packages: Python, Advanced SQL, NumPy, Pandas, PySpark, SQLAlchemy, Openpyxl, Pyaes, OpenAI
  • Big Data and Cloud: Spark, Apache Kafka, Apache Flink, Databricks, EC2, AWS Glue
  • GenAI: OpenAI, Llama 2, Mistral, and PandasAI models for text-to-SQL, L2 support agent, and document analyzer use cases
  • Databases, Data Warehouses, and Storage: MSSQL, Postgres, AWS RDS, Elasticsearch, Redis, MongoDB, Firebase, Druid, AWS S3, Azure Data Lake, Snowflake, Amazon Redshift, Google BigQuery
  • Data Modeling: Dimensional and ER modeling, including star schemas, snowflake schemas, and data normalization techniques
  • ETL/ELT Tools: Apache Spark, Talend, AWS Glue, EMR, Debezium, dbt
  • Version Control and CI/CD: Git, Jenkins
  • Containerization: Docker, Kubernetes
