CHINMAY THAKARE

Chicago, IL

Summary

Results-driven Senior Data Engineer specializing in the design and scaling of Lambda and Medallion architectures within the vehicle insurance domain. Expert in optimizing distributed systems and implementing real-time streaming solutions that drive significant cost savings. Proven track record of leveraging LLMs for automated code migration and re-engineering legacy pipelines into high-performance, incremental workflows.

Overview

8 years of professional experience

Work History

Senior Data Engineer

CCC Intelligent Solutions
Chicago, USA
09.2021 - Current
  • LLM-Driven Automation: Leveraged Large Language Models (LLMs) to automate the conversion of Spark scripts into Flink streaming jobs, successfully migrating 20+ jobs and saving weeks of manual engineering effort.
  • Streamlining & Cost Savings: Engineered a streaming solution using Apache Flink to replace Oracle GoldenGate, achieving $40,000/year in licensing savings and reducing data reload latency from days to hours.
  • Architectural Modernization: Architected a Lambda data lake using Apache Hudi for the batch/silver layer. Transitioned from full table reloads to incremental processing, reducing legacy pipeline runtime from 40 hours to 15 hours.
  • Infrastructure Optimization: Scaled down EMR nodes from 10 to 1 while maintaining performance by identifying over-provisioned jobs and implementing bucketing, partitioning, and salting to eliminate data skew.
  • Reliability & CI/CD: Implemented automated data quality checks using Great Expectations, ensuring 99.9% data reliability. Streamlined deployment cycles by implementing CI/CD pipelines for seamless environment migrations.
  • Domain Expertise: Led data engineering for vehicle insurance platforms, managing high-volume datasets including first-party and third-party insurance claims.
  • Leadership: Acted as a technical lead, conducting code reviews and mentoring junior engineers on distributed computing best practices. Collaborated with architecture teams and customers to resolve critical bugs.

Senior Spend Data Analyst

GEP Worldwide
Mumbai, India
08.2018 - 09.2019
  • Implemented a spend analytics solution for a Fortune 500 client, managing 12+ million annual transactions and $6+ billion in spend to enhance data-driven decision-making.
  • Automated ETL processes using SQL stored procedures and machine learning models, reducing project turnaround time from 7 days to 2 days.
  • Analyzed data trends to inform strategic decision-making for client projects.
  • Developed dashboards using visualization tools to present actionable insights.
  • Collaborated with cross-functional teams to gather data requirements and specifications.

Education

Master’s - Information Technology and Management (Data Management & Analytics)

Illinois Institute of Technology
Chicago

Bachelor’s - Electronics & Telecommunication Engineering

NMIMS University
Mumbai

Skills

  • Data pipeline design
  • Python
  • PySpark
  • Spark SQL
  • Hudi
  • Kafka
  • Apache Airflow
  • Apache Flink
  • Hive
  • PostgreSQL
  • AWS EMR
  • Docker
  • CI/CD
  • Data Quality
  • Monitoring
  • Observability
  • Tableau

Timeline

Senior Data Engineer

CCC Intelligent Solutions
09.2021 - Current

Senior Spend Data Analyst

GEP Worldwide
08.2018 - 09.2019

Master’s - Information Technology and Management (Data Management & Analytics)

Illinois Institute of Technology

Bachelor’s - Electronics & Telecommunication Engineering

NMIMS University