Karuna Gujar

Nashville, TN

Summary

Dynamic Data Scientist with experience at 66Degrees, specializing in GCP and LLM integration. Successfully automated document processing, enhancing operational efficiency and client value. Adept at collaborating with clients to design tailored solutions, leveraging skills in Python and Terraform to drive impactful results. Passionate about transforming data into actionable insights.

Overview

13
years of professional experience
2
Certifications

Work History

Data Scientist

66Degrees
04.2023 - Current
  • Developed RAG pipelines that integrate large language models (LLMs) for content generation, unstructured data classification, and information retrieval across the real estate, tech, and retail sectors, reducing manual intervention and improving operational efficiency
  • Implemented GCP Cloud Functions and Workflows pipelines that use Document AI (DocAI) processors for a leading real estate agency, extracting information from millions of documents, ingesting it into BigQuery, and reducing document processing turnaround time
  • Built an LLM pipeline leveraging Gemini for a pharmaceutical client to scan, scrape, and extract thematic data from documents, and integrated the extracted results into internal applications using Eventarc, Cloud Functions, Pub/Sub, and BigQuery, automating URL-based data extraction and eliminating manual processes
  • Designed and deployed a conversational AI agent using Vertex AI Agent Builder for a private equity firm to query thousands of documents and answer questions commonly asked during legal due diligence
  • Developed and deployed an image recognition system in Vertex AI for a construction consulting firm to monitor solar installations and track site progress across expansive areas, reducing manual tracking labor by 50%
  • Client Communication & Solution Design: Collaborated effectively with clients to gather detailed requirements, design customized solutions, and produce both business and technical documentation, ensuring all deliverables exceeded client expectations

Data Scientist

Deloitte
02.2022 - 03.2023
  • Implemented data transformation pipelines using BigQuery and Dataflow, orchestrated with Astronomer Airflow
  • Provisioned GCP infrastructure using Terraform and managed CI/CD with Tekton
  • Conducted user requirement analysis, designed data models, and developed recommendation models using collaborative filtering

Application Developer

Vanderbilt University Medical Center
08.2015 - 06.2019
  • Developed modules and a graphical user interface for a Java-based tool that automates free-text annotation using natural language processing techniques
  • Technologies: Java
  • Libraries: Unified Medical Language System (UMLS), Lexical Variant Generation (LVG)
  • Designed and developed a toolkit used to construct a medical chronology, tracking temporal patterns across events, episodes and trends of interest by mining the electronic health record
  • Technologies: Java, Python
  • Framework: UIMA

Research Assistant

Vanderbilt University
08.2014 - 08.2015
  • Developed a stand-alone graphical user interface for a scientific software package used in drug discovery, allowing bench chemists to interact with the software
  • Embedded the BCL (Bio Chemical Library) software written in C++ into an external Java application for drawing and viewing molecules (JChemPaint)
  • Technologies: Java Swing, Java Native Interface (JNI)

Software Engineer

Atos India Pvt. Ltd.
09.2012 - 04.2014
  • Developed web applications, covering front end, back end, and data model development, for online order creation, order tracking, warehouse management, and resource allocation for leading businesses in the consumer industry
  • Technologies: Java, Servlets, JSP, JavaScript, AJAX, HTML, CSS, SQL

Education

B.S. - Electronics and Telecommunication

University of Pune
India
01.2018

Master of Science - Computer Science

Middle Tennessee State University
Murfreesboro, TN

Skills

  • Google Cloud Platform (GCP)
  • Compute Engine
  • App Engine
  • Kubernetes Engine (GKE)
  • Cloud Functions
  • Cloud Run
  • BigQuery
  • Cloud Datastore
  • Firestore
  • Cloud Storage
  • Dataflow
  • Pub/Sub
  • Cloud Composer (Apache Airflow)
  • BigQuery ML
  • AutoML
  • Vertex AI
  • Looker
  • Agent Builder
  • DocAI Warehouse
  • Agentspace
  • Python
  • SQL
  • Java
  • Terraform
  • UMLS
  • LVG
  • FastAI
  • Scikit-learn
  • Keras
  • TensorFlow
  • UIMA
  • Stanford CoreNLP
  • Gensim

Certification

  • Google Professional Data Engineer
  • Google Professional Machine Learning Engineer
