Kai Tarafdar

Speech And Language Machine Learning Engineer

Overview

6 years of professional experience
5 years of post-secondary education

Work History

Data Lead

Abodoo Ltd
03.2024 - Current
  • Led and orchestrated end-to-end data pipelines, from data extraction and processing to deployment and visualization, using Python, cloud storage (Azure, GCS), and databases.
  • Managed and optimized data workflows across multiple pipelines with complex interdependencies, ensuring seamless integration and automation of data processing tasks.
  • Developed and maintained taxonomies and ontologies, leveraging linguistic insights and NLP techniques (including BERT, LSTMs, and LLMs) to improve data categorization and semantic understanding.
  • Designed and implemented data visualization backends for real-time analytics dashboards using Python, enhancing decision-making through actionable insights.
  • Mentored junior team members on best practices.
  • Collaborated with cross-functional teams to ensure that data workflows and analytics solutions met business needs and supported data-driven decision-making.

Freelance AI Engineer

Seedext
10.2023 - 02.2024

Developed an in-house LLM chat application.

NLP Engineer

Pixels Ltd
09.2023 - 10.2023

Improved the NLP-based matching engine. Expanded the product to multilingual suites.

Contractor - Data Team

Bud
05.2023 - 07.2023

Improved classification models through data optimization.

AI Developer - Speech Recognition

Data Unblocked
03.2023 - 05.2023
  • Designed, developed, and implemented an accent classification model.
  • Performed in-depth EDA of large speech datasets to identify key distinguishing speech features.

Machine Learning Engineer - Text-to-speech

Audioblogs Inc.
10.2022 - 03.2023
  • Created a novel algorithm to generate a standardized, clean lexicon from open-source lexical resources.
  • Wrote a text normalization and feature extraction library for the text-to-speech pipeline.
  • Optimized the text-to-speech backend to reduce inference time.

Natural Language and Speech Processing Expert

ProjectPro
06.2022 - 03.2023
  • Developed end-to-end NLP and STT projects for production or academic use.
  • Built a retrieval-based chatbot from only a small number of chat transcripts by fine-tuning LLMs.
  • Built a speech-to-text model achieving SOTA WER with a small amount of speech data by leveraging transfer learning.
  • Documented projects in technical manuals for clients to use in future projects.
  • Created audiovisual presentations and guides to accompany the codebase and documentation.

NLP Engineer

BeyondWords
12.2020 - 06.2022
  • Led an interdisciplinary team of linguists and NLP engineers in the creation, expansion, and maintenance of core NLP products.
  • Created tools to assist in rapidly expanding NLP corpora, for use by technical and non-technical users.
  • Used classical, recent, and novel methods to process, extract, and classify language features for machine learning.
  • Collaborated on the development and optimization of an end-to-end custom speech synthesis model.

Speech-to-text Engineer

Apple
07.2019 - 07.2020
  • Played a key development role in the very early R&D phase of a core new Apple Podcasts feature: real-time speech-to-text transcription and information retrieval.

Computational Linguist

Apple
01.2019 - 07.2020
  • Improved and expanded core NLP resources and tools for English locales used with the Siri personal assistant.

Education

Master of Science - Speech And Language Processing

The University of Edinburgh
09.2017 - 09.2018

Bachelor of Arts - English Language And Linguistics

The University of Roehampton
09.2012 - 05.2016

Skills

TensorFlow

PyTorch

Keras

NumPy

Pandas

scikit-learn

Python

Hugging Face

Lightning

Kaldi

Git

GCP

AWS

NLTK

spaCy

SQL

Cosmos DB

Apache Airflow

GitHub Actions

CI/CD
