Michael Tallini

Coram, NY

Summary

Highly motivated Principal NLP Engineer with over 6 years of hands-on experience in developing and deploying cutting-edge Natural Language Processing (NLP) solutions for software applications. Proven track record of leading the design and implementation of NLP algorithms, leveraging cloud platforms to process, analyze, and extract insights from text data. Skilled in containerizing NLP models using Docker and creating REST APIs for seamless integration into existing applications. Expertise in Large Language Models (LLMs) and their application to text analysis and sentiment analysis. Collaborative and innovative, dedicated to revolutionizing customer outcomes through advanced AI-driven technologies.

Overview

6 years of professional experience
1 Certification

Work History

NLP Engineer

90 North
11.2020 - Current
  • Managed ML projects and coordinated with multiple research teams to set requirements
  • Translated technical concepts and information into terms a general audience can understand
  • Researched and tested machine learning algorithms to solve key client problems
  • Designed ML models across the full model lifecycle: experimentation, development, evaluation, deployment, and monitoring
  • Cleaned and processed large text datasets, improving data quality
  • Resolved multiple critical issues, improving model accuracy and efficiency

Dialogue Designer

Google
03.2020 - 11.2020
  • Labeled natural language data for machine learning models
  • Developed taxonomy for error analysis of chatbot conversation data
  • Administered data visualization tools used by team
  • Utilized word vectorization techniques to quantify natural language features for ML model input
  • Wrote regular expressions using Google Data Loss Prevention for PII redaction
  • Conducted computational experiments to measure methodology correlation with performance
  • Cleaned raw text data using NLP techniques
  • Created tools to extract, evaluate, and edit chatbot data structures, reducing workload by 70%

Language Annotator

Lionbridge
11.2019 - 12.2019
  • Provided linguistic expertise for a machine translation project
  • Used annotation software BRAT to tag natural language data
  • Annotated syntactic chunks, verbal arguments, and entities of written text
  • Performed entity mapping of keywords throughout document

Research Intern

Stony Brook University
05.2019 - 09.2019
  • Developed code to aid linguistic research of Minimalist Grammars
  • Optimized metric data storage of Minimalist Grammar trees
  • Analyzed tree metric representations of Minimalist Grammars

Data Analyst Intern

Kasisito
01.2019 - 05.2019
  • Used Python to read in tabular data and apply functions to entries
  • Used NumPy, pandas, spaCy, and NLTK to construct tables, calculate summary statistics, and manipulate text data
  • Analyzed natural language data to determine errors in chatbot model
  • Collaborated with others to identify model errors due to ambiguity
  • Preprocessed language data into trees using the Penn Treebank
  • Developed programs to identify dependencies in language data and use those in feature engineering
  • Constructed tests to analyze accuracy of language model
  • Improved language model accuracy by 12%

Linguist Intern

Stony Brook University
01.2018 - 12.2018
  • Annotated spectrogram data of 9 native English speakers speaking Japanese and English over 8 months
  • Identified steady-state formants of word-initial vowels among participants and how formants changed over time using PRAAT
  • Used Excel to display results and fit the data to linear regression model

Linguist Intern

Stony Brook University
09.2017 - 12.2017
  • Identified phonetic qualities of endangered Spanish dialect Ayacucho
  • Used PRAAT to measure release delay of plosives in Ayacucho and compared it with the standard dialect

Education

Master of Arts - Computational Linguistics

SUNY Stony Brook
Stony Brook, NY
12.2019

Bachelor of Arts - Linguistics

SUNY Stony Brook
Stony Brook, NY
05.2018

Associate of Arts - Liberal Arts and General Studies

Suffolk County Community College
Selden, NY
05.2015

Skills

  • Programming languages: Python, Haskell, Java, R
  • NumPy, pandas, NLTK, Gensim, seaborn, scikit-learn, SciPy, PyMC3, spaCy, TensorFlow, PyTorch, Hugging Face, LangChain
  • AzureML Studio, Looker, Docker, REST API, and Large Language Models (LLMs)
  • Git, Bash, and SQL
  • Building CI/CD pipelines for ML workflows
  • Annotation skills in text and speech data
  • Tokenization, part-of-speech tagging, named entity recognition, sentiment analysis, language modeling
  • Competent in Spanish and Japanese
  • LLM fine-tuning, prompt engineering, knowledge graph construction
  • Machine learning, deep learning, and artificial intelligence

Certification

  • Machine Learning from Coursera
  • Mathematics of Machine Learning Specialization from Coursera
  • Deep Learning Specialization from Coursera
  • Stanford Artificial Intelligence Professional Program
