Tech: Led synthetic data initiatives with an emphasis on model and data evaluation
Sales engineering: Signed the first paid pilot customer in addition to managing the deliverable
Evangelism: Moderated or was a panelist in 4 events that I organized across 4 cities and started planning a deeplearning.ai short course
Director of Data Science
Spectrum Labs
Remote
04.2022 - 09.2023
Managed a team that developed and maintains solutions for content moderation across platforms and clients such as social, gaming and dating apps
Coordinated with our Data Operations team
Developed a novel approach to machine labeling and synthetic data generation to improve model performance
Facilitated hiring top talent for 3 openings
VP, Data Science at IQT Labs
In-Q-Tel
Menlo Park, CA
11.2020 - 04.2022
Managed timely completion of 5 collaborative projects across 8 direct reports
Worked across diverse deep learning research domains: ensemble system for deepfake detection, multimodal identity intelligence, robust models via GAN-enhanced synthetic data, image geo-localization
Briefed mission partners and executive staff
Coordinated the interview process and hired top talent for two open roles
Senior Data Scientist at IQT Labs
In-Q-Tel
Menlo Park, CA
04.2020 - 11.2020
Led a 4-member project team in developing an SBERT claim-matching system for scaling fact-checking around COVID.
Briefed several groups on our results, including mission partners, executive staff, and a workshop presentation by a team member
Data Scientist at IQT Labs
In-Q-Tel
Menlo Park, CA
10.2018 - 04.2020
Led a project in Russian-English machine translation quality estimation, contributing a novel dataset and manuscript to the WMT workshop at EMNLP
Contributed to a system investigating model security
Education
Ph.D. - Neuroscience
University of Maryland, Baltimore
Baltimore
12-2015
Bachelor of Arts - Behavioral Biology
Johns Hopkins University
Baltimore, MD
05-2006
Skills
Languages & Packages
Python
Numpy
Pandas
Sklearn
PyTorch
Transformers
OpenAI API
Git
____________________
Technical domains
Language modeling
Generative AI
Natural Language Processing
Machine Translation
Content moderation
Multilingual language modeling
Synthetic data
Machine data labeling
Model evaluation
Data quality
____________________
Non-technical skills
Recruiting, leading, and managing teams
Product - PMF, GTM strategy
Evangelism: Blogs, talks, events, courses
Sales engineering
____________________
Human language proficiency
fluent in English and Russian
conversational in French
basic Spanish
enough to be helpful for NLP: German, Hebrew, Mandarin
learning Brazilian Portuguese
Additional experience
Insight AI Fellowship, Spring 2018
Postdoctoral Fellow, UC Berkeley, 2016-2018
Research Assistant, Princeton University, 2010-2011