Summary

Overview

Work History

Education

Skills

Publications and Patents

Timeline

Kai Yuan

Sammamish,WA

Summary

14 years of work journey in industry-leading companies, proficient in cutting-edge data processing and ML technology; extensive large-scale ML experience in the search and recommendation domain; 3+ years of tech-leading and mentorship experience in Amazon and Apple; led 2 research areas that resulted in $1.3 billion Gross Merchandize Value (GMV) gain.

Overview

years of professional experience

Work History

Staff Applied Researcher

Apple (Seattle, WA)

Seattle, WA

06.2023 - Current

ASE Foundation Modeling

Build Large Language Model (LLM) pre-training and fine-tuning tool.
Research on domain adaptation to improve the performance of domain-specific tasks.

LLM-Based Query Auto-Completion Modeling

Build LLM-based Query Auto-Completion suggester with RAG, multi-objective alignment & distillation;
Yielded +3.20% query auto-completion usage rate lift and -5.37% reduction on average typed keystrokes per search;

Senior Applied Scientist

Amazon (Seattle, WA)

Seattle, WA

07.2015 - 06.2023

Search Widget Ranking (2021-Current)

Formulate search widget ranking as a multi-agent markov decision process (MAMDP), and tackle it by the Multi-Agent Deep Deterministic Policy Gradient (MADDPG) model. Off policy evaluation (OPE) method shows +5.7% improvements on the doubly robust estimator; Online experiment is on-going.
Led 5 experienced engineers to productionize the MADDPG model and a mix-offline multi-agent RL system.
Requested a national patent on applying multi-agent RL to search ranking and related paper (Publication #1 below) got accepted by Amazon Machine Learning Conference (AMLC) 2022 as oral presentation with 8% acceptance rate.
Designed mission-aware deep bandits model to rank widgets in search. Offline yields +3% CTR and +2.3% GMV prediction accuracy lift; Online experiment yielded +$42MM GMV gain.
Led 1 scientist and 3 engineers to productionize the mission-aware deep bandits system.
Related paper (Publication #3 below) got accepted by AMLC 2021 as oral presentation with 9% acceptance rate.

Search Query Autocomplete Ranking (2019-2020)

Proposed metrics-weighted mean reciprocal rank (MW-MRR) as offline ranking evaluation metrics for Query Autocomplete ranking task, adopted by 10 scientists onwards and driven 12+ ranking iterations.
Launched three tree-based and one deep learning-based pairwise learning to rank models, yielded +276bps suggestion acceptance rate lift and +$1.3B annualized GMV gain.
Partnered with 2 senior engineers to enable deep learning online inference to serve 300K TPS at 15ms @p99.
Related paper (Publication #4) got accepted by AMLC 2020 as oral presentation with 10% acceptance rate.

Series Search (2015-2017)

Designed 3 query understanding models to link customer queries to book/movie series entities.
Drove 9 online experiments and 3 were successfully launched, yielded +$4.3MM GMV gain.

Senior Research and Developer

Baidu (Beijing, China)

07.2011 - 07.2015

Baidu News Recommendation

News feed ranking using GBDT.
Customer modeling and feature engineering: 1) mining interests using Ebbinghaus Forget Curve; 2) mining commuting route using DBSCAN; 3) cluster customers using K-means, and recommend news from belonging clusters.

Education

Master of Science - Computer Science

University of Washington

Seattle, WA

06-2011

Bachelor of Science - Computer Science

Wuhan University of Technology

Wuhan

07.2009

Skills

Python, Java, Spark, ML & Reinforcement Learning

Recommendation, Informational Retrieval

Publications and Patents

Cooperative Multi-Agent Deep Reinforcement Learning In Widget Ranking (arxiv.org/pdf/2408.04251, SIGIR2024)
An Application Of View Based Modeling In Widget Ranking (tinyurl.com/4c7jnmy5, CSS2022 Oral)
Mission Aware Widget Ranking With Transformer (tinyurl.com/mr38bzp8, AMLC2021 Oral)
Deep Pairwise Learning To Rank For Search Autocomplete (arxiv.org/abs/2108.04976, WWW2021)
Context-Aware Reranking for Search Autocomplete (tinyurl.com/3vyyjx7a, AMLC2019)
Indexing And Presentation Of New Digital Content (patents.justia.com/patent/10909196)

Timeline

Staff Applied Researcher

Apple (Seattle, WA)

06.2023 - Current

Senior Applied Scientist

Amazon (Seattle, WA)

07.2015 - 06.2023

Senior Research and Developer

Baidu (Beijing, China)

07.2011 - 07.2015

Bachelor of Science - Computer Science

Wuhan University of Technology

Master of Science - Computer Science

University of Washington

Kai Yuan

Summary

Overview

Work History

Staff Applied Researcher

Senior Applied Scientist

Senior Research and Developer

Education

Master of Science - Computer Science

Bachelor of Science - Computer Science

Skills

Publications and Patents

Timeline

Staff Applied Researcher

Senior Applied Scientist

Senior Research and Developer

Bachelor of Science - Computer Science

Master of Science - Computer Science

Similar Profiles

LAMINE KABALAMINE KABA

CANDACE M. FRIEBOESCANDACE M. FRIEBOES

Brenna TaylorBrenna Taylor

Yichen SunYichen Sun

Channay KuchChannay Kuch