Pioneered On-Device Agentic Capabilities: Initiated and built a cross-platform On-Device Retrieval-Augmented Generation (RAG) and Function Calling (FC) framework. Delivered C++ and Java pipelines and evaluation tools, driving adoption by key products such as Chrome and Google Home, as well as by research projects.
Optimized LLM Runtime: Co-developed the on-device inference runtime for the next-generation Gemini Nano model. Chrome adopted the runtime to power browser features, including real-time tech-support scam detection.
Advanced On-Device LLM Support: Engineered the JAX-to-TFLite conversion pipeline essential for deploying Gemini Nano on the Android platform. Developed the conversion logic that legalizes StableHLO operators into TFLite operators, enabling the first generation of on-device LLM execution for Gemini Nano.
Advanced PyTorch On-Device Support: Led the design and implementation of unbounded dynamism in the conversion pipeline. Drove cross-functional collaboration with the PyTorch-XLA team to architect solutions for dynamic shapes on edge devices.
Architected Gemini Nano iOS SDK: Led the design and development of the foundational SDK for serving on-device generative AI across Google’s first-party iOS apps. Authored the core technical architecture for model serving and on-device safety, navigated complex security approvals, and drove integration with key clients such as Google Search, Google Home, and XR.
Software Engineer
Google - Lens
07.2019 - 03.2022
Led the development of the Unified Search Results Page, spanning both mobile client architecture and frontend server development.
Education
Master of Science - Electrical and Computer Engineering
Carnegie Mellon University
Pittsburgh, PA
12.2014
Bachelor of Science - Electrical and Computer Engineering
Beijing University of Posts and Telecommunications
China
08.2013
Skills
On-device AI & LLM
ML Frameworks & Tools: JAX, TensorFlow, PyTorch, MLIR
SDK development
Languages: C, Python, Java, Swift
Timeline
Software Engineer
Google - TensorFlow Lite/On-Device ML, CoreML
03.2022 - Current
Software Engineer
Google - Lens
07.2019 - 03.2022
Master of Science - Electrical and Computer Engineering
Carnegie Mellon University
Bachelor of Science - Electrical and Computer Engineering
Beijing University of Posts and Telecommunications