Summary
Overview
Work History
Education
Skills
Accomplishments
Recent Projects
Certification
Timeline
Generic

Haitao Liu

Foster City,USA

Summary

Senior software engineer with extensive experience in architecting and developing online streaming and offline batch services on distributed systems utilized by machine learning applications

Overview

13
13
years of professional experience
1
1
Certification

Work History

Senior Software Engineer

Google Inc.
Sunnyvale, USA
04.2018 - Current
  • Google Play - TL of the Google Play Incentive Offer Retrieval Platform, driving personalized offer serving on store surfaces powered by CRM targeting, governance and abuse, and ML ranking capabilities, working with XFN teams (tooling, marketing, product, analyst) for the user growth and monetization goals.

Lead Data Platform Software Engineer

Lattice Engines
San Mateo, USA
04.2014 - 04.2018
  • Designed and built converged big data machine learning platform for sales prediction products with a dedicated team.
  • Developed Python machine learning pipelines focused on feature generation, model creation, and visualization.

Cloud Platform Software Engineer

Samsung R&D Center
San Jose, USA
02.2013 - 03.2014
  • Implemented AWS Infrastructure Provisioning automation scripts and Configuration Management tools.

Software Development Intern

VCE
Santa Clara, USA
06.2012 - 08.2012
  • Developed Java CLI for System Admins to manage the Converged Private Cloud Infrastructure VBlock (VMWare Virtualization, Cisco Blade Server & Switch, EMC Storage)

Education

Master of Computer Science -

University of Illinois
Urbana-Champaign, IL
12.2012

Bachelor of Engineering - Computer Science with honor

Nanjing University of Posts & Telecommunications
Nanjing, China
06.2011

Skills

  • Programming languages: Java, C, Python, Javascript, Typescript
  • Frameworks: Guice, Spring, Hibernate
  • Distributed systems: Boq, Kafka, Redis, Hadoop Yarn, Spanner, HDFS, S3, Influx DB
  • Machine learning techniques: Deep learning, Hyperparameter tuning, Regularization
  • Machine learning frameworks: NumPy, Pandas, TensorFlow
  • A/B testing experiment tools: Rasta, Mendel

Accomplishments

China National Scholarship, 2010

Recent Projects

  • Google Play Store Incentive Offer Retrieval Platform, Redesigned the Incentive Offer Retrieval Platform by introducing a mini runtime workflow framework, which allows config driven plugins to enable various capabilities including User Targeting, Content Personalization, State Persistence, Governance Snoozing, Abuse Prevention, and Rule based and ML based Ranking. Successfully onboarded multiple serving flows and landed important use cases on Store Surfaces requested by different Vertical teams (Games, Apps, Search, Loyalty, Commerce teams, etc.)
  • Self-Service Modeling and Bulk Scoring Platform, Led a team and developed a Data Analytics Machine Learning Platform and Scoring Pipeline, supporting Camel Connector Importer, data Matching and Enrichment, Cascading data Transformation, Analytics/Scoring pipeline and data Export. Designed a Kafka based distributed Job Scheduling & Orchestration platform. Implemented the Service Registry Watcher, Distributed Lock and Caching layer using Redis.
  • Hadoop Yarn Data Platform, Architected the Container Allocation framework on top of Hadoop Yarn to allow running different types of jobs with different resources. The resource efficiency had been improved by 30%., Added the support for Resource Manager HA in spring-yarn-core library, by performing failover on top of regular RPC Proxy.
  • Machine Learning Modeling Pipeline, Worked on the Supervised Machine Learning Pipeline for feature generation in Python, involving Null Value Imputation, Discrete Value Bucketing, Embeddings, etc, and building Random Forest Models. Output Model evaluation Scores and Feature Importance for Visualization on UI.
  • Prediction Engine, Redesigned the batch-processing prediction engine with a more light weight, multi-tenant, supporting both Quartz scheduled and on demand requests based and distributed version, by leveraging the MapReduce framework with 15x throughput improvement.

Certification

  • Neural Networks and Deep Learning
  • Improving Deep Neural Networks: Hyperparameter Tuning, Regularization and Optimization
  • Sequence Models
  • Generative AI with Large Language Models

Timeline

Senior Software Engineer

Google Inc.
04.2018 - Current

Lead Data Platform Software Engineer

Lattice Engines
04.2014 - 04.2018

Cloud Platform Software Engineer

Samsung R&D Center
02.2013 - 03.2014

Software Development Intern

VCE
06.2012 - 08.2012

Master of Computer Science -

University of Illinois

Bachelor of Engineering - Computer Science with honor

Nanjing University of Posts & Telecommunications