Summary
Overview
Work History
Education
Skills
Area of Expertise
Key Qualifications
Research Experiences
Pulications
Languages
Timeline
Generic

Pengcheng Liu

Plano,TX

Summary

Forward-thinking and insightful experts with diversified skills in software system and machine learning. Experienced leader with multiple successful projects from idealization to production with varies project size from few people to hundreds of people. Exceptional problem-solving abilities both in team-oriented and self-motivated settings. Highly articulate with strong communication and interpersonal skills combined with deep technical expertise and innovated thinking capabilities.

Overview

19
19
years of professional experience

Work History

Director of Product Management

FICO
Plano, Texas
04.2023 - Current
  • Led the end-to-end product strategy, development, and launch of an enterprise-scale AI/ML next-generation decision platform for FICO.
  • Partnered with cross-functional teams (engineering, data scientists, marketing, compliance, architects) to define the product vision, roadmap, and KPIs.
  • Drove feature delivery for model lifecycle management, automated monitoring, explainability, and compliance controls, improving model execution time by 100 times faster and model deployment time by 10 times.
  • Championed customer-centric design, balancing technical innovation with regulatory requirements, and translated complex analytic concepts and requirements into workable tasks for the engineering research and architecture team, accelerating enterprise AI transformation.
  • Spearheaded initiatives to integrate large language models (LLM), including building Retrieval-Augmented Generation (RAG) pipelines, and leveraging LLMs to enhance user interaction on the platform, documentation, and knowledge discovery.

Principal Machine Learning Engineer

Siemens Enlighted
Charlotte, USA
11.2021 - 04.2023
  • Leading the development of Real-Time Location Service product including: prototype ML algorithm using LSMT recurrent neural network to predict people/asset indoor locations in real time with low energy blue tooth beacon data; deploy large number of trained ML models in big site both on prem and cloud using cloud agnostic MLOps techniques; design and implement data collection application using FastAPI to automate training process.
  • Successfully designed/implemented data lake and data analytics platform for varies type of IoT sensor data; leading data platform team to build IoT sensor connectivity from edge to cloud; guiding occupancy insight team to design the new generation occupancy analytics APIs, building data transformation pipelines with different AWS service such as Glue, Athena, Lambda and Lake Formation
  • As a subject matter of experts on Machine Learning, big data, cloud best practices, real-time processing, working with product team to plan the transition of current IoT sensor business into data platform service business
  • With the ownership mindset. taking the initiative to dive in legacy product code to help new development team to understand and support the product, and create documents and powerpoint to bridge the knowledge gap within the company for hardware, software architecture design, product functionality

QAC Principal Data Scientist

Wells Fargo
Charlotte, USA
01.2019 - 11.2021
  • As a subject matter of experts in Big data, leading the project for risk modeling team to convert SAS platform to Hadoop platform. Serve as the technique product manager to align expectations and requirements between technology team and modeling group , successfully designed and delivered the risk modeling platform (PyFarm) to enable 300+ modelers and 700+ risk models
  • As a domain expert in both software architecture and modeling, working with technology and program manager teams to design/implement/maintain Risk Modeling 200+ production system applications
  • Successfully conducting multiple training sessions within the Risk Modeling group for big data techniques such as spark, MapReduce, etc; Designing best practices for the Modeling group to follow when using the platform; assisting modeling/production team to optimize their spark jobs, improved the job run time 10x with same compute resource
  • Leading a software engineering team to develop/maintain the common python/spark modeling toolbox (pyfarm-MLlib) for Risk Modeling Group's machine learning development and statistical modeling activities
  • As a subject matter of experts in Machine Learning, working with modelling innovation team to investigate the new technology on explain-ability of algorithms such as EBM, Shapley, GAMI-NET, xNN, and ALE, successfully delivered a first ML risk model (auto loan lost forecast) within Wells Fargo

Analytic Architect

Siemens MindSphere
Charlotte, USA
04.2017 - 12.2018
  • Researched and prototyped the predictive learning application which serves as data scientist platform as Mindsphere flagship product, leading 4 scrum teams (multiple sites) to deliver Predictive Learning production launch. As a subject matter of experts in both ML and software engineer, serve teams as product owner and architect to align product requirements and system design
  • As part of the Core Architecture team in Mindsphere, Championing the serverless cloud architecture design for Mindsphere as an IoT platform and providing guidance for migrating existing Siemens internal analytic applications to Mindsphere as well as external third-party applications
  • As a subject matter of experts in machine learning, working closely with other Siemens data scientist teams to provide guidance and feedback of the machine learning research and innovations
  • As an analytic architect, working with different product development teams to provide system architect design for analytic capabilities and influencing team members to adopt best practices in software development, and bridging the gap between development and analytic requirement.

Lead Data Scientist

Siemens PLM Cloud
Charlotte, USA
10.2014 - 07.2017
  • Leading software development team and data scientist team to build a cloud machine learning application (Predictive Learning) and provide guidance in architecture design and data scientist perspective
  • Leading proof of concept team and data scientist team to run potential customer use cases with Machine Learning solutions
  • Assisting the product management team to define the product roadmap and future vision of the Analytics Applications within Siemens PLM Cloud
  • Successfully migrated existing SAAS products from private cloud infrastructure to AWS cloud leveraging native AWS cloud services such as S3, EMR, Athena, EC2, Lambda, VPC, etc
  • Building an internal data scientist team and mentoring/growing junior data scientists to promote a data-driven mindset for products and internal data analysis.

Data Analyst

Camstar Systems
Charlotte, USA
10.2013 - 10.2014
  • Successfully designed, implemented, and maintained big data platform and Analytics application (Performance Analytics) for automatically discovering insights via massive manufactory supply chain data using Big data techniques such as Hadoop, Solr, Impala, and Hbase
  • Researched and Prototyped the next generation machine learning applications (Root Cause Analysis, Predictive Maintenance) using Spark MLlib, and Mahout, and investigated frequent set and association rule algorithms to solve customer use cases
  • Implemented/maintained the data transformation pipeline process using Hadoop Pig and Hive

Software Engineer

Camstar Systems
Charlotte, USA
09.2011 - 10.2013
  • Researched and investigated big data techniques for industry manufacturer company such as Hyptertable, BigQuery, and Hadoop, played a critical role in designing and implementing the data contextualization system to link datasets across the product lifecycle for analysis, resulting in patent filling for techniques used to analyze large scale disparate datasets for root cause analysis and KPI monitoring
  • Successfully designed and implemented a big data platform and the first cloud application for storing and processing massive manufactory data (test data, returned data, failure data) using the Hadoop ecosystem. The solution assist customers to identify product quality issues across design, production, and test much earlier and saved key customers upward to 25M in annual costs due to quality improvements.
  • Design and implement data transformation process for large scale dataset using MapReduce as backend job to prepare data stored in Hbase, Impala, Solr to serve different analytical needs, Optimize the original code run time speed to 10x with major data structure redesign

Research Assistant

University Of North Carolina At Charlotte
Charlotte, NC
08.2006 - 08.2011
  • Assisted professors to perform cutting-edge research in 3D reconstruction, surface matching, surface classification and image segmentation.
  • Gathered and analyzed major research clinical data.
  • Invented new strategies for performance optimizing existing surface matching algorithms.
  • Improved existing medical 3D bone fracture reconstruction software applications FxRedux by meeting with users to collect feedback and issues of the software
  • Organized lab activities, and collaborative meetings, advised new Ph.D and master students, and addressed challenging research problems.
  • Assisted with preparing presentation materials before major meetings and reviews.

Education

PhD - Electrical Engineering

The University of North Carolina
12.2012

M.S - Electrical Engineering

The University of North Carolina
12.2010

M.S - Information Technology

State University of New York
05.2006

B.S - Electrical Engineering

Huazhong University of Science & Technology
06.2004

Skills

  • Software: Java, Python, JAVA 3D, SQL, C, C, VHDL, Matlab, R
  • Big Data: Hadoop, Cloudera, CDP, AWS EMR, HBase, MapReduce, HDFS, Solr, Elastic Search, Kafka, Storm, Databricks, Impala, Mahout, Pig, Hive, Yarn, Parquet, Avro
  • Machine Learning: Deep learning (CNN, RNN, LSMT), LLM, Reinforce Learning, NLP, Text mining, Computer Vision, 3D reconstruction, Time series, 3D graphics, Association Rule, Frequent Set Analysis, OpenCV
  • Machine Learning tools: Tensorflow, Caffe, Scikit-learn, Explainable AI (LIME, SHAP, XNN, ALE), Spark MLlib, Mahout, Weka, KNIME, Deeplearning4j
  • Cloud Platforms: AWS, Google Cloud Platform
  • Data Systems: RDMS, Data warehousing design, NoSQL (DynamoDB, Hbase, MongoDB), Graph Database (Neo4J, AWS Neptune)
  • DevOps: Terraform, Cloud Formation, CI/CD, Git Lab, Jenkins
  • AWS Services: S3, Glue, Lambda, API Gateway, Cloudwatch, EC2, EMR, RDS, SageMaker, RedShift, SQS, SNS, VPC, IoT, MKS, Kinesis, CloudFront, EKS, ECS, IAM
  • GCP Services: BigQuery, VPC network, Cloud Run, App Engine, VM Engine, Compute Engine, Batch, Cloud Storage, Cloud Build, Cloud Deploy

Area of Expertise

Artificial Intelligence, Model Context Protocol, Analytics, Big Data, Cloud, Serverless, Microservice, OOD, Real-time Systems

Research and Development,  Agile Scrum,  Project Management, Product Management, Leadership

Key Qualifications

  • 15 years of experience of software development
  • 12 years of delivering commercial enterprise applications on premise and in the cloud
  • 11 years of experience with big data analytics and distributed systems
  • 14 years of experience with machine learning research and innovations
  • 10 years of hands-on experience of machine learning algorithm development and production deployment
  • 7 years of experience product and project management
  • 5 years of experience of IoT platforms and solutions

Research Experiences

 

  • Medical Software System Project: High-energy fracture preoperative software application design
  • Goal: Design and develop the clinical embraced software system to assist surgeon for preoperative planning
  • Approach: Utilize Java and Java3D to implement the software system
  • Cooperative trauma surgeons and orthopedics researchers perform software testing with real clinical cases
  • The application contains image container and related analysis tool, 3D canvas with collision detection, scene graph structure, and underlying state-of-art image/surface segmentation and classification algorithms
  • Achievement: Successfully restore fracture original anatomy under the designed application for 10 clinical cases
  • The application is at the final stage to reach commercialization
  • As chief application software architecture designer, team leader of the system development and main coordinator with orthopedics researchers and surgeons, during the application development
  • Medical 3D Puzzle Solving Project: Automatically virtual 3D reconstruction of comminuted fracture
  • Goal: Develop a novel computer algorithm for automated virtual reconstruction of comminuted fractures
  • Approach: Novel 3D Puzzle solving algorithm using state-of-art surface matching, searching and registration techniques
  • Achievement: Successfully reconstruct 10 clinical comminuted fractures with average 3 minutes
  • One collaborative publication in biomedical article, one computer vision journal article in reviewing status
  • 3D Surface Mesh Segmentation Project : Graph efficient surface mesh segmentation
  • Goal: Develop an automated surface segmentation algorithm using the graph representation of the 3D surface
  • Approach: Construct the surface as a graph and apply efficient graph cut segmentation method on the graph
  • Achievement: Successfully divide surface into meaningful sub region
  • One conference article in proceeding
  • Embedded Vision system Project: Stereoscopic 3D Reconstruction using Motorized Zoom Lenses within an Embedded System
  • Goal: Design a novel embedded stereoscopic 3D reconstruction system capable of estimating 3D positions of surfaces, which meant to act as a 3D sensing payload for a terrestrial robot
  • Approach: Construct the hardware embedded system with a DSP running µclinux and an FPGA interacting to the system devices which consists of a CMOS camera pair and a pair of servo motors rotate each camera
  • Program the system software of novel computer vision and image analysis tools such as camera calibration, stereo dense matching, and stereo reconstruction
  • Achievement: (1) Automatic computation of the focus and exposure settings of the lens and camera sensor, (2) calibration of the system for various zoom settings and (3) Stereo reconstruction of free form objects
  • One conference article was published in this project
  • Archaeology Surface Registration Project: 3D Surface registration interface
  • Goal: Implement a semi-automated surface registration interface means to stitch overlapped 3D surface together form a complete structure model
  • Approach: Java, Java3D Implementation of the interface, surface matching and registration techniques, image stitching and texture remap
  • Achievement: Align and registered 8 archaeology sites with total 122 individual scan data

Pulications

     

Publication: Liu, P. and Willis, A. and Sui, Y., Stereoscopic 3D Reconstruction using Motorized Zoom Lenses within an Embedded System, SPIE Electronic Imaging, January 18–22, San Jose, California, Vol. 7251, pp.  72510W-72510W-12, 2009.

Publication: Thomas T.P., Anderson D.D., Willis A.R., Liu P.,   Frank M.C., Marsh J.L., and Brown T.D. A computational/experimental platform for investigating three-dimensional puzzle-solving of comminuted articular fractures. Computer Methods in Biomechanics and Biomedical Engineering.14 (3), pp. 263-270, 2011.

Publication: Liu, P. and Hewitt, N. and Waseem, G and Willis  A.R A system for 3D Reconstruction of Comminuted Tibial Plafond Bone Fractures. Computerized Medical Imaging and Graphics, vol. 2102.11684, 2021.

Languages

  • Mandarin
  • English

Timeline

Director of Product Management

FICO
04.2023 - Current

Principal Machine Learning Engineer

Siemens Enlighted
11.2021 - 04.2023

QAC Principal Data Scientist

Wells Fargo
01.2019 - 11.2021

Analytic Architect

Siemens MindSphere
04.2017 - 12.2018

Lead Data Scientist

Siemens PLM Cloud
10.2014 - 07.2017

Data Analyst

Camstar Systems
10.2013 - 10.2014

Software Engineer

Camstar Systems
09.2011 - 10.2013

Research Assistant

University Of North Carolina At Charlotte
08.2006 - 08.2011

PhD - Electrical Engineering

The University of North Carolina

M.S - Electrical Engineering

The University of North Carolina

M.S - Information Technology

State University of New York

B.S - Electrical Engineering

Huazhong University of Science & Technology
Pengcheng Liu