Optimize the seed job by integrating it with the Avro exporter, paginating DB exports, and adding data quality checks
Design and build a TiDB release pipeline that automates chaos-test injection into TiDB, PD, and TiKV to mimic production issues and operations
Build TiDB performance and regression checks based on Grafana metrics and push notifications
Design and build a TiCDC release pipeline with Chaos Mesh to automate chaos injection into TiCDC
Build a tool that sends synthetic data at a configurable QPS, plus Java services that consume CDC data from Kafka, decode it, and verify its correctness and ordering
Build a DB export logging system that records metadata for DB exports
Data Infra
Software Engineer
Robinhood
04.2021 - 06.2022
Company Overview: User Onboarding
Built a new Django-based user onboarding flow for the spending (RHY) account, including vendor checks, fraud checks, identification checks, and bulk approval/rejection of new applications
Built a Django-based multi-account onboarding experience dashboard for agents to review applications
Designed and built a company-wide agreement/consent platform that integrates with multiple vendors and tracks user consent actions to support and optimize onboarding for multiple products
User Onboarding
Software Engineer
Google
12.2018 - 04.2021
Company Overview: Google Cloud Storage
Optimized read and write performance and refactored APIs
Regionalized GC pipelines, improving availability across all production regions
Regionalized global Spanner tables and refactored callers to avoid a single point of failure and reduce QPS to the Spanner tables
Designed and implemented a finer-grained emergency shutoff, a mechanism that pauses all our GC pipelines when potential data loss is detected
Implemented a general utility library to export monitoring metrics for all GC jobs, providing more insight into compaction and deletion delays
Designed and implemented a LiveChunks deletion pipeline that decreased Spanner disk usage by 50%
Google Cloud Storage
Software Engineer
Apple
08.2017 - 12.2018
Company Overview: Platform and Infrastructure
Created a new generation of APM solutions for insights into applications and monitoring metrics from multiple teams
Built a web service for DAS APIs supporting CRUD operations on data in the DAO/Service layer, and created REST APIs for external users
Designed the Cassandra data model and schema for data storage and migration from Oracle/HBase
Implemented and optimized an algorithm for data migration from Oracle to Cassandra