Tech lead of the Snap Observability team, a lean team responsible for all of Snap's real-time metrics, monitoring, and dashboards.
Leading the development of, and migration to, the next-generation real-time metrics framework for Snap.
Developing and operating one of the world's largest OSS metrics systems, with high scalability, stability, cost-efficiency, and performance.
Founding member of the Timestream storage team, building a state-of-the-art distributed time series database.
Deterministic Failure Handling: Built failure-hint strategies to handle failures in Timestream's distributed microservices deterministically.
Ingestion Auto Scaling: Implemented strategies to automatically scale up tiles (Timestream's internal storage units for customers' data) to handle GB/s insert traffic at latencies of hundreds of milliseconds per customer table.
Kinesis GetRecords Enhanced Fan-Out: Led, designed, and implemented 20-consumer Enhanced Fan-Out, raising Kinesis shard read capacity from 10 MB/s to 40 MB/s with 99.9% availability.