I am an Machine Learning Software Engineer with experience in JAX and XLA high-performance computing at Google. I have special experience in image and diffusion models. I also have experience contributing to OSS projects and working in OSS community. I have an deep understanding of accelerator architecture (TPU/GPU), Multivariate calculus behind AI and low level kernel programming in accelerators. I am very passionate about ML perfomance and outside work I am active in the field. I have given talks in Eleuther AI's ML performance reading group, the recordings are on their youtube channel. I am also active on several discord servers such as GPU mode and jax helping answer questions related to XLA, TPUs, Jax, pallas and also learning from the community.
I worked on model bring up and optimization of mainly OSS diffusion and some MOE models. My team and my main responsibility is implementing the forward pass, training pipeline in JAX and optimizing its performance on TPUs and GPUs.
Worked on Optimizing WAN 2.1 diffusion model training and inference on latest.
Previously worked on Distributed Inference Team
Worked on Tanium Cloud team , wrote bazel libraries to support tanium cloud product
Joined as new grad and implemented a Kafka pipeline to ingest AWS logs and using a logical model create firewall policies.