Disaggregated model serving.
- Design and build a model rewriting tool for disaggregated serving.
- Measure latency overhead and TCO benefits.
- Optimize models for disaggregated serving.
I am a fifth-year Ph.D. student at the Computer Systems Lab, Paul G. Allen School of Computer Science & Engineering, University of Washington, advised by Arvind Krishnamurthy. Before joining UW, I obtained a bachelor's degree from the ACM Honors Class, Shanghai Jiao Tong University, and was a research intern at Cornell University, advised by Emin Gün Sirer.
I am broadly interested in distributed systems, operating systems, and machine learning systems.