Disaggregated model serving.
- Design and implementation of a model rewriting tool for disaggregated serving.
- Measurement of latency overhead and estimation of benefits in total cost of ownership.
- Rule-based model optimization for disaggregated serving.