This project enables latency-guaranteed co-location of inference and training for reducing data center expenses. If you find this project is helpful and would like to use it, please cite our paper:
• Guoyu Chen, Srinivasan Subramaniyan and Xiaorui Wang, "Latency-guaranteed Co-location of Inference and Training for Reducing Data Center Expenses,'' 2024 IEEE 44th International Conference on Distributed Computing Systems (ICDCS), Jersey City, New Jersey, USA, 2024 (Accepted to appear).