Publications

Aceso: Efficient Parallel DNN Training through Iterative Bottleneck Alleviation
Guodong Liu, Youshan Miao, Zhiqi Lin, Xiaoxiang Shi, Saeed Maleki, Fan Yang, Yungang Bao, Sa Wang
EuroSys 2024

SEER: A Time Prediction Model for CNNs from GPU Kernel’s View
Guodong Liu, Sa Wang, Yungang Bao
PACT 2021

Breaking the computation and communication abstraction barrier in distributed machine learning workloads
Abhinav Jangda, Jun Huang, Guodong Liu, Amir Hossein Nodehi Sabet, Saeed Maleki, Youshan Miao, Madanlal Musuvathi, Todd Mytkowicz, Olli Saarikivi
ASPLOS 2022

Superscaler: Supporting flexible DNN parallelization via a unified abstraction
Zhiqi Lin, Youshan Miao, Guodong Liu, Xiaoxiang Shi, Quanlu Zhang, Fan Yang, Saeed Maleki, Yi Zhu, Xu Cao, Cheng Li, Mao Yang, Lintao Zhang, Lidong Zhou
Preprint 2023