共 102 条
[1]
Li Songze, Maddah-Ali M A, Yu Qian, Et al., A fundamental tradeoff between computation and communication in distributed computing, IEEE Transactions on Information Theory, 64, 1, pp. 109-128, (2017)
[2]
Lee K, Lam M, Pedarsani R, Et al., Speeding up distributed machine learning using codes, IEEE Transactions on Information Theory, 64, 3, pp. 1514-1529, (2018)
[3]
Wang Yan, Li Nianshuang, Wang Xiling, Et al., Coding-based performance improvement of distributed machine learning in large-scale clusters, Journal of Computer Research and Development, 57, 3, pp. 542-561, (2020)
[4]
Li Songze, Avestimehr S., Coded Computing: Mitigating Fundamental Bottlenecks in Large-scale Distributed Computing and Machine Learning, Foundations and Trends in Communications and Information Theory, 17, 1, pp. 1-148, (2020)
[5]
Chowdhury M, Zaharia M, Ma J, Et al., Managing data transfers in computer clusters with orchestra, ACM Sigcomm Computer Communication Review, 41, 4, pp. 98-109, (2011)
[6]
Zhang Zhuoyao, Cherkasova L, Loo B T., Performance modeling of mapreduce jobs in heterogeneous cloud environments, Proc of the 6th Int Conf on Cloud Computing, pp. 839-846, (2013)
[7]
Yadwadkar N J, Hariharan B, Gonzalez J E, Et al., Multi-task learning for straggler avoiding predictive job scheduling, Journal of Machine Learning Research, 17, 1, pp. 3692-3728, (2016)
[8]
Jeffrey D, Luiz A B., The tail at scale, Communications of the ACM, 56, 2, pp. 74-80, (2013)
[9]
Li Songze, Yu Qian, Maddah-Ali M A, Et al., A Scalable framework for wireless distributed computing, IEEE/ACM Transactions on Networking, 25, 5, pp. 2643-2654, (2017)
[10]
Li Songze, Maddah-Ali M A, Avestimehr A S., Coded distributed computing: straggling servers and multistage dataflows, Proc of the 54th Annual Allerton Conf on Communication, Control, and Computing, pp. 164-171, (2016)