Joint Data Collection and Resource Allocation for Distributed Machine Learning at the Edge

Times cited: 8
Authors
Chen, Min [1, 2]
Wang, Haichuan [1, 2]
Meng, Zeyu [3]
Xu, Hongli [1, 2]
Xu, Yang [1, 2]
Liu, Jianchun [4]
Huang, He [5]
Affiliations
[1] Univ Sci & Technol China, Sch Comp Sci & Technol, Hefei 230027, Anhui, Peoples R China
[2] Univ Sci & Technol China, Suzhou Inst Adv Study, Suzhou 215123, Jiangsu, Peoples R China
[3] Univ Sci & Technol China, Sch Cyberspace Secur, Hefei 230027, Anhui, Peoples R China
[4] Univ Sci & Technol China, Sch Data Sci, Hefei 230027, Anhui, Peoples R China
[5] Soochow Univ, Sch Comp Sci & Technol, Suzhou 215006, Jiangsu, Peoples R China
Funding
National Science Foundation (USA);
Keywords
Servers; Training; Task analysis; Edge computing; Distributed databases; Resource management; Data models; model training; job assignment; resource allocation; service placement; SERVICE;
DOI
10.1109/TMC.2020.3045436
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Subject classification code
0812;
Abstract
Under the edge computing paradigm, the enormous volume of data generated at the network edge can be processed locally. To make full use of these widely distributed data, we focus on an edge computing system that conducts distributed machine learning using gradient-descent-based approaches. Ensuring the system's performance poses two major challenges: how to collect data from multiple data source nodes for training jobs, and how to allocate the limited resources on each edge server among these jobs. In this paper, we jointly consider the two challenges for distributed training (without service requirement), aiming to maximize the system throughput while ensuring the system's quality of service (QoS). Specifically, we formulate the joint problem as a mixed-integer non-linear program, which is NP-hard, and propose an efficient approximation algorithm. Furthermore, we take service placement into consideration for diverse training jobs and propose an approximation algorithm for this setting. We also show through analysis that our proposed algorithm can achieve the constant bipartite approximation under many practical situations. We build a test-bed to evaluate the effectiveness of our proposed algorithm in a practical scenario. Extensive simulation and test-bed results show that the proposed algorithms can improve the system throughput by 56-69 percent compared with the conventional algorithms.
Pages: 2876-2894
Number of pages: 19