Efficient Training Management for Mobile Crowd-Machine Learning: A Deep Reinforcement Learning Approach

Cited by: 68

Authors:

Tran The Anh [1]

Nguyen Cong Luong [1]

Niyato, Dusit [1]

Kim, Dong In [2]

Wang, Li-Chun [3]

Affiliations:

[1] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore, Singapore

[2] Sungkyunkwan Univ, Dept Elect & Comp Engn, Suwon 16419, South Korea

[3] Natl Chiao Tung Univ, Dept Elect & Comp Engn, Hsinchu 300, Taiwan

Source:

IEEE WIRELESS COMMUNICATIONS LETTERS | 2019 / Vol. 8 / Issue 05

Funding:

National Research Foundation, Singapore;

Keywords:

Mobile crowd; federated learning; deep reinforcement learning;

DOI:

10.1109/LWC.2019.2917133

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

In this letter, we consider the concept of mobile crowd-machine learning (MCML) for a federated learning model. The MCML enables mobile devices in a mobile network to collaboratively train neural network models required by a server while keeping data on the mobile devices. The MCML thus addresses data privacy issues of traditional machine learning. However, the mobile devices are constrained by energy, CPU, and wireless bandwidth. Thus, to minimize the energy consumption, training time, and communication cost, the server needs to determine proper amounts of data and energy that the mobile devices use for training. However, under the dynamics and uncertainty of the mobile environment, it is challenging for the server to determine the optimal decisions on mobile device resource management. In this letter, we propose to adopt a deep Q-learning algorithm that allows the server to learn and find optimal decisions without any a priori knowledge of network dynamics. Simulation results show that the proposed algorithm outperforms the static algorithms in terms of energy consumption and training latency.
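The abstract names deep Q-learning but gives no state, action, or reward definitions, so the following is only a minimal sketch of the underlying idea, not the authors' method. It swaps the deep network for a plain tabular Q-learning update and uses an entirely hypothetical toy environment: the battery levels, the action set, the reward formula, and the recharge probability are all assumptions made for illustration. It shows how a server could learn, by trial and error and without a model of the dynamics, how much energy to request from one device.

```python
import random

# Hypothetical toy model: none of these values come from the letter.
ENERGY_LEVELS = 4      # assumed device battery states 0..3
ACTIONS = [0, 1, 2]    # assumed energy units the server requests per round


def step(state, action, rng):
    """Toy environment: requested energy is spent on training, battery recharges at random."""
    used = min(action, state)               # a device cannot spend more than it holds
    reward = 2.0 * used - 0.5 * action      # assumed: data trained minus a request cost
    recharge = 1 if rng.random() < 0.5 else 0
    next_state = min(ENERGY_LEVELS - 1, state - used + recharge)
    return next_state, reward


def train(episodes=2000, steps=20, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Q-learning with an epsilon-greedy policy (stand-in for a deep Q-network)."""
    rng = random.Random(seed)
    q = [[0.0 for _ in ACTIONS] for _ in range(ENERGY_LEVELS)]
    for _ in range(episodes):
        state = rng.randrange(ENERGY_LEVELS)
        for _ in range(steps):
            if rng.random() < epsilon:
                a = rng.randrange(len(ACTIONS))                           # explore
            else:
                a = max(range(len(ACTIONS)), key=lambda i: q[state][i])   # exploit
            nxt, r = step(state, ACTIONS[a], rng)
            # Q-learning update: move Q(s, a) toward r + gamma * max_a' Q(s', a')
            q[state][a] += alpha * (r + gamma * max(q[nxt]) - q[state][a])
            state = nxt
    return q


q_table = train()
# Greedy action index for each battery level under the learned table.
policy = [max(range(len(ACTIONS)), key=lambda i: q_table[s][i])
          for s in range(ENERGY_LEVELS)]
```

A deep Q-learning variant would replace the `q` table with a neural network that maps a state vector to one value per action, which matters once the state space is too large to enumerate.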


Pages: 1345-1348

Page count: 4

