BEV-SGD: Best Effort Voting SGD Against Byzantine Attacks for Analog-Aggregation-Based Federated Learning Over the Air

Cited by: 19

Authors:

Fan, Xin [1]

Wang, Yue [2]

Huo, Yan [1]

Tian, Zhi [2]

Affiliations:

[1] Beijing Jiaotong Univ, Sch Elect & Informat Engn, Beijing 100044, Peoples R China

[2] George Mason Univ, Dept Elect & Comp Engn, Fairfax, VA 22030 USA

Source:

IEEE INTERNET OF THINGS JOURNAL | 2022 / Vol. 9 / Issue 19

Funding:

U.S. National Science Foundation; Beijing Natural Science Foundation; National Natural Science Foundation of China;

Keywords:

Power control; Convergence; Wireless communication; Internet of Things; Computational modeling; Collaborative work; Simulation; Analog aggregation; best effort voting (BEV); Byzantine attack; channel-inversion; convergence analysis; federated learning (FL); STOCHASTIC GRADIENT DESCENT; UNCODED TRANSMISSION;

DOI:

10.1109/JIOT.2022.3164339

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

As a promising distributed learning technology, analog aggregation-based federated learning over the air (FLOA) provides high communication efficiency and privacy provisioning under the edge computing paradigm. When all edge devices (workers) simultaneously upload their local updates to the parameter server (PS) through commonly shared time-frequency resources, the PS obtains only the averaged update rather than the individual local ones. While such a concurrent transmission and aggregation scheme reduces latency and communication costs, it unfortunately renders FLOA vulnerable to Byzantine attacks. Aiming at Byzantine-resilient FLOA, this article starts by analyzing the channel inversion (CI) mechanism that is widely used for power control in FLOA. Our theoretical analysis indicates that although CI can achieve good learning performance in benign scenarios, it offers only limited defensive capability against Byzantine attacks and fails to work well under them. Then, we propose a novel scheme called the best effort voting (BEV) power control policy, which is integrated with stochastic gradient descent (SGD). Our BEV-SGD enhances the robustness of FLOA to Byzantine attacks by allowing all workers to send their local updates at their maximum transmit power. Under worst-case attacks, we derive the expected convergence rates of FLOA with the CI and BEV power control policies, respectively. The rate comparison reveals that BEV-SGD achieves better convergence behavior than its CI-based counterpart, which is verified by experimental simulations.
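The contrast between the two power control policies named in the abstract can be illustrated with a toy calculation. The sketch below is not the paper's algorithm or analysis. It assumes real-valued, perfectly known channel gains, a noise-free channel, scalar updates, and a single sign-flipping attacker that ignores CI and transmits at full power. All function names, gain values, and the attack model are hypothetical choices made for illustration.

```python
import math

def ci_amplitudes(gains, p_max=1.0):
    # Channel inversion (simplified): worker i transmits with amplitude c / h_i
    # so that every update arrives with the same weight c. The common level c
    # is set by the weakest channel so no worker exceeds sqrt(p_max).
    c = min(gains) * math.sqrt(p_max)
    return [c / h for h in gains]

def bev_amplitudes(gains, p_max=1.0):
    # Best-effort policy (simplified): every worker transmits at the
    # maximum amplitude allowed by the power cap.
    return [math.sqrt(p_max)] * len(gains)

def received_sum(gains, amplitudes, updates):
    # Noise-free superposition at the PS: sum_i h_i * a_i * u_i.
    return sum(h * a * u for h, a, u in zip(gains, amplitudes, updates))

# Assumed toy setup: three honest workers and one attacker (last entry).
gains = [0.5, 1.0, 2.0, 1.0]
updates = [1.0, 1.0, 1.0, -1.0]   # the attacker flips the sign of its update

ci = ci_amplitudes(gains)
ci[-1] = 1.0                      # assumed attack: ignore CI, send at full power
bev = bev_amplitudes(gains)

ci_result = received_sum(gains, ci, updates)
bev_result = received_sum(gains, bev, updates)
print(ci_result, bev_result)
```

In this toy setup the honest workers contribute 1.5 in total under CI, because each backs off to match the weakest channel, against an attacker weight of 1.0. Under the best-effort policy they contribute 3.5 against the same attacker weight of 1.0. This gives only an intuition for why transmitting at maximum power may dilute an attacker's influence; the paper's convergence analysis is what actually supports the claim.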


Pages: 18946-18959

Page count: 14

