LEARNING TO CONTINUOUSLY OPTIMIZE WIRELESS RESOURCE IN EPISODICALLY DYNAMIC ENVIRONMENT

被引:12
作者
Sun, Haoran [1 ]
Pu, Wenqiang [2 ]
Zhu, Minghe [2 ,3 ]
Fu, Xiao [4 ]
Hong, Mingyi [1 ]
Changtt, Tsung-Hui [2 ,3 ]
机构
[1] Univ Minnesota, ECE Dept, Minneapolis, MN 55455 USA
[2] Shenzhen Res Inst Big Data, Shenzhen, Peoples R China
[3] Chinese Univ Hong Kong, Shenzhen, Peoples R China
[4] Oregon State Univ, Sch EECS, Corvallis, OR 97331 USA
来源
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021) | 2021年
关键词
Deep learning; continual learning; wireless communication; data-driven methods; NEURAL-NETWORKS; DEEP;
D O I
10.1109/ICASSP39728.2021.9413503
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
There has been a growing interest in developing data-driven, in particular deep neural network (DNN) based methods for modern communication tasks. For a few popular tasks such as power control, beamforming, and MIMO detection, these methods achieve state-of-the-art performance while requiring less computational efforts, less channel state information (CSI), etc. However, it is often challenging for these approaches to learn in a dynamic environment where parameters such as CSIs keep changing. This work develops a methodology that enables data-driven methods to continuously learn and optimize in a dynamic environment. Specifically, we consider an "episodically dynamic" setting where the environment changes in "episodes", and in each episode the environment is stationary. We propose a continual learning (CL) framework for wireless systems, which can incrementally adapt the learning models to the new episodes, without forgetting models learned from the previous episodes. Our design is based on a novel min-max formulation which ensures certain "fairness" across different episodes. Finally, we demonstrate the effectiveness of the CL approach by customizing it to a popular DNN based model for power control, and testing using both synthetic and real data.
引用
收藏
页码:4945 / 4949
页数:5
相关论文
共 31 条
[1]   Contribution of the Zubair source rocks to the generation and expulsion of oil to the reservoirs of the Mesopotamian Basin, Southern Iraq [J].
Al-Khafaji, Amer Jassim ;
Sadooni, Fadhil ;
Hindi, Mohammed Hadi .
PETROLEUM SCIENCE AND TECHNOLOGY, 2019, 37 (08) :940-949
[2]  
Aljundi R, 2019, ADV NEUR IN, V32
[3]  
Bengtsson M., 1999, PROC ANN ALLERTON C, P987
[4]   Deep Learning Based Communication Over the Air [J].
Doerner, Sebastian ;
Cammerer, Sebastian ;
Hoydis, Jakob ;
ten Brink, Stephan .
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2018, 12 (01) :132-143
[5]   Optimal Wireless Resource Allocation With Random Edge Graph Neural Networks [J].
Eisen, Mark ;
Ribeiro, Alejandro .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2020, 68 :2977-2991
[6]  
Isele D, 2018, AAAI CONF ARTIF INTE, P3302
[7]   Overcoming catastrophic forgetting in neural networks [J].
Kirkpatricka, James ;
Pascanu, Razvan ;
Rabinowitz, Neil ;
Veness, Joel ;
Desjardins, Guillaume ;
Rusu, Andrei A. ;
Milan, Kieran ;
Quan, John ;
Ramalho, Tiago ;
Grabska-Barwinska, Agnieszka ;
Hassabis, Demis ;
Clopath, Claudia ;
Kumaran, Dharshan ;
Hadsell, Raia .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2017, 114 (13) :3521-3526
[8]   Deep Power Control: Transmit Power Control Scheme Based on Convolutional Neural Network [J].
Lee, Woongsup ;
Kim, Minhoe ;
Cho, Dong-Ho .
IEEE COMMUNICATIONS LETTERS, 2018, 22 (06) :1276-1279
[9]   Towards Optimal Power Control via Ensembling Deep Neural Networks [J].
Liang, Fei ;
Shen, Cong ;
Yu, Wei ;
Wu, Feng .
IEEE TRANSACTIONS ON COMMUNICATIONS, 2020, 68 (03) :1760-1776
[10]  
Lopez-Paz D, 2017, ADV NEUR IN, V30