An Autonomous Transmission Scheme Using Dueling DQN for D2D Communication Networks

Cited by: 37

Authors:

Ban, Tae-Won [1]

Affiliations:

[1] Gyeongsang Natl Univ, Dept Informat & Commun Engn, TongYeong 138727, South Korea

Source:

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY | 2020 / Vol. 69 / Issue 12

Funding:

National Research Foundation, Singapore;

Keywords:

Autonomous transmission; device-to-device (D2D); dueling deep reinforcement learning (DRL); transmission scheme;

DOI:

10.1109/TVT.2020.3041458

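Chinese Library Classification (CLC) codes:

TM [Electrical engineering]; TN [Electronic technology, communication technology];

Discipline classification codes: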

0808 ; 0809 ;

Abstract:

In this paper, we investigate device-to-device (D2D) communication networks, which are one of the key technologies for next-generation mobile communication networks and for many other applications such as unmanned aerial vehicles (UAVs), vehicle-to-vehicle (V2V) communication, and the Internet of Things (IoT). The overlay D2D communication networks considered in our study use dedicated radio resources separate from those used by cellular networks, so co-channel interference exists within the D2D network but there is no cross-channel interference between the two networks. We propose a new transmission scheme for overlay D2D networks that uses a dueling deep reinforcement learning (DRL) architecture. The DRL is especially effective in environments where actions do not affect subsequent states, as in wireless communication networks. The main contribution of this paper is that the proposed architecture is designed to use only information that each D2D device can easily obtain by measuring channels. The proposed scheme thus enables D2D devices to train their own neural networks and to decide autonomously whether to transmit data without any intervention from infrastructure. The performance of the proposed scheme is analyzed in terms of average sum-rates and is compared with three baseline schemes. Simulation results show that the proposed scheme can achieve almost optimal sum-rates at low signal-to-noise ratio (SNR) values without any intervention from infrastructure.
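The two quantities the abstract relies on, the dueling aggregation of Q-values and the sum-rate under co-channel interference, can be illustrated with a minimal sketch. This is not the paper's implementation: the mean-subtracted aggregation is the standard dueling form, while the action encoding (0 = stay silent, 1 = transmit), the function names, and the `gains` matrix layout are assumptions made here for illustration.

```python
import math


def dueling_q_values(state_value, advantages):
    """Combine a state value V(s) and per-action advantages A(s, a) into
    Q(s, a) = V(s) + A(s, a) - mean(A(s, .)), the standard dueling form."""
    mean_adv = sum(advantages) / len(advantages)
    return [state_value + a - mean_adv for a in advantages]


def choose_action(q_values):
    """Greedy choice over Q-values.
    Assumed encoding (hypothetical): 0 = stay silent, 1 = transmit."""
    return max(range(len(q_values)), key=lambda i: q_values[i])


def sum_rate(tx_flags, gains, tx_power=1.0, noise_power=1.0):
    """Sum of Shannon rates (bit/s/Hz) over the active D2D pairs.
    gains[i][j] is the assumed channel power gain from transmitter j
    to receiver i; every other active transmitter counts as interference."""
    total = 0.0
    n = len(tx_flags)
    for i in range(n):
        if not tx_flags[i]:
            continue
        interference = sum(
            tx_power * gains[i][j]
            for j in range(n)
            if j != i and tx_flags[j]
        )
        sinr = tx_power * gains[i][i] / (noise_power + interference)
        total += math.log2(1.0 + sinr)
    return total


# Two D2D pairs with symmetric cross gains (illustrative numbers only).
gains = [[1.0, 0.5], [0.5, 1.0]]
q = dueling_q_values(1.0, [-0.5, 0.5])
action = choose_action(q)
```

In this toy setting a device would evaluate `dueling_q_values` on its locally measured channel state and transmit only when `choose_action` returns 1; `sum_rate` is the metric by which the abstract says the scheme is evaluated.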

References

Pages: 16348-16352

Page count: 5

20 entries in total

[1] MNO-OTT Collaborative Video Streaming in 5G: The Zero-Rated QoE Approach for Quality and Resource Management [J]. Ahmad, Arslan; Atzori, Luigi. IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2020, 17 (01): 361-374

[2] [Anonymous], 2006, Fundamentals of Wireless Communication

[3] On the Link Scheduling for Cellular-Aided Device-to-Device Networks [J]. Ban, Tae-Won; Jung, Bang Chul. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2016, 65 (11): 9404-9409

[4] Bi Z., 2020, 2020 IEEE 91 VEHICUL, P1

[5] Cisco, 2019, CISC VIS NETW IND GL

[6] Smart Mode Selection Using Online Reinforcement Learning for VR Broadband Broadcasting in D2D Assisted 5G HetNets [J]. Feng, Lei; Yang, Zhixiang; Yang, Yang; Que, Xiaoyu; Zhang, Kai. IEEE TRANSACTIONS ON BROADCASTING, 2020, 66 (02): 600-611

[7] Energy-Efficient Device Discovery in D2D Cellular Networks for Public Safety Scenario [J]. Kaleem, Zeeshan; Qadri, Nadia N.; Duong, Trung Q.; Karagiannidis, George K. IEEE SYSTEMS JOURNAL, 2019, 13 (03): 2716-2719

[8] Interference-Aware Resource-Sharing Scheme for Multiple D2D Group Communications Underlaying Cellular Networks [J]. Li, Yunpeng; Kaleem, Zeeshan; Chang, KyungHi. WIRELESS PERSONAL COMMUNICATIONS, 2016, 90 (02): 749-768

[9] Malarski KM, 2020, 2020 FIFTH INTERNATIONAL CONFERENCE ON FOG AND MOBILE EDGE COMPUTING (FMEC), P196, DOI 10.1109/FMEC49853.2020.9144704

[10] Human-level control through deep reinforcement learning [J]. Mnih, Volodymyr; Kavukcuoglu, Koray; Silver, David; Rusu, Andrei A.; Veness, Joel; Bellemare, Marc G.; Graves, Alex; Riedmiller, Martin; Fidjeland, Andreas K.; Ostrovski, Georg; Petersen, Stig; Beattie, Charles; Sadik, Amir; Antonoglou, Ioannis; King, Helen; Kumaran, Dharshan; Wierstra, Daan; Legg, Shane; Hassabis, Demis. NATURE, 2015, 518 (7540): 529-533
