Deep Reinforcement Learning-Based Intelligent Reflecting Surface Optimization for TDD Multi-User MIMO Systems

Cited by: 2
Authors
Zhao, Fengyu [1 ]
Chen, Wen [1 ]
Liu, Ziwei [1 ]
Li, Jun [2 ]
Wu, Qingqing [1 ]
Affiliations
[1] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200240, Peoples R China
[2] Nanjing Univ Sci Technol, Sch Elect & Opt Engn, Nanjing 210094, Peoples R China
Keywords
Downlink; Uplink; Rician channels; Optimization; Channel estimation; Wireless networks; Signal to noise ratio; Intelligent reflecting surface (IRS); time-division duplexing (TDD); multi-user multiple-input-multiple-output (MU MIMO); deep reinforcement learning (DRL)
DOI
10.1109/LWC.2023.3301496
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In this letter, we investigate the discrete phase shift design of the intelligent reflecting surface (IRS) in a time-division duplexing (TDD) multi-user multiple-input-multiple-output (MIMO) system. We modify the design of the deep reinforcement learning (DRL) scheme so that it maximizes the average downlink data transmission rate without requiring the sub-channel channel state information (CSI). Based on the characteristics of the model, we modify the proximal policy optimization (PPO) algorithm and integrate a gated recurrent unit (GRU) to tackle the non-convex optimization problem. Simulation results show that the proposed PPO-GRU surpasses the benchmarks in terms of performance, convergence speed, and training stability.
Pages: 1951-1955
Page count: 5