Minimizing AoI in High-Speed Railway Mobile Networks: DQN-Based Methods

被引:1
作者
Zhang, Xiang [1 ,2 ,3 ]
Xiong, Ke [1 ,2 ,3 ]
Chen, Wei [4 ,5 ]
Fan, Pingyi [4 ,5 ]
Ai, Bo [6 ,7 ,8 ,9 ]
Ben Letaief, Khaled [10 ]
机构
[1] Beijing Jiaotong Univ, Engn Res Ctr Network Management Technol High Spee, Minist Educ, Sch Comp & Informat Technol, Beijing 100044, Peoples R China
[2] Beijing Jiaotong Univ, Collaborat Innovat Ctr Railway Traff Safety, Beijing 100044, Peoples R China
[3] Beijing Jiaotong Univ, Natl Engn Res Ctr Adv Network Technol, Beijing 100044, Peoples R China
[4] Tsinghua Univ, Beijing Natl Res Ctr Informat Sci & Technol, Beijing 100084, Peoples R China
[5] Tsinghua Univ, Dept Elect Engn, Beijing 100084, Peoples R China
[6] Beijing Jiaotong Univ, State Key Lab Rail Traff Control & Safety, Beijing 100044, Peoples R China
[7] Beijing Jiaotong Univ, Key Lab Railway Ind Broadband Mobile Informat Com, Beijing 100044, Peoples R China
[8] Beijing Jiaotong Univ, Beijing Engn Res Ctr High Speed Railway Broadband, Beijing 100044, Peoples R China
[9] Zhengzhou Univ, Sch Informat Engn, Zhengzhou 450001, Peoples R China
[10] Hong Kong Univ Sci & Technol HKUST, Dept Elect & Comp Engn, Hong Kong, Peoples R China
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
High-speed railway communications; age of information; AoI minimization; reinforcement learning; STATUS UPDATE; INFORMATION; AGE; COMMUNICATION;
D O I
10.1109/TITS.2024.3472033
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
This paper studies the high-speed railway mobile networks (HSRMN), where multiple railway-side sensors (RSs) are deployed along the track to sense environmental data, and multiple train-mounted sensors (TSs) are deployed on the train to collect train data. Both RSs and TSs are scheduled to transmit their sensed data respectively to the ground base station (BS) in a time division multiple access (TDMA) mode. To keep the data received at the BS from the RSs as fresh as possible and also ensure that the TSs complete the given uploading tasks, an optimization problem is established to minimize the average age of information (AoI) of the data gathered from RSs by jointly optimizing sensors' scheduling and transmission power control constrained by the maximum transmission power budget of RSs and TSs. Since the problem is non-convex and lacks an explicit expression of the objective function and the prior information about future channel state, we present a deep Q-learning network (DQN)-based method to solve it. Particularly, the BS is viewed as the agent, and the action space is constructed by scheduling policy and power control. To further accelerate the convergence speed of the presented DQN-based solution framework, an action space-reduced (ASR) version of the DQN-based method, i.e., the ASR-DQN-based method, is designed by deriving a closed-form solution to the optimal transmission power for a given sensors' scheduling policy. Numerical simulations show that, compared to the DQN-based method, the ASR-DQN-based method decreases the number of episodes required for convergence by about 23% and reduces the running time by about 41%. Moreover, compared with three baselines, i.e., the random method, the round-robin method, and the deep-Sarsa method, our presented ASR-DQN-based method achieves the lowest average AoI and has the best robustness among these compared methods.
引用
收藏
页码:20137 / 20150
页数:14
相关论文
共 51 条
[1]   Non-Orthogonal Age-Optimal Information Dissemination in Vehicular Networks: A Meta Multi-Objective Reinforcement Learning Approach [J].
Al-Habob, Ahmed A. ;
Tabassum, Hina ;
Waqar, Omer .
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (10) :9789-9803
[2]   Timely Status Updating Over Erasure Channels Using an Energy Harvesting Sensor: Single and Multiple Sources [J].
Arafa, Ahmed ;
Yang, Jing ;
Ulukus, Sennur ;
Poor, H. Vincent .
IEEE TRANSACTIONS ON GREEN COMMUNICATIONS AND NETWORKING, 2022, 6 (01) :6-19
[3]   Timely Updates in Energy Harvesting Two-Hop Networks: Offline and Online Policies [J].
Arafa, Ahmed ;
Ulukus, Sennur .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2019, 18 (08) :4017-4030
[4]  
Arafa A, 2017, CONF REC ASILOMAR C, P1801, DOI 10.1109/ACSSC.2017.8335672
[5]   AoI-Aware Joint Scheduling and Power Allocation in Intelligent Transportation System: A Deep Reinforcement Learning Approach [J].
Bai, Guangming ;
Qu, Long ;
Liu, Juan ;
Sun, Dechao .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (04) :5781-5795
[6]   Optimal Energy Resources Allocation Method of Wireless Sensor Networks for Intelligent Railway Systems [J].
Bin, Sheng ;
Sun, Gengxin .
SENSORS, 2020, 20 (02)
[7]   Scaling Laws for Age of Information in Wireless Networks [J].
Buyukates, Baturalp ;
Soysal, Alkan ;
Ulukus, Sennur .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2021, 20 (04) :2413-2427
[8]   A Cache-Aided Time-Domain Power Allocation for High-Speed Railway Communications [J].
Chen, Deen ;
Xiong, Ke ;
Zhang, Wanle ;
Ai, Bo ;
Fan, Pingyi ;
Ben Letaief, Khaled .
IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2022), 2022, :1022-1027
[9]   On the Age of Information in Status Update Systems With Packet Management [J].
Costa, Maice ;
Codreanu, Marian ;
Ephremides, Anthony .
IEEE TRANSACTIONS ON INFORMATION THEORY, 2016, 62 (04) :1897-1910
[10]   High-Speed Railway Wireless Communications: Efficiency Versus Fairness [J].
Dong, Yunquan ;
Fan, Pingyi ;
Ben Letaief, Khaled .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2014, 63 (02) :925-930