Deep Reinforcement Learning Enables Joint Trajectory and Communication in Internet of Robotic Things

被引：0

作者：

Luo, Ruyu ^{[1
]}

Tian, Hui ^{[1
]}

Ni, Wanli ^{[2
]}

Cheng, Julian ^{[3
]}

Chen, Kwang-Cheng ^{[4
]}

机构：

[1] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing 100876, Peoples R China

[2] Tsinghua Univ, Dept Elect Engn, Beijing 100084, Peoples R China

[3] Univ British Columbia, Sch Engn, Kelowna, BC V1V 1V7, Canada

[4] Univ S Florida, Dept Elect Engn, Tampa, FL 33620 USA

来源：

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS | 2024年 / 23卷 / 12期

基金：

中国国家自然科学基金;

关键词：

Ultra reliable low latency communication; Trajectory; Resource management; Robots; NOMA; Wireless communication; Decoding; Deep reinforcement learning; Internet of Robotic Things; trajectory design; ultra-reliable low-latency communications; RESOURCE-ALLOCATION; URLLC; NOMA; OPTIMIZATION; CAPACITY;

D O I：

10.1109/TWC.2024.3462450

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Internet of Robotic Things (IoRT) emphasizes the integrated robotic, artificial intelligence computing, and communication technologies, enabling more sophisticated operations and decision-making. As a crucial element of IoRT, mission-critical applications, such as industrial manufacturing and emergency services, impose stringent requirements on ultra-reliable and low-latency communication (URLLC). The paper focuses on addressing URLLC challenges in the context of IoRT, particularly when autonomous mobile robots (AMRs) coexist with static sensors. We prioritize safe and efficient AMRs' travel through trajectory design and communication resource allocation in IoRT systems without the need of any prior knowledge. To enhance network connectivity and exploit diversity gains, we introduce the flexible decoding and free clustering as the next-generation multiple access technologies in spectrum-limited downlink IoRT system. Then, aiming at minimizing the decoding error probability and travel time, we formulate a long-term multi-objective optimization problem by jointly designing AMRs' trajectory and communication resource. To accommodate the inherent dynamics and unpredictability in the IoRT system, we introduce a multi-agent actor-critic deep reinforcement learning (DRL) framework, offering four distinct implementations, each accompanied by comprehensive complexity analyses. Simulation results reveal the following insights: 1) in terms of DRL implementations, off-policy algorithms with deterministic policies outperform their on-policy counterparts, achieving approximately a 67% increase in rewards; 2) In terms of communication schemes, our proposed flexible decoding and free clustering strategies under designed trajectories can effectively reduce decoding errors; and 3) In terms of algorithm optimality, our DRL framework shows superior flexibility and adaptability in communication environments compared to traditional A* search and heuristic methods.

引用

页码：18154 / 18168

页数：15

共 56 条

[1] A Reliable Reinforcement Learning for Resource Allocation in Uplink NOMA-URLLC Networks [J].

Ahsan, Waleed ;

Yi, Wenqiang ;

Liu, Yuanwei ;

Nallanathan, Arumugam .

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2022, 21 (08) :5989-6002

[2] Policy derivation methods for critic -only reinforcement learning in continuous spaces [J].

Alibekov, Eduard ;

Kubalik, Jiri ;

Babuska, Robert .

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2018, 69 :178-187

[3] Superposition-Based URLLC Traffic Scheduling in 5G and Beyond Wireless Networks [J].

Almekhlafi, Mohammed ;

Arfaoui, Mohamed Amine ;

Assi, Chadi ;

Ghrayeb, Ali .

IEEE TRANSACTIONS ON COMMUNICATIONS, 2022, 70 (09) :6295-6309

[4] Risk-Aware Resource Allocation for URLLC: Challenges and Strategies with Machine Learning [J].

Azari, Amin ;

Ozger, Mustafa ;

Cavdar, Cicek .

IEEE COMMUNICATIONS MAGAZINE, 2019, 57 (03) :42-48

[5]

Baxi Amit, 2022, IEEE Internet of Things Magazine, V5, P26, DOI 10.1109/IOTM.001.2200056

[6] Optimizing Resource Allocation in URLLC for Real-Time Wireless Control Systems [J].

Chang, Bo ;

Zhang, Lei ;

Li, Liying ;

Zhao, Guodong ;

Chen, Zhi .

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2019, 68 (09) :8916-8927

[7] Wireless Networked Multirobot Systems in Smart Factories [J].

Chen, Kwang-Cheng ;

Lin, Shih-Chun ;

Hsiao, Jen-Hao ;

Liu, Chun-Hung ;

Molisch, Andreas F. ;

Fettweis, Gerhard P. .

PROCEEDINGS OF THE IEEE, 2021, 109 (04) :468-494

[8] A Comprehensive Survey on Internet of Things (IoT) Toward 5G Wireless Systems [J].

Chettri, Lalit ;

Bera, Rabindranath .

IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (01) :16-32

[9] Deep Learning Based Communication Over the Air [J].

Doerner, Sebastian ;

Cammerer, Sebastian ;

Hoydis, Jakob ;

ten Brink, Stephan .

IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2018, 12 (01) :132-143

[10]

Duan Y, 2016, PR MACH LEARN RES, V48

← 1 2 3 4 5 6 →