18 entries in total
- [1] 3GPP, 2020, 23501 3GPP TS
- [2] [Anonymous], 2017, 3GPP TR 38.912
- [3] Cheng NF., 2023, REINFORCEMENT LEARNI, P140
- [4] Foerster JN, 2018, AAAI CONF ARTIF INTE, P2974
- [5] FU YB, 2020, IEEE T BROADCAST, V67, P2023, DOI 10.1007/S10722-020-00957-W
- [6] Haarnoja Tuomas., 2018, International conference on machine learning, P1861, DOI 10.48550/ARXIV.1801.01290
- [7] On-Policy vs. Off-Policy Deep Reinforcement Learning for Resource Allocation in Open Radio Access Network [J]. 2022 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2022: 1461-1466
- [8] Iqbal S, 2019, INT C MACH LEARN, P2961
- [9] Li C. C., 2020, White Paper
- [10] Lotfi Fatemeh, 2022, 2022 IEEE Globecom Workshops (GC Wkshps), P227, DOI 10.1109/GCWkshps56602.2022.10008614