Overview of Deep Reinforcement Learning Improvements and Applications

被引:9
作者
Zhang, Junjie [1 ]
Zhang, Cong [1 ]
Chien, Wei-Che [2 ]
机构
[1] Wuhan Polytech Univ, Sch Math & Comp Sci, Wuhan, Peoples R China
[2] Natl Dong Hwa Univ, Dept Comp Sci & Informat Engn, Shoufeng Township, Hualien County, Taiwan
来源
JOURNAL OF INTERNET TECHNOLOGY | 2021年 / 22卷 / 02期
基金
中国国家自然科学基金;
关键词
Deep reinforcement learning; Value function; Policy gradient; Sparse reward; NETWORK;
D O I
10.3966/160792642021032202002
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The deep reinforcement learning value has received a lot of attention from researchers since it was proposed. It combines the data representation capability of deep learning and the self-learning capability of reinforcement learning to give agents the ability to make direct action decisions on raw data. Deep reinforcement learning continuously optimizes the control strategy by using value function approximation and strategy search methods, ultimately resulting in an agent with a higher level of understanding of the target task. This paper provides a systematic description and summary of the corresponding improvements of these two types of classical method machines. First, this paper briefly describes the basic algorithms of classical deep reinforcement learning, including the Monte Carlo algorithm, the Q-Learning algorithm, and the most primitive deep Q network. Then the machine improvement method of deep reinforcement learning method based on value function and strategy gradient is introduced. And then the applications of deep reinforcement learning in robot control, algorithm parameter optimization and other fields are outlined. Finally, the future of deep reinforcement learning is envisioned based on the current limitations of deep reinforcement learning.
引用
收藏
页码:239 / 255
页数:17
相关论文
共 50 条
  • [41] Deep Reinforcement Learning Based Efficient and Robust Navigation Method For Autonomous Applications
    Hemming, Nathan
    Menon, Vineetha
    2023 IEEE 35TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2023, : 287 - 293
  • [42] Deep reinforcement learning and its applications in medical imaging and radiation therapy: a survey
    Xu, Lanyu
    Zhu, Simeng
    Wen, Ning
    PHYSICS IN MEDICINE AND BIOLOGY, 2022, 67 (22)
  • [43] Multi-Agent Deep Reinforcement Learning Applications in Cybersecurity: challenges and perspectives
    Tolba, Zakaria
    Dehimi, Nour El Houda
    Galland, Stephane
    Boukelloul, Soufiene
    Guassmi, Djaber
    2024 1ST INTERNATIONAL CONFERENCE ON ELECTRICAL, COMPUTER, TELECOMMUNICATION AND ENERGY TECHNOLOGIES, ECTE-TECH, 2024,
  • [44] Learning to Steal Electricity in Power Distribution Systems with Deep Reinforcement Learning
    Anderson, Osten
    Yu, Nanpeng
    2022 17TH INTERNATIONAL CONFERENCE ON PROBABILISTIC METHODS APPLIED TO POWER SYSTEMS (PMAPS), 2022,
  • [45] Study on recommendation of personalised learning resources based on deep reinforcement learning
    Li Z.
    Wang H.
    International Journal of Information and Communication Technology, 2023, 23 (04) : 299 - 313
  • [46] Automated Saturation Mitigation Controlled by Deep Reinforcement Learning
    Aguas, Elkin
    Lambert, Anthony
    Blanc, Gregory
    Debar, Herve
    2020 IEEE 28TH INTERNATIONAL CONFERENCE ON NETWORK PROTOCOLS (IEEE ICNP 2020), 2020,
  • [47] Deep reinforcement learning in medical imaging: A literature review
    Zhou, S. Kevin
    Le, Hoang Ngan
    Luu, Khoa
    Nguyen, Hien, V
    Ayache, Nicholas
    MEDICAL IMAGE ANALYSIS, 2021, 73
  • [48] Deep reinforcement learning in autonomous manipulation for celestial bodies exploration: Applications and challenges
    Gao X.
    Tang L.
    Huang H.
    Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2023, 44 (06):
  • [49] Deep reinforcement learning: Algorithm, applications, and ultra-low-power implementation
    Li, Hongjia
    Cai, Ruizhe
    Liu, Ning
    Lin, Xue
    Wang, Yanzhi
    NANO COMMUNICATION NETWORKS, 2018, 16 : 81 - 90
  • [50] Decentralized Scheduling for Cooperative Localization With Deep Reinforcement Learning
    Peng, Bile
    Seco-Granados, Gonzalo
    Steinmetz, Erik
    Frohle, Markus
    Wymeersch, Henk
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2019, 68 (05) : 4295 - 4305