Overview of Deep Reinforcement Learning Improvements and Applications

Cited by: 9
Authors
Zhang, Junjie [1 ]
Zhang, Cong [1 ]
Chien, Wei-Che [2 ]
Affiliations
[1] Wuhan Polytech Univ, Sch Math & Comp Sci, Wuhan, Peoples R China
[2] Natl Dong Hwa Univ, Dept Comp Sci & Informat Engn, Shoufeng Township, Hualien County, Taiwan
Source
JOURNAL OF INTERNET TECHNOLOGY | 2021 / Vol. 22 / No. 02
Funding
National Natural Science Foundation of China;
Keywords
Deep reinforcement learning; Value function; Policy gradient; Sparse reward; NETWORK;
DOI
10.3966/160792642021032202002
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
Deep reinforcement learning has attracted considerable attention from researchers since it was proposed. It combines the representation capability of deep learning with the self-learning capability of reinforcement learning, enabling agents to make action decisions directly from raw data. Deep reinforcement learning continuously optimizes the control policy through value function approximation and policy search methods, ultimately yielding an agent with a better understanding of the target task. This paper provides a systematic description and summary of the improvements made to these two classes of classical methods. First, it briefly describes the basic algorithms of classical deep reinforcement learning, including the Monte Carlo algorithm, the Q-Learning algorithm, and the original deep Q network. It then introduces improvements to deep reinforcement learning methods based on the value function and the policy gradient. Next, it outlines applications of deep reinforcement learning in robot control, algorithm parameter optimization, and other fields. Finally, it discusses the future of deep reinforcement learning in light of its current limitations.
Pages: 239 - 255
Number of pages: 17
Related papers
50 in total
  • [21] A survey on deep reinforcement learning approaches for traffic signal control
    Zhao, Haiyan
    Dong, Chengcheng
    Cao, Jian
    Chen, Qingkui
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
  • [22] A survey on reinforcement learning in aviation applications
    Razzaghi, Pouria
    Tabrizian, Amin
    Guo, Wei
    Chen, Shulu
    Taye, Abenezer
    Thompson, Ellis
    Bregeon, Alexis
    Baheri, Ali
    Wei, Peng
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 136
  • [23] A deep reinforcement learning algorithm based on modified Twin delay DDPG method for robotic applications
    Vasquez-Jalpa, Carlos
    Nakano-Miyatake, Mariko
    Escamilla-Hernandez, Enrique
    2021 21ST INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2021), 2021 : 743 - 748
  • [24] Deep Reinforcement Learning for Autonomous Internet of Things: Model, Applications and Challenges
    Lei, Lei
    Tan, Yue
    Zheng, Kan
    Liu, Shiwen
    Zhang, Kuan
    Shen, Xuemin
    IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2020, 22 (03) : 1722 - 1760
  • [25] Applications of asynchronous deep reinforcement learning based on dynamic updating weights
    Xingyu Zhao
    Shifei Ding
    Yuexuan An
    Weikuan Jia
    Applied Intelligence, 2019, 49 : 581 - 591
  • [26] Multimodal Deep Reinforcement Learning for Visual Security of Virtual Reality Applications
    Andam, Amine
    Bentahar, Jamal
    Hedabou, Mustapha
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (24) : 39890 - 39900
  • [27] Common challenges of deep reinforcement learning applications development: an empirical study
    Morovati, Mohammad Mehdi
    Tambon, Florian
    Taraghi, Mina
    Nikanjam, Amin
    Khomh, Foutse
    EMPIRICAL SOFTWARE ENGINEERING, 2024, 29 (04)
  • [28] Applications of Multi-Agent Deep Reinforcement Learning: Models and Algorithms
    Ibrahim, Abdikarim Mohamed
    Yau, Kok-Lim Alvin
    Chong, Yung-Wey
    Wu, Celimuge
    APPLIED SCIENCES-BASEL, 2021, 11 (22)
  • [29] Survey of Deep Reinforcement Learning Based on Value Function and Policy Gradient
    Liu J.-W.
    Gao F.
    Luo X.-L.
    Jisuanji Xuebao/Chinese Journal of Computers, 2019, 42 (06) : 1406 - 1438
  • [30] Applications of asynchronous deep reinforcement learning based on dynamic updating weights
    Zhao, Xingyu
    Ding, Shifei
    An, Yuexuan
    Jia, Weikuan
    APPLIED INTELLIGENCE, 2019, 49 (02) : 581 - 591