Overview of Deep Reinforcement Learning Improvements and Applications

被引:9
作者
Zhang, Junjie [1 ]
Zhang, Cong [1 ]
Chien, Wei-Che [2 ]
机构
[1] Wuhan Polytech Univ, Sch Math & Comp Sci, Wuhan, Peoples R China
[2] Natl Dong Hwa Univ, Dept Comp Sci & Informat Engn, Shoufeng Township, Hualien County, Taiwan
来源
JOURNAL OF INTERNET TECHNOLOGY | 2021年 / 22卷 / 02期
基金
中国国家自然科学基金;
关键词
Deep reinforcement learning; Value function; Policy gradient; Sparse reward; NETWORK;
D O I
10.3966/160792642021032202002
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The deep reinforcement learning value has received a lot of attention from researchers since it was proposed. It combines the data representation capability of deep learning and the self-learning capability of reinforcement learning to give agents the ability to make direct action decisions on raw data. Deep reinforcement learning continuously optimizes the control strategy by using value function approximation and strategy search methods, ultimately resulting in an agent with a higher level of understanding of the target task. This paper provides a systematic description and summary of the corresponding improvements of these two types of classical method machines. First, this paper briefly describes the basic algorithms of classical deep reinforcement learning, including the Monte Carlo algorithm, the Q-Learning algorithm, and the most primitive deep Q network. Then the machine improvement method of deep reinforcement learning method based on value function and strategy gradient is introduced. And then the applications of deep reinforcement learning in robot control, algorithm parameter optimization and other fields are outlined. Finally, the future of deep reinforcement learning is envisioned based on the current limitations of deep reinforcement learning.
引用
收藏
页码:239 / 255
页数:17
相关论文
共 50 条
  • [31] Hypernetwork Dismantling via Deep Reinforcement Learning
    Yan, Dengcheng
    Xie, Wenxin
    Zhang, Yiwen
    He, Qiang
    Yang, Yun
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2022, 9 (05): : 3302 - 3315
  • [32] Deep Reinforcement Learning for Job Scheduling on Cluster
    Yao, Zhenjie
    Chen, Lan
    Zhang, He
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT IV, 2021, 12894 : 613 - 624
  • [33] A Review of Deep Reinforcement Learning Theory and Application
    Wan L.
    Lan X.
    Zhang H.
    Zheng N.
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2019, 32 (01): : 67 - 81
  • [34] A deep actor critic reinforcement learning framework for learning to rank
    Padhye, Vaibhav
    Lakshmanan, Kailasam
    NEUROCOMPUTING, 2023, 547
  • [35] A Survey on Deep Reinforcement Learning
    Liu Q.
    Zhai J.-W.
    Zhang Z.-Z.
    Zhong S.
    Zhou Q.
    Zhang P.
    Xu J.
    2018, Science Press (41): : 1 - 27
  • [36] Double Deep Reinforcement Learning
    Kiefer, Josue
    Dorer, Klaus
    2023 IEEE INTERNATIONAL CONFERENCE ON AUTONOMOUS ROBOT SYSTEMS AND COMPETITIONS, ICARSC, 2023, : 17 - 22
  • [37] Coevolutionary Deep Reinforcement Learning
    Cotton, David
    Traish, Jason
    Chaczko, Zenon
    2020 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2020, : 2600 - 2607
  • [38] Deep reinforcement learning: a survey
    Hao-nan Wang
    Ning Liu
    Yi-yun Zhang
    Da-wei Feng
    Feng Huang
    Dong-sheng Li
    Yi-ming Zhang
    Frontiers of Information Technology & Electronic Engineering, 2020, 21 : 1726 - 1744
  • [39] Deep reinforcement learning: a survey
    Wang, Hao-nan
    Liu, Ning
    Zhang, Yi-yun
    Feng, Da-wei
    Huang, Feng
    Li, Dong-sheng
    Zhang, Yi-ming
    FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2020, 21 (12) : 1726 - 1744
  • [40] Energy Conservation for Internet of Things Tracking Applications Using Deep Reinforcement Learning
    Sultan, Salman Md
    Waleed, Muhammad
    Pyun, Jae-Young
    Um, Tai-Won
    SENSORS, 2021, 21 (09)