Knowledge guided fuzzy deep reinforcement learning

被引:0
|
作者
Qin, Peng [1 ]
Zhao, Tao [1 ]
机构
[1] Sichuan Univ, Coll Elect Engn, Chengdu 610065, Peoples R China
基金
中国国家自然科学基金;
关键词
Knowledge guide; Fuzzy system; Reinforcement learning; Deep Q-network;
D O I
10.1016/j.eswa.2024.125823
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reinforcement learning (RL) addresses complex sequential decision-making problems through interactive trial- and-error and the handling of delayed rewards. However, reinforcement learning typically starts from scratch, necessitating extensive exploration, which results in low learning efficiency. In contrast, humans often leverage prior knowledge to learn. Inspired by this, this paper proposes a semantic knowledge-guided reinforcement learning method (KFDQN), which fully utilizes knowledge to influence reinforcement learning, thereby improving learning efficiency, training stability, and performance. In terms of knowledge representation, considering the strong fuzziness of semantic knowledge, a fuzzy system is constructed to represent this knowledge. In terms of knowledge integration, a knowledge-guided framework that integrates a hybrid action selection strategy (HYAS), a hybrid learning method (HYL), and knowledge updating is constructed in conjunction with the existing reinforcement learning framework. The HYAS integrates knowledge into action selection, reducing the randomness of traditional exploration methods. The HYL incorporates knowledge into the learning target, thereby reducing uncertainty in the learning objective. Knowledge updating ensures that new data is utilized to update knowledge, avoiding the negative impact of knowledge limitations on the learning process. The algorithm is validated through numerical tasks in OpenAI Gym and real-world mobile robot Goal Reach and obstacle avoidance tasks. The results confirm that the algorithm effectively combines knowledge and reinforcement learning, resulting in a 28.6% improvement in learning efficiency, a 19.56% enhancement in performance, and increased training stability.
引用
收藏
页数:17
相关论文
共 50 条
  • [31] Multiple Target Prediction for Deep Reinforcement Learning
    Chien, Jen-Tzung
    Hung, Po-Yen
    2020 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2020, : 1611 - 1616
  • [32] Autonomous exploration through deep reinforcement learning
    Yan, Xiangda
    Huang, Jie
    He, Keyan
    Hong, Huajie
    Xu, Dasheng
    INDUSTRIAL ROBOT-THE INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH AND APPLICATION, 2023, 50 (05): : 793 - 803
  • [33] Path planning in an unknown environment based on deep reinforcement learning with prior knowledge
    Lou, Ping
    Xu, Kun
    Jiang, Xuemei
    Xiao, Zheng
    Yan, Junwei
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 41 (06) : 5773 - 5789
  • [34] Efficiently Mastering the Game of NoGo with Deep Reinforcement Learning Supported by Domain Knowledge
    Gao, Yifan
    Wu, Lezhou
    ELECTRONICS, 2021, 10 (13)
  • [35] Transfer Learning in Deep Reinforcement Learning
    Islam, Tariqul
    Abid, Dm. Mehedi Hasan
    Rahman, Tanvir
    Zaman, Zahura
    Mia, Kausar
    Hossain, Ramim
    PROCEEDINGS OF SEVENTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, ICICT 2022, VOL 1, 2023, 447 : 145 - 153
  • [36] A3C Deep Reinforcement Learning Model Compression and Knowledge Extraction
    Zhang J.
    Wang Z.
    Ren Y.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2023, 60 (06): : 1373 - 1384
  • [37] Reinforcement Learning in Continuous Spaces by Using Learning Fuzzy Classifier Systems
    Chen, Gang
    Douch, Colin
    Zhang, Mengjie
    Pang, Shaoning
    NEURAL INFORMATION PROCESSING, PT II, 2015, 9490 : 320 - 328
  • [38] Guided Soft Actor Critic: A Guided Deep Reinforcement Learning Approach for Partially Observable Markov Decision Processes
    Haklidir, Mehmet
    Temeltas, Hakan
    IEEE ACCESS, 2021, 9 : 159672 - 159683
  • [39] Automated Knowledge Base Completion Using Collaborative Filtering and Deep Reinforcement Learning
    Tortay, Alisher
    Lee, Jee Hang
    Lee, Chang Hwa
    Lee, Sang Wan
    2018 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2018, : 3069 - 3074
  • [40] Acquisition of Automated Guided Vehicle Route Planning Policy Using Deep Reinforcement Learning
    Kamoshida, Ryota
    Kazama, Yoriko
    2017 6TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED LOGISTICS AND TRANSPORT (ICALT), 2017, : 1 - 6