Knowledge guided fuzzy deep reinforcement learning

被引：0

作者：

Qin, Peng ^{[1
]}

Zhao, Tao ^{[1
]}

机构：

[1] Sichuan Univ, Coll Elect Engn, Chengdu 610065, Peoples R China

来源：

EXPERT SYSTEMS WITH APPLICATIONS | 2025年 / 264卷

基金：

中国国家自然科学基金;

关键词：

Knowledge guide; Fuzzy system; Reinforcement learning; Deep Q-network;

D O I：

10.1016/j.eswa.2024.125823

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Reinforcement learning (RL) addresses complex sequential decision-making problems through interactive trial- and-error and the handling of delayed rewards. However, reinforcement learning typically starts from scratch, necessitating extensive exploration, which results in low learning efficiency. In contrast, humans often leverage prior knowledge to learn. Inspired by this, this paper proposes a semantic knowledge-guided reinforcement learning method (KFDQN), which fully utilizes knowledge to influence reinforcement learning, thereby improving learning efficiency, training stability, and performance. In terms of knowledge representation, considering the strong fuzziness of semantic knowledge, a fuzzy system is constructed to represent this knowledge. In terms of knowledge integration, a knowledge-guided framework that integrates a hybrid action selection strategy (HYAS), a hybrid learning method (HYL), and knowledge updating is constructed in conjunction with the existing reinforcement learning framework. The HYAS integrates knowledge into action selection, reducing the randomness of traditional exploration methods. The HYL incorporates knowledge into the learning target, thereby reducing uncertainty in the learning objective. Knowledge updating ensures that new data is utilized to update knowledge, avoiding the negative impact of knowledge limitations on the learning process. The algorithm is validated through numerical tasks in OpenAI Gym and real-world mobile robot Goal Reach and obstacle avoidance tasks. The results confirm that the algorithm effectively combines knowledge and reinforcement learning, resulting in a 28.6% improvement in learning efficiency, a 19.56% enhancement in performance, and increased training stability.

引用

页数：17

共 50 条

[31] Multiple Target Prediction for Deep Reinforcement Learning
Chien, Jen-Tzung
Hung, Po-Yen
2020 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2020, : 1611 - 1616
[32] Autonomous exploration through deep reinforcement learning
Yan, Xiangda
Huang, Jie
He, Keyan
Hong, Huajie
Xu, Dasheng
INDUSTRIAL ROBOT-THE INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH AND APPLICATION, 2023, 50 (05): : 793 - 803
[33] Path planning in an unknown environment based on deep reinforcement learning with prior knowledge
Lou, Ping
Xu, Kun
Jiang, Xuemei
Xiao, Zheng
Yan, Junwei
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 41 (06) : 5773 - 5789
[34] Efficiently Mastering the Game of NoGo with Deep Reinforcement Learning Supported by Domain Knowledge
Gao, Yifan
Wu, Lezhou
ELECTRONICS, 2021, 10 (13)
[35] Transfer Learning in Deep Reinforcement Learning
Islam, Tariqul
Abid, Dm. Mehedi Hasan
Rahman, Tanvir
Zaman, Zahura
Mia, Kausar
Hossain, Ramim
PROCEEDINGS OF SEVENTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, ICICT 2022, VOL 1, 2023, 447 : 145 - 153
[36] A3C Deep Reinforcement Learning Model Compression and Knowledge Extraction
Zhang J.
Wang Z.
Ren Y.
Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2023, 60 (06): : 1373 - 1384
[37] Reinforcement Learning in Continuous Spaces by Using Learning Fuzzy Classifier Systems
Chen, Gang
Douch, Colin
Zhang, Mengjie
Pang, Shaoning
NEURAL INFORMATION PROCESSING, PT II, 2015, 9490 : 320 - 328
[38] Guided Soft Actor Critic: A Guided Deep Reinforcement Learning Approach for Partially Observable Markov Decision Processes
Haklidir, Mehmet
Temeltas, Hakan
IEEE ACCESS, 2021, 9 : 159672 - 159683
[39] Automated Knowledge Base Completion Using Collaborative Filtering and Deep Reinforcement Learning
Tortay, Alisher
Lee, Jee Hang
Lee, Chang Hwa
Lee, Sang Wan
2018 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2018, : 3069 - 3074
[40] Acquisition of Automated Guided Vehicle Route Planning Policy Using Deep Reinforcement Learning
Kamoshida, Ryota
Kazama, Yoriko
2017 6TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED LOGISTICS AND TRANSPORT (ICALT), 2017, : 1 - 6

← 1 2 3 4 5 →