The Improved Q-Learning Algorithm based on Pheromone Mechanism for Swarm Robot System

Cited by: 0

Authors:

Shi, Zhiguo [1,2]

Tu, Jun [1]

Zhang, Qiao [1]

Zhang, Xiaomeng [1]

Wei, Junming [3]

Affiliations:

[1] Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Beijing 100083, Peoples R China

[2] Ryerson Univ, Dept Aerosp Engn, Toronto, ON M5B 2K3, Canada

[3] Australian Natl Univ, ANU Coll Engn & Comp Sci, Canberra, ACT 2601, Australia

Source:

2013 32ND CHINESE CONTROL CONFERENCE (CCC) | 2013

Funding:

Beijing Natural Science Foundation;

Keywords:

Swarm robotics system; Distribute reinforcement learning; Q-Learning; Pheromone mechanism;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

Reinforcement learning is widely applicable to path planning, motion control, and other aspects of mobile robotics, owing both to its convergence properties and to its simple implementation; the typical reinforcement learning method is Q-Learning. This paper proposes improvements that address shortcomings of Q-Learning by using the pheromone mechanism of the ant colony algorithm to solve the information-sharing problem in the reinforcement learning system. The improved Q-Learning algorithm is then simulated on the Player/Stage platform. The results are compared with those of the Q-Learning algorithm and the PSO algorithm, and they show that the improved Q-Learning is highly efficient for path planning in swarm robotics.
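The abstract does not give the update rules, so the following is only an illustrative sketch of how a shared pheromone table could supply information sharing among several tabular Q-Learning agents. It is not the authors' algorithm. The grid world, the reward values, and every name and parameter (`train`, `choose_action`, `beta`, `rho`, `deposit`) are assumptions introduced for illustration.

```python
import random
from collections import defaultdict

ACTIONS = [(0, 1), (0, -1), (1, 0), (-1, 0)]  # right, left, down, up on a grid


def q_update(q, state, action, reward, next_state, alpha=0.5, gamma=0.9):
    """Standard one-step Q-Learning update on a tabular Q function."""
    best_next = max(q[(next_state, a)] for a in ACTIONS)
    q[(state, action)] += alpha * (reward + gamma * best_next - q[(state, action)])


def choose_action(q, pheromone, state, rng, epsilon=0.1, beta=0.5):
    """Epsilon-greedy choice; the greedy score adds a pheromone bonus.

    Assumption: the bonus is a simple weighted sum, beta * pheromone.
    """
    if rng.random() < epsilon:
        return rng.choice(ACTIONS)
    return max(ACTIONS, key=lambda a: q[(state, a)] + beta * pheromone[(state, a)])


def step(state, action, size, goal):
    """Move on a size x size grid, clamping at the walls (hypothetical task)."""
    row = min(max(state[0] + action[0], 0), size - 1)
    col = min(max(state[1] + action[1], 0), size - 1)
    nxt = (row, col)
    return nxt, (1.0 if nxt == goal else -0.01)


def train(n_robots=3, size=4, episodes=200, rho=0.1, deposit=1.0, seed=0):
    """Each robot keeps its own Q table; all robots share one pheromone table."""
    rng = random.Random(seed)
    goal = (size - 1, size - 1)
    pheromone = defaultdict(float)  # shared by the whole swarm
    q_tables = [defaultdict(float) for _ in range(n_robots)]
    for _ in range(episodes):
        for q in q_tables:
            state, path = (0, 0), []
            for _ in range(4 * size * size):  # cap on episode length
                action = choose_action(q, pheromone, state, rng)
                nxt, reward = step(state, action, size, goal)
                q_update(q, state, action, reward, nxt)
                path.append((state, action))
                state = nxt
                if state == goal:
                    # Successful robot marks its path; shorter paths deposit more.
                    for state_action in set(path):
                        pheromone[state_action] += deposit / len(path)
                    break
        # Evaporation, as in ant colony methods, so stale trails fade.
        for state_action in list(pheromone):
            pheromone[state_action] *= 1.0 - rho
    return q_tables, pheromone
```

Calling `train()` returns the per-robot Q tables and the shared pheromone table. In this sketch the pheromone only biases action selection and leaves the Q-Learning update itself unchanged; other couplings are equally plausible.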


Pages: 6033-6038

Page count: 6
