Realization of an Adaptive Memetic Algorithm Using Differential Evolution and Q-Learning: A Case Study in Multirobot Path Planning

被引：118

作者：

Rakshit, Pratyusha ^{[1
]}

Konar, Amit ^{[1
]}

Bhowmik, Pavel ^{[1
]}

Goswami, Indrani ^{[1
]}

Das, Swagatam ^{[2
]}

Jain, Lakhmi C. ^{[3
]}

Nagar, Atulya K. ^{[4
]}

机构：

[1] Jadavpur Univ, Elect & Telecommun Engn Dept, Kolkata 700032, India

[2] Indian Stat Inst, Elect & Commun Sci Unit, Kolkata 700108, India

[3] Univ S Australia, Dept Elect & Elect Engn, Adelaide, SA 5000, Australia

[4] Liverpool Hope Univ, Dept Math & Comp Sci, Liverpool L16 9JD, Merseyside, England

来源：

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS | 2013年 / 43卷 / 04期

关键词：

Adaptive memetic algorithm (MA); differential evolution (DE); multirobot path planning; Q-learning;

D O I：

10.1109/TSMCA.2012.2226024

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Memetic algorithms (MAs) are population-based meta-heuristic search algorithms that combine the composite benefits of natural and cultural evolutions. An adaptive MA (AMA) incorporates an adaptive selection of memes (units of cultural transmission) from a meme pool to improve the cultural characteristics of the individual member of a population-based search algorithm. This paper presents a novel approach to design an AMA by utilizing the composite benefits of differential evolution (DE) for global search and Q-learning for local refinement. Four variants of DE, including the currently best self-adaptive DE algorithm, have been used here to study the relative performance of the proposed AMA with respect to runtime, cost function evaluation, and accuracy (offset in cost function from the theoretical optimum after termination of the algorithm). Computer simulations performed on a well-known set of 25 benchmark functions reveal that incorporation of Q-learning in one popular and one outstanding variants of DE makes the corresponding algorithm more efficient in both runtime and accuracy. The performance of the proposed AMA has been studied on a real-time multirobot path-planning problem. Experimental results obtained for both simulation and real frameworks indicate that the proposed algorithm-based path-planning scheme outperforms the real-coded genetic algorithm, particle swarm optimization, and DE, particularly its currently best version with respect two standard metrics defined in the literature.

引用

页码：814 / 831

页数：18

共 72 条

[1]

Angeline P. J., 1998, Evolutionary Programming VII. 7th International Conference, EP98. Proceedings, P601, DOI 10.1007/BFb0040811

[2]

[Anonymous], THESIS JADAVPUR U IN

[3]

[Anonymous], P 2 NAT C REC TRENDS

[4]

[Anonymous], 2005, PROBLEM DEFINITIONS

[5]

[Anonymous], 2006, ICML

[6]

[Anonymous], FRONTIERS EVOLUTIONA

[7]

[Anonymous], NEW ACHIEVEMENTS EVO

[8]

[Anonymous], 2008, ADV DIFFERENTIAL EVO

[9]

[Anonymous], P 28 ANN APICS C MAT

[10]

[Anonymous], 1993, MACHINE LEARNING MET

← 1 2 3 4 5 6 7 8 →