Genetic Scheduling and Reinforcement Learning in Multirobot Systems for Intelligent Warehouses

被引:41
作者
Dou, Jiajia [1 ,2 ]
Chen, Chunlin [1 ,2 ]
Yang, Pei [1 ,2 ]
机构
[1] Nanjing Univ, Sch Management & Engn, Dept Control & Syst Engn, Nanjing 210093, Jiangsu, Peoples R China
[2] Nanjing Univ, Res Ctr Novel Technol Intelligent Equipments, Nanjing 210093, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
ROBOT NAVIGATION; ALGORITHM;
D O I
10.1155/2015/597956
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
A new hybrid solution is presented to improve the efficiency of intelligent warehouses with multirobot systems, where the genetic algorithm (GA) based task scheduling is combined with reinforcement learning (RL) based path planning for mobile robots. Reinforcement learning is an effective approach to search for a collision-free path in unknown dynamic environments. Genetic algorithm is a simple but splendid evolutionary search method that provides very good solutions for task allocation. In order to achieve higher efficiency of the intelligent warehouse system, we design a new solution by combining these two techniques and provide an effective and alternative way compared with other state-of-the-art methods. Simulation results demonstrate the effectiveness of the proposed approach regarding the optimization of travel time and overall efficiency of the intelligent warehouse system.
引用
收藏
页数:10
相关论文
共 29 条
[1]  
[Anonymous], 2004, COLLECTION TECHNICAL, DOI DOI 10.2514/6.2004-6229
[2]  
[Anonymous], 1998, Reinforcement Learning: An Introduction
[3]  
[Anonymous], P IEEE INT C SYST MA
[4]  
[Anonymous], 2011, IT CONVERGENCE SERVI
[5]   A comprehensive survey of multiagent reinforcement learning [J].
Busoniu, Lucian ;
Babuska, Robert ;
De Schutter, Bart .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2008, 38 (02) :156-172
[6]   Hybrid control for robot navigation - A hierarchical Q-learning algorithm [J].
Chen, Chunlin ;
Li, Han-Xiong ;
Dong, Daoyi .
IEEE ROBOTICS & AUTOMATION MAGAZINE, 2008, 15 (02) :37-47
[7]   Fidelity-Based Probabilistic Q-Learning for Control of Quantum Systems [J].
Chen, Chunlin ;
Dong, Daoyi ;
Li, Han-Xiong ;
Chu, Jian ;
Tarn, Tzyh-Jong .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 25 (05) :920-933
[8]   Hybrid MDP based integrated hierarchical Q-learning [J].
Chen ChunLin ;
Dong DaoYi ;
Li Han-Xiong ;
Tarn, Tzyh-Jong .
SCIENCE CHINA-INFORMATION SCIENCES, 2011, 54 (11) :2279-2294
[9]  
D'Andrea R, 2008, 2008 IEEE INTERNATIONAL CONFERENCE ON TECHNOLOGIES FOR PRACTICAL ROBOT APPLICATIONS, P80, DOI 10.1109/TEPRA.2008.4686677
[10]   Design and control of warehouse order picking: A literature review [J].
de Koster, Rene ;
Le-Duc, Tho ;
Roodbergen, Kees Jan .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2007, 182 (02) :481-501