Qualitative case-based reasoning and learning

被引:23
作者
Donadon Homem, Thiago Pedro [1 ,2 ]
Santos, Paulo Eduardo [2 ,5 ]
Reali Costa, Anna Helena [3 ]
da Costa Bianchi, Reinaldo Augusto [2 ]
de Mantaras, Ramon Lopez [4 ]
机构
[1] IFSP Fed Inst Sao Paulo, Sao Paulo, SP, Brazil
[2] FEI Univ Ctr FEI, Sao Bernardo Do Campo, SP, Brazil
[3] Univ Sao Paulo, Sao Paulo, SP, Brazil
[4] CSIC Spanish Natl Res Council, Barcelona, Spain
[5] Flinders Univ S Australia, Sch Sci & Technol, Adelaide, SA, Australia
基金
巴西圣保罗研究基金会; 欧盟地平线“2020”;
关键词
Case-based reasoning; Qualitative spatial reasoning; Reinforcement learning; Robot soccer; SOCCER; KNOWLEDGE;
D O I
10.1016/j.artint.2020.103258
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The development of autonomous agents that perform tasks with the same dexterity as performed by humans is one of the challenges of artificial intelligence and robotics. This motivates the research on intelligent agents, since the agent must choose the best action in a dynamic environment in order to maximise the final score. In this context, the present paper introduces a novel algorithm for Qualitative Case-Based Reasoning and Learning (QCBRL), which is a case-based reasoning system that uses qualitative spatial representations to retrieve and reuse cases by means of relations between objects in the environment. Combined with reinforcement learning, QCBRL allows the agent to learn new qualitative cases at runtime, without assuming a pre-processing step. In order to avoid cases that do not lead to the maximum performance, QCBRL executes case-base maintenance, excluding these cases and obtaining new (more suitable) ones. Experimental evaluation of QCBRL was conducted in a simulated robot-soccer environment, in a real humanoid-robot environment and on simple tasks in two distinct gridworld domains. Results show that QCBRL outperforms traditional RL methods. As a result of running QCBRL in autonomous soccer matches, the robots performed a higher average number of goals than those obtained when using pure numerical models. In the gridworlds considered, the agent was able to learn optimal and safety policies. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页数:23
相关论文
共 72 条
[1]  
AAMODT A, 1994, AI COMMUN, V7, P39
[2]   Efficient interactive decision-making framework for robotic applications [J].
Agostini, Alejandro ;
Torras, Carme ;
Woergoetter, Florentin .
ARTIFICIAL INTELLIGENCE, 2017, 247 :187-212
[3]   Autonomous agents modelling other agents: A comprehensive survey and open problems [J].
Albrecht, Stefano V. ;
Stone, Peter .
ARTIFICIAL INTELLIGENCE, 2018, 258 :66-95
[4]  
[Anonymous], 1994, TECHNICAL REPORT
[5]  
[Anonymous], 1989, LEARNING DELAYED REW
[6]  
[Anonymous], 2017 LAT AM ROB S LA
[7]  
[Anonymous], 2017, ICCBR WORKSH
[8]  
[Anonymous], 2007 NACE C NACE INT
[9]  
[Anonymous], 2015, CORR
[10]  
[Anonymous], ROBOCUP SOCC SIM