The exploration of unknown environments populated with entities by a surprise-curiosity-based agent

Cited by: 10
Authors
Macedo, L. [1]
Cardoso, A. [1]
Affiliations
[1] Univ Coimbra, Ctr Informat & Syst, P-3030 Coimbra, Portugal
Source
Cognitive Systems Research | 2012 / Vol. 19-20
Keywords
Exploration of unknown environments; Surprise; Curiosity; Motivational agents; Belief-Desire-Intention agents; Intrinsic motivation; Incentives
DOI
10.1016/j.cogsys.2012.04.003
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
We describe a Belief-Desire-Intention-like architecture for an explorer agent in which the psychological constructs of surprise and curiosity play an important role in decision-making, particularly in the selection of viewpoints while exploring unknown environments. Drawing on previous studies of the psychological constructs involved in exploratory behaviour, the agent is equipped in advance with the basic desires for maximal information gain (to reduce curiosity) and maximal surprise. However, to reflect Berlyne's theory that the tendency to explore the environment occurs in the absence of known drives, we also considered the basic desire for minimal hunger as a representative example of the additional basic desires that can restrain exploration. This surprise-curiosity-based exploration strategy was compared with a "cold" classical exploration strategy in environments populated with entities. The results indicate that the classical strategy slightly outperforms the surprise-curiosity-based one on the time/energy required to explore the entire environment and on the time/energy required to explore all the entities. However, the surprise-curiosity-based strategy outperforms the classical one on the time/energy required to explore all different entities and, consequently and with stronger evidence, on the number of steps (trips between two entities) required to explore all different entities. This is a valuable result for resource-bounded, active learning agents that benefit from choosing the more informative data from which to learn while ignoring time-consuming/expensive, redundant data. It is confirmed by an analysis of the agents' behaviour along the paths they traversed in the environment. The experiment also provided results on the robustness of the surprise-curiosity-based approach by assessing the influence of surprise and curiosity in several environments of different complexity and with different amplitudes of the agent's visual field. © 2012 Elsevier B.V. All rights reserved.
Pages: 62-87
Page count: 26
Related papers
126 in total
  • [1] Amat J., 1997, Experimental Robotics IV. 4th International Symposium, P40, DOI 10.1007/BFb0035195
  • [2] Anguelov D., 2002, Conference on Uncertainty in Artificial Intelligence, P10
  • [3] [Anonymous], 2001, P 23 ANN C COGN SCI
  • [4] [Anonymous], 1981, Motivation and Personality
  • [5] [Anonymous], 2002, Exploring Artificial Intelligence in the New Millennium, DOI 10.5555/779343.779345
  • [6] Aristotle, 1953, The Nicomachean ethics
  • [7] Bartlett F. C., 1932, Remembering: A study in experimental and social psychology, DOI 10.1111/j.2044-8279.1933.tb02913.x
  • [8] Barto AG, 2004, INT C DEV LEARN EP R
  • [9] Bentham Jeremy, 1789, The Works of Jeremy Bentham
  • [10] Berlyne D., 1967, NEBRASKA S MOTIVATIO, P1