Predictive information and explorative behavior of autonomous robots

被引:95
作者
Ay, N. [1 ,3 ]
Bertschinger, N. [1 ]
Der, R. [1 ]
Guettler, F. [2 ]
Olbrich, E. [1 ]
机构
[1] Max Planck Inst Math Sci Leipzig, D-04009 Leipzig, Germany
[2] Univ Leipzig, D-04009 Leipzig, Germany
[3] Santa Fe Inst, Santa Fe, NM 87501 USA
关键词
89.70.Cf Entropy and other measures of information; 87.19.lo Information theory; 87.85.St Robotics;
D O I
10.1140/epjb/e2008-00175-0
中图分类号
O469 [凝聚态物理学];
学科分类号
070205 ;
摘要
Measures of complexity are of immediate interest for the field of autonomous robots both as a means to classify the behavior and as an objective function for the autonomous development of robot behavior. In the present paper we consider predictive information in sensor space as a measure for the behavioral complexity of a two-wheel embodied robot moving in a rectangular arena with several obstacles. The mutual information (MI) between past and future sensor values is found empirically to have a maximum for a behavior which is both explorative and sensitive to the environment. This makes predictive information a prospective candidate as an objective function for the autonomous development of such behaviors. We derive theoretical expressions for the MI in order to obtain an explicit update rule for the gradient ascent dynamics. Interestingly, in the case of a linear or linearized model of the sensorimotor dynamics the structure of the learning rule derived depends only on the dynamical properties while the value of the MI influences only the learning rate. In this way the problem of the prohibitively large sampling times for information theoretic measures can be circumvented. This result can be generalized and may help to derive explicit learning rules from complexity theoretic measures.
引用
收藏
页码:329 / 339
页数:11
相关论文
共 25 条
  • [1] [Anonymous], 2005, P AAAI SPRING S DEVE
  • [2] Predictability, complexity, and learning
    Bialek, W
    Nemenman, I
    Tishby, N
    [J]. NEURAL COMPUTATION, 2001, 13 (11) : 2409 - 2463
  • [3] Box G.E. P., 1994, Time Series Analysis: Forecasting Control, V3rd
  • [4] Cover T.M., 1991, WILEY SER TELECOMMUN
  • [5] INFERRING STATISTICAL COMPLEXITY
    CRUTCHFIELD, JP
    YOUNG, K
    [J]. PHYSICAL REVIEW LETTERS, 1989, 63 (02) : 105 - 108
  • [6] Der R, 2001, THEOR BIOSCI, V120, P179
  • [7] Der R, 1999, CONCUR SYST ENGN SER, V55, P43
  • [8] DER R, 2005, J ADAPTIVE BEHAV, V14, P105
  • [9] Der R., 2002, P WORKSH BIOL INSP R
  • [10] DER R, 2006, ARTIFICIAL LIFE 10, P192