Towards a neuroscience of active sampling and curiosity

被引:224
作者
Gottlieb, Jacqueline [1 ,2 ,3 ]
Oudeyer, Pierre-Yves [4 ,5 ]
机构
[1] Columbia Univ, Dept Neurosci, New York, NY 10027 USA
[2] Columbia Univ, Kavli Inst Brain Sci, New York, NY 10027 USA
[3] Columbia Univ, Mortimer B Zuckerman Mind Brain Behav Inst, New York, NY 10027 USA
[4] INRIA, Bordeaux, France
[5] Ensta ParisTech, Paris, France
关键词
INFORMATION PREDICTION ERRORS; DECISION-MAKING; EPISTEMIC CURIOSITY; ATTENTION; REWARD; PARIETAL; CHOICE; UNCERTAINTY; SALIENCE; REINFORCEMENT;
D O I
10.1038/s41583-018-0078-0
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
In natural behaviour, animals actively interrogate their environments using endogenously generated 'question-and-answer' strategies. However, in laboratory settings participants typically engage with externally imposed stimuli and tasks, and the mechanisms of active sampling remain poorly understood. We review a nascent neuroscientific literature that examines active-sampling policies and their relation to attention and curiosity. We distinguish between information sampling, in which organisms reduce uncertainty relevant to a familiar task, and information search, in which they investigate in an open-ended fashion to discover new tasks. We review evidence that both sampling and search depend on individual preferences over cognitive states, including attitudes towards uncertainty, learning progress and types of information. We propose that, although these preferences are non-instrumental and can on occasion interfere with external goals, they are important heuristics that allow organisms to cope with the high complexity of both sampling and search, and generate curiosity-driven investigations in large, open environments in which rewards are sparse and ex ante unknown.
引用
收藏
页码:758 / 770
页数:13
相关论文
共 130 条
[31]   Intrinsically motivated oculomotor exploration guided by uncertainty reduction and conditioned reinforcement in non-human primates [J].
Daddaoua, Nabil ;
Lopes, Manuel ;
Gottlieb, Jacqueline .
SCIENTIFIC REPORTS, 2016, 6
[32]   Model-Based Influences on Humans' Choices and Striatal Prediction Errors [J].
Daw, Nathaniel D. ;
Gershman, Samuel J. ;
Seymour, Ben ;
Dayan, Peter ;
Dolan, Raymond J. .
NEURON, 2011, 69 (06) :1204-1215
[33]   The Emerging Neuroscience of Intrinsic Motivation: A New Frontier in Self-Determination Research [J].
Di Domenico, Stefano I. ;
Ryan, Richard M. .
FRONTIERS IN HUMAN NEUROSCIENCE, 2017, 11
[34]  
Ebitz RB, 2018, NEURON, V97, P450, DOI 10.1016/j.neuron.2017.12.007
[35]   Experimental testing of intrinsic preferences for NonInstrumental information [J].
Eliaz, Kfir ;
Schotter, Andrew .
AMERICAN ECONOMIC REVIEW, 2007, 97 (02) :166-169
[36]   Humans integrate visual and haptic information in a statistically optimal fashion [J].
Ernst, MO ;
Banks, MS .
NATURE, 2002, 415 (6870) :429-433
[37]   An information theory account of cognitive control [J].
Fan, Jin .
FRONTIERS IN HUMAN NEUROSCIENCE, 2014, 8
[38]   Neurobiological basis of individual variation in stimulus-reward learning [J].
Flagel, Shelly B. ;
Robinson, Terry E. .
CURRENT OPINION IN BEHAVIORAL SCIENCES, 2017, 13 :178-185
[39]   Self-Evaluation of Decision-Making: A General Bayesian Framework for Metacognitive Computation [J].
Fleming, Stephen M. ;
Daw, Nathaniel D. .
PSYCHOLOGICAL REVIEW, 2017, 124 (01) :91-114
[40]   Parietal neurons encode expected gains in instrumental information [J].
Foley, Nicholas C. ;
Kelly, Simon P. ;
Mhatre, Himanshu ;
Lopes, Manuel ;
Gottlieb, Jacqueline .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2017, 114 (16) :E3315-E3323