Uncertainty-driven regulation of learning and exploration in adolescents: A computational account

被引:34
作者
Jepma, Marieke [1 ]
Schaaf, Jessica V. [1 ]
Visser, Ingmar [1 ]
Huizenga, Hilde M. [1 ]
机构
[1] Univ Amsterdam, Dept Psychol, Amsterdam, Netherlands
基金
美国国家科学基金会;
关键词
MEDIAL PREFRONTAL CORTEX; RISK-TAKING; DECISION-MAKING; BRAIN; MODEL; PERFORMANCE; STRIATUM; LEARNERS; SUBREGIONS; RESPONSES;
D O I
10.1371/journal.pcbi.1008276
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Author summary To successfully learn the value of stimuli and actions, people should take into account their current (un)certainty about these values: Learning rates and exploration should be high when one's value estimates are highly uncertain (in the beginning of learning), and decrease over time as evidence accumulates and uncertainty decreases. Recent studies have shown that healthy adults flexibly adapt their learning strategies based on ongoing changes in uncertainty, consistent with normative learning. However, the development of this ability prior to adulthood is yet unknown, as developmental learning studies have not considered trial-to-trial changes in uncertainty. Here, we show that adolescents, as compared to adults, showed a smaller decrease in both learning rate and exploration over time. Computational modeling revealed that both of these effects were due to adolescents overestimating the amount of environmental volatility, which made them more sensitive to recent relative to older evidence. The overestimation of volatility during adolescence may represent the rapidly changing environmental demands during this developmental period, and can help understand the surge in real-life risk taking and exploratory behaviours characteristic of adolescents. Healthy adults flexibly adapt their learning strategies to ongoing changes in uncertainty, a key feature of adaptive behaviour. However, the developmental trajectory of this ability is yet unknown, as developmental studies have not incorporated trial-to-trial variation in uncertainty in their analyses or models. To address this issue, we compared adolescents' and adults' trial-to-trial dynamics of uncertainty, learning rate, and exploration in two tasks that assess learning in noisy but otherwise stable environments. In an estimation task-which provides direct indices of trial-specific learning rate-both age groups reduced their learning rate over time, as self-reported uncertainty decreased. Accordingly, the estimation data in both groups was better explained by a Bayesian model with dynamic learning rate (Kalman filter) than by conventional reinforcement-learning models. Furthermore, adolescents' learning rates asymptoted at a higher level, reflecting an over-weighting of the most recent outcome, and the estimated Kalman-filter parameters suggested that this was due to an overestimation of environmental volatility. In a choice task, both age groups became more likely to choose the higher-valued option over time, but this increase in choice accuracy was smaller in the adolescents. In contrast to the estimation task, we found no evidence for a Bayesian expectation-updating process in the choice task, suggesting that estimation and choice tasks engage different learning processes. However, our modeling results of the choice task suggested that both age groups reduced their degree of exploration over time, and that the adolescents explored overall more than the adults. Finally, age-related differences in exploration parameters from fits to the choice data were mediated by participants' volatility parameter from fits to the estimation data. Together, these results suggest that adolescents overestimate the rate of environmental change, resulting in elevated learning rates and increased exploration, which may help understand developmental changes in learning and decision-making.
引用
收藏
页数:29
相关论文
共 65 条
[1]   Knowing how much you don't know: a neural organization of uncertainty estimates [J].
Bach, Dominik R. ;
Dolan, Raymond J. .
NATURE REVIEWS NEUROSCIENCE, 2012, 13 (08) :572-586
[2]   Fitting Linear Mixed-Effects Models Using lme4 [J].
Bates, Douglas ;
Maechler, Martin ;
Bolker, Benjamin M. ;
Walker, Steven C. .
JOURNAL OF STATISTICAL SOFTWARE, 2015, 67 (01) :1-48
[3]   Learning the value of information in an uncertain world [J].
Behrens, Timothy E. J. ;
Woolrich, Mark W. ;
Walton, Mark E. ;
Rushworth, Matthew F. S. .
NATURE NEUROSCIENCE, 2007, 10 (09) :1214-1221
[4]   Developmental Changes in Learning: Computational Mechanisms and Social Influences [J].
Bolenz, Florian ;
Reiter, Andrea M. F. ;
Ben Eppinger .
FRONTIERS IN PSYCHOLOGY, 2017, 8
[5]   Separate amygdala subregions signal surprise and predictiveness during associative fear learning in humans [J].
Boll, Sabrina ;
Gamer, Matthias ;
Gluth, Sebastian ;
Finsterbusch, Juergen ;
Buechel, Christian .
EUROPEAN JOURNAL OF NEUROSCIENCE, 2013, 37 (05) :758-767
[6]   A contribution of cognitive decision models to clinical assessment: Decomposing performance on the bechara gambling task [J].
Busemeyer, JR ;
Stout, JC .
PSYCHOLOGICAL ASSESSMENT, 2002, 14 (03) :253-262
[7]   Neural and Psychological Maturation of Decision-making in Adolescence and Young Adulthood [J].
Christakou, Anastasia ;
Gershman, Samuel J. ;
Niv, Yael ;
Simmons, Andrew ;
Brammer, Mick ;
Rubia, Katya .
JOURNAL OF COGNITIVE NEUROSCIENCE, 2013, 25 (11) :1807-1823
[8]   A unique adolescent response to reward prediction errors [J].
Cohen, Jessica R. ;
Asarnow, Robert F. ;
Sabb, Fred W. ;
Bilder, Robert M. ;
Bookheimer, Susan Y. ;
Knowlton, Barbara J. ;
Poldrack, Russell A. .
NATURE NEUROSCIENCE, 2010, 13 (06) :669-671
[9]   Should I stay or should I go? How the human brain manages the trade-off between exploitation and exploration [J].
Cohen, Jonathan D. ;
McClure, Samuel M. ;
Yu, Angela J. .
PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2007, 362 (1481) :933-942
[10]   Understanding adolescence as a period of social-affective engagement and goal flexibility [J].
Crone, Eveline A. ;
Dahl, Ronald E. .
NATURE REVIEWS NEUROSCIENCE, 2012, 13 (09) :636-650