Modeling the effects of motivation on choice and learning in the basal ganglia

Cited by: 17

Authors:

van Swieten, Maaike M. H. [1]

Bogacz, Rafal [1]

Affiliations:

[1] University of Oxford, MRC Brain Network Dynamics Unit, Oxford, England

Source:

PLOS COMPUTATIONAL BIOLOGY | 2020 / Volume 16 / Issue 05

Funding:

Biotechnology and Biological Sciences Research Council (UK); Medical Research Council (UK);

Keywords:

PREDICTION ERRORS; DOPAMINE NEURONS; PHASIC DOPAMINE; BRAIN DOPAMINE; REWARD; STATE; MODULATION; ACTIVATION; FOOD; ACQUISITION;

DOI:

10.1371/journal.pcbi.1007465

Chinese Library Classification:

Q5 [Biochemistry];

Subject classification codes:

071010 ; 081704 ;

Abstract:

Author summary: Behaviour is made of decisions that are based on the evaluation of costs and benefits of potential actions in a given situation. Actions are usually generated in response to reinforcement cues, which are potent triggers of desires that can range from normal appetites to compulsive addictions. However, learned cues are not constant in their motivating power. Food cues are more potent when you are hungry than when you have just finished a meal. These changes in cue-triggered desire produced by a change in biological state present a challenge to many current computational models of motivation and learning. Here, we demonstrate concrete examples of how motivation can instantly modulate reinforcement values and actions. We propose an overarching framework of learning and action selection based on maintaining the physiological balance, to better capture the dynamic interaction between learning and physiology that controls the incentive salience mechanism of motivation for reinforcements. These models provide a unified account of state-dependent learning of the incentive value of actions and of selecting actions according to their learned positive and negative consequences and with respect to the physiological state. We propose a biological implementation of how these processes are controlled by an area in the brain called the basal ganglia, which is associated with error-driven learning.

Decision making relies on adequately evaluating the consequences of actions on the basis of past experience and the current physiological state. A key role in this process is played by the basal ganglia, where neural activity and plasticity are modulated by dopaminergic input from the midbrain. Internal physiological factors, such as hunger, scale signals encoded by dopaminergic neurons and thus alter the motivation for taking actions and learning. However, to our knowledge, no formal mathematical formulation exists for how a physiological state affects learning and action selection in the basal ganglia. We developed a framework for modelling the effect of motivation on choice and learning. The framework defines the motivation to obtain a particular resource as the difference between the desired and the current level of this resource, and proposes how the utility of reinforcements depends on the motivation. To account for dopaminergic activity previously recorded in different physiological states, we argue that the prediction error encoded in the dopaminergic activity needs to be redefined as the difference between utility and expected utility, which depends on both the objective reinforcement and the motivation. We also demonstrate a possible mechanism by which the evaluation and learning of the utility of actions can be implemented in the basal ganglia network. The presented theory brings together models of learning in the basal ganglia with the incentive salience theory in a single simple framework, and it provides mechanistic insight into how decision processes and learning in the basal ganglia are modulated by motivation. Moreover, this theory is consistent with data on the neural underpinnings of overeating and obesity, and it makes further experimental predictions.
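The definitions in the abstract can be illustrated with a minimal Python sketch. The abstract states only that motivation is the difference between the desired and the current level of a resource, and that the prediction error is utility minus expected utility. The quadratic utility used here (the reduction in squared distance from the desired level, halved) is an assumption chosen for illustration, and the function names and numeric values are hypothetical rather than taken from this record.

```python
def motivation(desired_level: float, current_level: float) -> float:
    """Motivation as stated in the abstract: desired minus current level."""
    return desired_level - current_level


def utility(reinforcement: float, m: float) -> float:
    """Utility of a reinforcement given motivation m.

    ASSUMPTION: a quadratic form, i.e. the drop in 0.5 * (distance to the
    desired level)^2 caused by receiving the reinforcement:
        0.5 * m^2 - 0.5 * (m - r)^2 = m * r - 0.5 * r^2
    """
    return m * reinforcement - 0.5 * reinforcement ** 2


def prediction_error(reinforcement: float, m: float, expected_utility: float) -> float:
    """Prediction error as stated in the abstract: utility minus expected utility."""
    return utility(reinforcement, m) - expected_utility


# Hypothetical numbers: the same food reward in a hungry and a sated state.
hungry = motivation(desired_level=10.0, current_level=2.0)  # large deficit
sated = motivation(desired_level=10.0, current_level=9.0)   # small deficit
reward = 2.0
expected = 5.0  # hypothetical learned expectation of utility

for label, m in (("hungry", hungry), ("sated", sated)):
    print(label, utility(reward, m), prediction_error(reward, m, expected))
```

Under this assumed form, the same objective reinforcement yields a higher utility and a positive prediction error when the deficit is large, and a lower utility and a negative prediction error when the deficit is small, which matches the qualitative claim that physiological state scales the dopaminergic signal.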


Pages: 33
