A simple computational algorithm of model-based choice preference

被引：0

作者：

Asako Toyama

Kentaro Katahira

Hideki Ohira

机构：

[1] Nagoya University,Department of Psychology, Graduate School of Environmental Studies

[2] Nagoya University,Department of Psychology, Graduate School of Informatics

[3] Japan Society for the Promotion of Science,undefined

来源：

Cognitive, Affective, & Behavioral Neuroscience | 2017年 / 17卷

关键词：

Computational model; Model-free; Model-based; Eligibility trace; Reinforcement learning;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

A broadly used computational framework posits that two learning systems operate in parallel during the learning of choice preferences—namely, the model-free and model-based reinforcement-learning systems. In this study, we examined another possibility, through which model-free learning is the basic system and model-based information is its modulator. Accordingly, we proposed several modified versions of a temporal-difference learning model to explain the choice-learning process. Using the two-stage decision task developed by Daw, Gershman, Seymour, Dayan, and Dolan (2011), we compared their original computational model, which assumes a parallel learning process, and our proposed models, which assume a sequential learning process. Choice data from 23 participants showed a better fit with the proposed models. More specifically, the proposed eligibility adjustment model, which assumes that the environmental model can weight the degree of the eligibility trace, can explain choices better under both model-free and model-based controls and has a simpler computational algorithm than the original model. In addition, the forgetting learning model and its variation, which assume changes in the values of unchosen actions, substantially improved the fits to the data. Overall, we show that a hybrid computational model best fits the data. The parameters used in this model succeed in capturing individual tendencies with respect to both model use in learning and exploration behavior. This computational model provides novel insights into learning with interacting model-free and model-based components.

引用

页码：764 / 783

页数：19

共 50 条

[41] Gait recognition based on model-based methods and deep belief networks
Benouis, Mohamed
Senouci, Mohamed
Tlemsani, Redouane
Mostefai, Lotfi
INTERNATIONAL JOURNAL OF BIOMETRICS, 2016, 8 (3-4) : 237 - 253
[42] Optimistic MLE: A Generic Model-Based Algorithm for Partially Observable Sequential Decision Making
Liu, Qinghua
Netrapalli, Praneeth
Szepesvari, Csaba
Jin, Chi
PROCEEDINGS OF THE 55TH ANNUAL ACM SYMPOSIUM ON THEORY OF COMPUTING, STOC 2023, 2023, : 363 - 376
[43] MODEL-BASED INFORMATION ACCESS
JAGANATHAN, V
KARINTHI, R
ALMASI, G
INTERNATIONAL JOURNAL OF INTELLIGENT & COOPERATIVE INFORMATION SYSTEMS, 1994, 3 (02): : 107 - 127
[44] Model-based development in automation
Witte, Martin Emmerich
Diedrich, Christian
Figalist, Helmut
AT-AUTOMATISIERUNGSTECHNIK, 2018, 66 (05) : 360 - 371
[45] Model-based learning and the contribution of the orbitofrontal cortex to the model-free world
McDannald, Michael A.
Takahashi, Yuji K.
Lopatina, Nina
Pietras, Brad W.
Jones, Josh L.
Schoenbaum, Geoffrey
EUROPEAN JOURNAL OF NEUROSCIENCE, 2012, 35 (07) : 991 - 996
[46] A computational model-based study on the exchangeability of hepatic venous pressure gradients measured in multiple hepatic veins
Wang, Tianqi
Liang, Fuyou
Li, Lei
Zhang, Wen
Wang, Guangchuan
Wang, Jitao
Zhang, Chunqing
Qi, Xiaolong
MEDICAL ENGINEERING & PHYSICS, 2020, 84 : 28 - 35
[47] A computational model-based study on the feasibility of predicting post-splenectomy thrombosis using hemodynamic metrics
Wang, Tianqi
Yong, Yan
Ge, Xinyang
Wang, Jitao
FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY, 2024, 11
[48] Loss Aversion Correlates With the Propensity to Deploy Model-Based Control
Solway, Alec
Lohrenz, Terry
Montague, P. Read
FRONTIERS IN NEUROSCIENCE, 2019, 13
[49] Trust, but Verify: Alleviating Pessimistic Errors in Model-Based Exploration
Czechowski, Konrad
Odrzygozdz, Tomasz
Izworski, Michal
Zbysinski, Marek
Kucinski, Lukasz
Milos, Piotr
2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
[50] Model-Based Reinforcement Learning with Automated Planning for Network Management
Ordonez, Armando
Mauricio Caicedo, Oscar
Villota, William
Rodriguez-Vivas, Angela
da Fonseca, Nelson L. S.
SENSORS, 2022, 22 (16)

← 1 2 3 4 5 →