A simple computational algorithm of model-based choice preference

被引:0
|
作者
Asako Toyama
Kentaro Katahira
Hideki Ohira
机构
[1] Nagoya University,Department of Psychology, Graduate School of Environmental Studies
[2] Nagoya University,Department of Psychology, Graduate School of Informatics
[3] Japan Society for the Promotion of Science,undefined
来源
Cognitive, Affective, & Behavioral Neuroscience | 2017年 / 17卷
关键词
Computational model; Model-free; Model-based; Eligibility trace; Reinforcement learning;
D O I
暂无
中图分类号
学科分类号
摘要
A broadly used computational framework posits that two learning systems operate in parallel during the learning of choice preferences—namely, the model-free and model-based reinforcement-learning systems. In this study, we examined another possibility, through which model-free learning is the basic system and model-based information is its modulator. Accordingly, we proposed several modified versions of a temporal-difference learning model to explain the choice-learning process. Using the two-stage decision task developed by Daw, Gershman, Seymour, Dayan, and Dolan (2011), we compared their original computational model, which assumes a parallel learning process, and our proposed models, which assume a sequential learning process. Choice data from 23 participants showed a better fit with the proposed models. More specifically, the proposed eligibility adjustment model, which assumes that the environmental model can weight the degree of the eligibility trace, can explain choices better under both model-free and model-based controls and has a simpler computational algorithm than the original model. In addition, the forgetting learning model and its variation, which assume changes in the values of unchosen actions, substantially improved the fits to the data. Overall, we show that a hybrid computational model best fits the data. The parameters used in this model succeed in capturing individual tendencies with respect to both model use in learning and exploration behavior. This computational model provides novel insights into learning with interacting model-free and model-based components.
引用
收藏
页码:764 / 783
页数:19
相关论文
共 50 条
  • [41] Gait recognition based on model-based methods and deep belief networks
    Benouis, Mohamed
    Senouci, Mohamed
    Tlemsani, Redouane
    Mostefai, Lotfi
    INTERNATIONAL JOURNAL OF BIOMETRICS, 2016, 8 (3-4) : 237 - 253
  • [42] Optimistic MLE: A Generic Model-Based Algorithm for Partially Observable Sequential Decision Making
    Liu, Qinghua
    Netrapalli, Praneeth
    Szepesvari, Csaba
    Jin, Chi
    PROCEEDINGS OF THE 55TH ANNUAL ACM SYMPOSIUM ON THEORY OF COMPUTING, STOC 2023, 2023, : 363 - 376
  • [43] MODEL-BASED INFORMATION ACCESS
    JAGANATHAN, V
    KARINTHI, R
    ALMASI, G
    INTERNATIONAL JOURNAL OF INTELLIGENT & COOPERATIVE INFORMATION SYSTEMS, 1994, 3 (02): : 107 - 127
  • [44] Model-based development in automation
    Witte, Martin Emmerich
    Diedrich, Christian
    Figalist, Helmut
    AT-AUTOMATISIERUNGSTECHNIK, 2018, 66 (05) : 360 - 371
  • [45] Model-based learning and the contribution of the orbitofrontal cortex to the model-free world
    McDannald, Michael A.
    Takahashi, Yuji K.
    Lopatina, Nina
    Pietras, Brad W.
    Jones, Josh L.
    Schoenbaum, Geoffrey
    EUROPEAN JOURNAL OF NEUROSCIENCE, 2012, 35 (07) : 991 - 996
  • [46] A computational model-based study on the exchangeability of hepatic venous pressure gradients measured in multiple hepatic veins
    Wang, Tianqi
    Liang, Fuyou
    Li, Lei
    Zhang, Wen
    Wang, Guangchuan
    Wang, Jitao
    Zhang, Chunqing
    Qi, Xiaolong
    MEDICAL ENGINEERING & PHYSICS, 2020, 84 : 28 - 35
  • [47] A computational model-based study on the feasibility of predicting post-splenectomy thrombosis using hemodynamic metrics
    Wang, Tianqi
    Yong, Yan
    Ge, Xinyang
    Wang, Jitao
    FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY, 2024, 11
  • [48] Loss Aversion Correlates With the Propensity to Deploy Model-Based Control
    Solway, Alec
    Lohrenz, Terry
    Montague, P. Read
    FRONTIERS IN NEUROSCIENCE, 2019, 13
  • [49] Trust, but Verify: Alleviating Pessimistic Errors in Model-Based Exploration
    Czechowski, Konrad
    Odrzygozdz, Tomasz
    Izworski, Michal
    Zbysinski, Marek
    Kucinski, Lukasz
    Milos, Piotr
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [50] Model-Based Reinforcement Learning with Automated Planning for Network Management
    Ordonez, Armando
    Mauricio Caicedo, Oscar
    Villota, William
    Rodriguez-Vivas, Angela
    da Fonseca, Nelson L. S.
    SENSORS, 2022, 22 (16)