Proselfs depend more on model-based than model-free learning in a non-social probabilistic state-transition task

被引:0
作者
Oguchi, Mineki [1 ]
Li, Yang [1 ,2 ]
Matsumoto, Yoshie [1 ,3 ]
Kiyonari, Toko [4 ]
Yamamoto, Kazuhiko [5 ]
Sugiura, Shigeki [5 ]
Sakagami, Masamichi [1 ]
机构
[1] Tamagawa Univ, Brain Sci Inst, 6 1 1 Tamagawagakuen, Machida, Tokyo, Japan
[2] Nagoya Univ, Grad Sch Informat, Nagoya, Japan
[3] Seinan Gakuin Univ, Fac Human Sci, Dept Psychol, Fukuoka, Japan
[4] Aoyama Gakuin Univ, Sch Social Informat, Sagamihara, Kanagawa, Japan
[5] Genesis Res Inst, Nagoya, Aichi, Japan
关键词
SOCIAL VALUE ORIENTATION; DECISION-MAKING; MECHANISMS; FOUNDATIONS; COOPERATION; ARBITRATION; INFERENCE; REWARDS; SYSTEMS;
D O I
10.1038/s41598-023-27609-0
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Humans form complex societies in which we routinely engage in social decision-making regarding the allocation of resources among ourselves and others. One dimension that characterizes social decision-making in particular is whether to prioritize self-interest or respect for others-proself or prosocial. What causes this individual difference in social value orientation? Recent developments in the social dual-process theory argue that social decision-making is characterized by its underlying domain-general learning systems: the model-free and model-based systems. In line with this "learning" approach, we propose and experimentally test the hypothesis that differences in social preferences stem from which learning system is dominant in an individual. Here, we used a non-social state transition task that allowed us to assess the balance between model-free/model-based learning and investigate its relation to the social value orientations. The results showed that proselfs depended more on model-based learning, whereas prosocials depended more on model-free learning. Reward amount and reaction time analyses showed that proselfs learned the task structure earlier in the session than prosocials, reflecting their difference in model-based/model-free learning dependence. These findings support the learning hypothesis on what makes differences in social preferences and have implications for understanding the mechanisms of prosocial behavior.
引用
收藏
页数:15
相关论文
共 33 条
  • [31] Hierarchical control architecture regulating competition between model-based and context-dependent model-free reinforcement learning strategies
    Kim, Dongjae
    Park, Geon Young
    Lee, Sang Wan
    2018 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2018, : 990 - 994
  • [32] Path-Guided Model-Free Flocking Control of Unmanned Surface Vehicles Based on Concurrent Learning Extended State Observers
    Peng, Zhouhua
    Jiang, Yue
    Liu, Lu
    Shi, Yang
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 53 (08): : 4729 - 4739
  • [33] An efficient model-free adaptive optimal control of continuous-time nonlinear non-zero-sum games based on integral reinforcement learning with exploration
    Guo, Lei
    Xiong, Wenbo
    Song, Yuan
    Gan, Dongming
    IET CONTROL THEORY AND APPLICATIONS, 2024, 18 (06) : 748 - 763