Heterogeneity in generalized reinforcement learning and its relation to cognitive ability

被引:7
作者
Chen, Shu-Heng [1 ]
Du, Ye-Rong [1 ,2 ]
机构
[1] Natl Chengchi Univ, AI ECON Res Ctr, Dept Econ, Taipei, Taiwan
[2] Taiwan Inst Econ Res, Reg Dev Res Ctr, Taipei, Taiwan
关键词
Generalized reinforcement learning; Experience-weighted attraction learning; Cognitive ability; Granularity; NORMAL-FORM GAMES; COORDINATION GAMES; DOPAMINE NEURONS; WORKING-MEMORY; EXPERIENCE; PREDICTION; CAPACITY; MIDBRAIN; BEHAVIOR; NUMBERS;
D O I
10.1016/j.cogsys.2016.11.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we study the connections between working memory capacity (WMC) and learning in the context of economic guessing games. We apply a generalized version of reinforcement learning, popularly known as the experience-weighted attraction (EWA) learning model, which has a connection to specific cognitive constructs, such as memory decay, the depreciation of past experience, counterfactual thinking, and choice intensity. Through the estimates of the model, we examine behavioral differences among individuals due to different levels of WMC. In accordance with 'Miller's magic number', which is the constraint of working memory capacity, we consider two different sizes (granularities) of strategy space: one is larger (finer) and one is smaller (coarser). We find that constraining the EWA models by using levels (granules) within the limits of working memory allows for a better characterization of the data based on individual differences in WMC. Using this level-reinforcement version of EWA learning, also referred to as the EWA rule learning model, we find that working memory capacity can significantly affect learning behavior. Our likelihood ratio test rejects the null that subjects with high WMC and subjects with low WMC follow the same EWA learning model. In addition, the parameter corresponding to 'counterfactual thinking ability' is found to be reduced when working memory capacity is low. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:1 / 22
页数:22
相关论文
共 61 条
[1]  
Arthur W.B., 1993, J ECOLUTIONARY EC, V3, P1, DOI DOI 10.1007/BF01199986
[2]   Midbrain dopamine neurons encode a quantitative reward prediction error signal [J].
Bayer, HM ;
Glimcher, PW .
NEURON, 2005, 47 (01) :129-141
[3]   Hierarchical reinforcement learning and decision making [J].
Botvinick, Matthew Michael .
CURRENT OPINION IN NEUROBIOLOGY, 2012, 22 (06) :956-962
[4]   Cognitive effort in the Beauty Contest Game [J].
Branas-Garza, Pablo ;
Garcia-Munoz, Teresa ;
Hernan Gonzalez, Roberto .
JOURNAL OF ECONOMIC BEHAVIOR & ORGANIZATION, 2012, 83 (02) :254-260
[5]   On the behavior of proposers in ultimatum games [J].
Brenner, Thomas ;
Vriend, Nicolaas J. .
JOURNAL OF ECONOMIC BEHAVIOR & ORGANIZATION, 2006, 61 (04) :617-631
[6]   Rational route to randomness [J].
Brock, WA ;
Hommes, CH .
ECONOMETRICA, 1997, 65 (05) :1059-1095
[7]   Adaptive learning and equilibrium selection in experimental coordination games: An ARCH(1) approach [J].
Broseta, B .
GAMES AND ECONOMIC BEHAVIOR, 2000, 32 (01) :25-50
[8]   Higher cognitive ability is associated with lower entries in a p-beauty contest [J].
Burnham, Terence C. ;
Cesarini, David ;
Johannesson, Magnus ;
Lichtenstein, Paul ;
Wallace, Bjorn .
JOURNAL OF ECONOMIC BEHAVIOR & ORGANIZATION, 2009, 72 (01) :171-175
[9]  
Bush R. R., 1955, STOCHASTIC MODELS LE, DOI DOI 10.1037/14496-000
[10]  
Byrne RMJ, 2005, RATIONAL IMAGINATION: HOW PEOPLE CREATE ALTERNATIVES TO REALITY, P1