Dynamic learning in behavioral games: A hidden Markov mixture of experts approach

被引:0
作者
Asim Ansari
Ricardo Montoya
Oded Netzer
机构
[1] Columbia University,Columbia Business School
[2] University of Chile,Industrial Engineering Department
来源
Quantitative Marketing and Economics | 2012年 / 10卷
关键词
Learning; Behavioral game theory; Experimental economics; Hidden Markov mixture of experts model; Bayesian estimation; c5; c7; c11; D83;
D O I
暂无
中图分类号
学科分类号
摘要
Over the course of a repeated game, players often exhibit learning in selecting their best response. Research in economics and marketing has identified two key types of learning rules: belief and reinforcement. It has been shown that players use either one of these learning rules or a combination of them, as in the Experience-Weighted Attraction (EWA) model. Accounting for such learning may help in understanding and predicting the outcomes of games. In this research, we demonstrate that players not only employ learning rules to determine what actions to choose based on past choices and outcomes, but also change their learning rules over the course of the game. We investigate the degree of state dependence in learning and uncover the latent learning rules and learning paths used by the players. We build a non-homogeneous hidden Markov mixture of experts model which captures shifts between different learning rules over the course of a repeated game. The transition between the learning rule states can be affected by the players’ experiences in the previous round of the game. We empirically validate our model using data from six games that have been previously used in the literature. We demonstrate that one can obtain a richer understanding of how different learning rules impact the observed strategy choices of players by accounting for the latent dynamics in the learning rules. In addition, we show that such an approach can improve our ability to predict observed choices in games.
引用
收藏
页码:475 / 503
页数:28
相关论文
共 88 条
[1]  
Allenby G(1998)On the heterogeneity of demand Journal of Marketing Research 34 384-389
[2]  
Arora N(2006)Semiparametric Thurstonian models for recurrent choices: a Bayesian analysis Psychometrika 71 631-657
[3]  
Ginter J(2003)E-customization Journal of Marketing Research 40 131-145
[4]  
Ansari A(2006)An adaptive version for the metropolis adjusted Langevin algorithm with a truncated drift Methodology and Computing in Applied Probability 8 235-254
[5]  
Iyengar R(1998)EWA learning in coordination games: probability rules, heterogeneity, and time variation Journal of Mathematical Psychology 42 305-326
[6]  
Ansari A(1999)Experience-weighted attraction learning in games Econometrica 87 827-874
[7]  
Mela C(2002)Sophisticated learning and strategic teaching Journal of Economic Theory 104 137-188
[8]  
Atchadé YF(2003)Models of thinking, and teaching in games The American Economic Review 93 192-195
[9]  
Camerer C(2007)Self-tuning experience weighted attraction learning in games Journal of Economic Theory 133 177-198
[10]  
Ho TH(1995)Adaptive dynamics in coordination games Econometrica 63 103-143