Rage against the machines: how subjects play against learning algorithms

被引:20
作者
Duersch, Peter [1 ]
Kolb, Albert [1 ]
Oechssler, Joerg [1 ]
Schipper, Burkhard C. [2 ]
机构
[1] Heidelberg Univ, Dept Econ, D-69117 Heidelberg, Germany
[2] Univ Calif Davis, Dept Econ, Davis, CA 95616 USA
关键词
Learning; Fictitious play; Imitation; Reinforcement; Trial & error; Strategic teaching; Cournot duopoly; Experiments; Internet; EQUILIBRIA; EXPERIENCE; IMITATION; OLIGOPOLY; GAMES;
D O I
10.1007/s00199-009-0446-0
中图分类号
F [经济];
学科分类号
02 ;
摘要
We use a large-scale internet experiment to explore how subjects learn to play against computers that are programmed to follow one of a number of standard learning algorithms. The learning theories are (unbeknown to subjects) a best response process, fictitious play, imitation, reinforcement learning, and a trial & error process. We explore how subjects' performances depend on their opponents' learning algorithm. Furthermore, we test whether subjects try to influence those algorithms to their advantage in a forward-looking way (strategic teaching). We find that strategic teaching occurs frequently and that all learning algorithms are subject to exploitation with the notable exception of imitation.
引用
收藏
页码:407 / 430
页数:24
相关论文
共 34 条
[1]   The evolutionary stability of perfectly competitive behavior [J].
Alós-Ferrer, C ;
Ania, AB .
ECONOMIC THEORY, 2005, 26 (03) :497-516
[2]  
[Anonymous], 1998, THEORY LEARNING GAME
[3]   Imitation - theory and experimental evidence [J].
Apesteguia, Jose ;
Huck, Steffen ;
Oechssler, Joerg .
JOURNAL OF ECONOMIC THEORY, 2007, 136 (01) :217-235
[4]  
Brown GW., 1951, Activity analysis of production and allocation, V13
[5]   Sophisticated experience-weighted attraction learning and strategic teaching in repeated games [J].
Camerer, CF ;
Ho, TH ;
Chong, JK .
JOURNAL OF ECONOMIC THEORY, 2002, 104 (01) :137-188
[6]   Recommended play and correlated equilibria: an experimental study [J].
Cason, Timothy N. ;
Sharma, Tridib .
ECONOMIC THEORY, 2007, 33 (01) :11-27
[7]  
Coricelli G., 2005, Working Paper
[8]  
Cournot A., 1927, RES MATH PRINCIPLES
[9]   Herding and contrarian behavior in financial markets: An Internet experiment [J].
Drehmann, M ;
Oechssler, J ;
Roider, A .
AMERICAN ECONOMIC REVIEW, 2005, 95 (05) :1403-1426
[10]   Learning from personal experience: One rational guy and the justification of myopia [J].
Ellison, G .
GAMES AND ECONOMIC BEHAVIOR, 1997, 19 (02) :180-210