SIMULTANEOUS STATE ESTIMATION AND LEARNING IN REPEATED COURNOT GAMES

Cited by: 6

Authors:

Kebriaei, Hamed [1]

Ahmadabadi, Majid Nili [1,2]

Rahimi-Kian, Ashkan [1]

Affiliations:

[1] Univ Tehran, Control & Intelligent Proc Ctr Excellence, Sch ECE, Tehran, Iran

[2] Inst Res Fundamental Sci, Sch Cognit Sci, Tehran, Iran

Source:

APPLIED ARTIFICIAL INTELLIGENCE | 2014, Volume 28, Issue 01

Keywords:

SIMPLE DYNAMIC-MODEL; PEOPLE PLAY GAMES; BIDDING STRATEGIES; AGENTS;

DOI:

10.1080/08839514.2014.862774

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This article proposes an intelligent agent that can make sound decisions in a repeated Cournot game with incomplete information. Neither the market model nor the competitors' decision models are known to the players. The proposed agent combines the k-nearest neighbor (KNN) method with the Bayes classifier to predict the next actions of its rivals from the market decision history. The agent takes the predicted actions as an estimate of its next state and interactively learns the expected payoff of its state-action pairs using a reinforcement learning (RL) algorithm. Results are presented for the proposed agent competing against two benchmark competitors in different simulated Cournot games. The simulation results show that the proposed agent earns significantly higher payoffs than the two benchmark agents.
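The loop described in the abstract (predict the rival's next action, treat the prediction as the state, learn state-action payoffs from observed rewards) might be sketched roughly as follows. This is an illustrative toy and not the authors' implementation: the duopoly setting, the linear inverse demand, the quantity grid `LEVELS`, the best-responding rival, the k-nearest-neighbor vote used as an empirical class posterior, and all parameter values are assumptions introduced here.

```python
import random
from collections import Counter, defaultdict

LEVELS = [0, 1, 2, 3, 4]  # hypothetical discrete quantity levels


def payoff(q_own, q_rival, a=10.0, b=1.0, c=1.0):
    """Profit under an assumed linear inverse demand P = a - b*(q_own + q_rival)."""
    price = max(a - b * (q_own + q_rival), 0.0)
    return (price - c) * q_own


def knn_predict(history, query, k=3):
    """Predict the rival's next action from (context, next_action) pairs.

    The k contexts closest to `query` vote; the vote shares act as an
    empirical class posterior and the most probable class is returned.
    """
    if not history:
        return LEVELS[0]

    def dist(ctx):
        return sum((x - y) ** 2 for x, y in zip(ctx, query))

    nearest = sorted(history, key=lambda item: dist(item[0]))[:k]
    votes = Counter(action for _, action in nearest)
    return votes.most_common(1)[0][0]


def run(rounds=500, epsilon=0.1, alpha=0.1, seed=0):
    """Play a repeated game; return the learned table and the observed history."""
    rng = random.Random(seed)
    q_table = defaultdict(float)  # (predicted rival action, own action) -> payoff estimate
    history = []                  # ((own_prev, rival_prev), rival_next)
    own_prev, rival_prev = LEVELS[0], LEVELS[0]
    for _ in range(rounds):
        # State estimate: the predicted next action of the rival.
        state = knn_predict(history, (own_prev, rival_prev))
        # Epsilon-greedy choice over the learned payoff estimates.
        if rng.random() < epsilon:
            action = rng.choice(LEVELS)
        else:
            action = max(LEVELS, key=lambda q: q_table[(state, q)])
        # Assumed rival: best response to the agent's previous quantity.
        rival = max(LEVELS, key=lambda q: payoff(q, own_prev))
        # The agent only observes the realized reward, not the market model.
        reward = payoff(action, rival)
        q_table[(state, action)] += alpha * (reward - q_table[(state, action)])
        history.append(((own_prev, rival_prev), rival))
        own_prev, rival_prev = action, rival
    return q_table, history
```

Calling `run()` returns the learned payoff table and the recorded history; the agent's update uses only the observed reward, which mirrors the incomplete-information setting in spirit while leaving out the details of the published method.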

Pages: 66-89

Number of pages: 24

References (24 in total):

[1] Bigoni M., 2009, INFORM LEARNING OLIG

[2] Bischi, GI; Kopel, M. Equilibrium selection in a nonlinear duopoly game with adaptive expectations [J]. JOURNAL OF ECONOMIC BEHAVIOR & ORGANIZATION, 2001, 46 (01): 73-100

[3] Bischi GI, 2000, ANN INT SOC DYN GAME, V5, P361

[4] Busoniu, Lucian; Babuska, Robert; De Schutter, Bart. A comprehensive survey of multiagent reinforcement learning [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2008, 38 (02): 156-172

[5] Camerer, C; Ho, TH. Experience-weighted attraction learning in normal form games [J]. ECONOMETRICA, 1999, 67 (04): 827-874

[6] Cournot A., 1838, RES PRINCIPLES THEOR

[7] Cunningham, LB; Baldick, R; Baughman, ML. An empirical study of applied game theory: Transmission constrained Cournot behavior [J]. IEEE TRANSACTIONS ON POWER SYSTEMS, 2002, 17 (01): 166-172

[8] Erev I, 1998, AM ECON REV, V88, P848

[9] FISHER FM, 1960, REV ECON STUD, V28, P125

[10] Fukunaga K, 1990, INTRO STAT PATTERN R, V2nd
