Particle swarm optimization approaches to coevolve strategies for the iterated prisoner's dilemma

被引：83

作者：

Franken, N ^{[1
]}

Engelbrecht, AP ^{[1
]}

机构：

[1] Univ Pretoria, Sch Informat Technol, Dept Comp Sci, ZA-0002 Pretoria, South Africa

来源：

IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION | 2005年 / 9卷 / 06期

关键词：

coevolution; iterated prisoner's dilemma (IPD); neural networks (NNs); particle swarm optimization (PSO);

D O I：

10.1109/TEVC.2005.856202

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper presents and investigates the application of coevolutionary training techniques based on particle swarm optimization (PSO) to evolve playing strategies for the nonzero sum problem of the iterated prisoner's dilemma (IPD). Three different coevolutionary PSO techniques are used, differing in the way that IPD strategies are presented: A neural network (NN) approach in which the NN is used to predict the next action, a binary PSO approach in which the particle represents,a complete playing strategy, and finally, a novel approach that exploits the symmetrical structure of man-made strategies. The last technique uses a PSO algorithm as a function approximator to evolve a function that characterizes the dynamics of the IPD. These different PSO approaches are compared experimentally with one another, and with popular man-made strategies. The performance of these approaches is evaluated in both clean and noisy environments. Results indicate that NNs cooperate well, but may develop weak strategies that can cause catastrophic collapses. The binary PSO technique does not have the same deficiency, instead resulting in an overall state of equilibrium in which some strategies are allowed to exploit the population, but never dominate. The symmetry approach is not as successful as the binary PSO approach in maintaining cooperation in both noisy and noiseless environments-exhibiting selfish behavior against the benchmark strategies and depriving them of receiving almost any payoff. Overall, the PSO techniques are successful at generating a variety of strategies for use in the IPD, duplicating and improving on existing evolutionary IPD population observations.

引用

页码：562 / 579

页数：18

共 40 条

[1]

[Anonymous], P 1999 C EV COMP CEC

[2]

Axelrod R, 2006, EVOLUTION COOPERATIO

[3]

AXELROD R, 2001, ROUTLEDGE ENCY INT P

[4]

Axelrod R., 1987, GENETIC ALGORITHMS S, V1, P1

[5]

*BELL TEL LAB, 1964, TRANSM SYST COMM

[6] Evolution, neural networks, games, and intelligence [J].

Chellapilla, K ;

Fogel, DB .

PROCEEDINGS OF THE IEEE, 1999, 87 (09) :1471-1496

[7]

Chellapilla K, 2000, IEEE C EVOL COMPUTAT, P857, DOI 10.1109/CEC.2000.870729

[8] Evolving neural networks to play checkers without relying on expert knowledge [J].

Chellapilla, K ;

Fogel, DB .

IEEE TRANSACTIONS ON NEURAL NETWORKS, 1999, 10 (06) :1382-1391

[9] The particle swarm - Explosion, stability, and convergence in a multidimensional complex space [J].

Clerc, M ;

Kennedy, J .

IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2002, 6 (01) :58-73

[10]

Darwen P. J., 2002, International Journal of Computational Intelligence and Applications, V2, P83, DOI 10.1142/S1469026802000440

← 1 2 3 4 →