The limits and robustness of reinforcement learning in Lewis signalling games

被引：5

作者：

Catteeuw, David ^{[1
]}

Manderick, Bernard ^{[1
]}

机构：

[1] Vrije Univ Brussel, Artificial Intelligence Lab, B-1050 Brussels, Belgium

来源：

CONNECTION SCIENCE | 2014年 / 26卷 / 02期

关键词：

reinforcement learning; signalling; win-stay/lose-inaction; Lewis signalling games; EVOLUTION;

D O I：

10.1080/09540091.2014.885303

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Lewis signalling games are a standard model to study the emergence of language. We introduce win-stay/lose-inaction, a random process that only updates behaviour on success and never deviates from what was once successful, prove that it always ends up in a state of optimal communication in all Lewis signalling games, and predict the number of interactions it needs to do so: N-3 interactions for Lewis signalling games with N equiprobable types. We show three reinforcement learning algorithms (Roth-Erev learning, Q-learning, and Learning Automata) that can imitate win-stay/lose-inaction and can even cope with errors in Lewis signalling games.

引用

页码：161 / 177

页数：18

共 50 条

[1] Hybrid learning in signalling games
Barrett, Jeffrey A.
Cochran, Calvin T.
Huttegger, Simon
Fujiwara, Naoki
JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2017, 29 (05) : 1119 - 1127
[2] Reinforcement learning in population games
Lahkar, Ratul
Seymour, Robert M.
GAMES AND ECONOMIC BEHAVIOR, 2013, 80 : 10 - 38
[3] Ambiguity and context learning in signalling games
Tang, Liping
JOURNAL OF LOGIC AND COMPUTATION, 2021, 31 (08) : 1979 - 2003
[4] Social aspiration reinforcement learning in Cournot games
Fatas, Enrique
Morales, Antonio J.
Jaramillo-Gutierrez, Ainhoa
ECONOMIC THEORY, 2024,
[5] Reinforcement learning applied to games
Crespo, Joao
Wichert, Andreas
SN APPLIED SCIENCES, 2020, 2 (05):
[6] Reinforcement learning applied to games
João Crespo
Andreas Wichert
SN Applied Sciences, 2020, 2
[7] Robustness of Learning in Games With Heterogeneous Players
Akbar, Aqsa Shehzadi
Jaleel, Hassan
Abbas, Waseem
Shamma, Jeff S.
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2023, 68 (03) : 1553 - 1567
[8] Safe Reinforcement Learning With Dual Robustness
Li, Zeyang
Hu, Chuxiong
Wang, Yunan
Yang, Yujie
Li, Shengbo Eben
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (12) : 10876 - 10890
[9] Learning in Games via Reinforcement and Regularization
Mertikopoulos, Panayotis
Sandholm, William H.
MATHEMATICS OF OPERATIONS RESEARCH, 2016, 41 (04) : 1297 - 1324
[10] Baselines for Reinforcement Learning in Text Games
Zelinka, Mikulas
2018 IEEE 30TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2018, : 320 - 327

← 1 2 3 4 5 →