Bridge Bidding via Deep Reinforcement Learning and Belief Monte Carlo Search

Cited by: 2

Authors:

Qiu, Zizhang [1]

Wang, Shouguang [1]

You, Dan [1]

Zhou, MengChu [1]

Affiliations:

[1] Zhejiang Gongshang University, School of Information and Electronic Engineering, Hangzhou 310018, People's Republic of China

Source:

IEEE-CAA JOURNAL OF AUTOMATICA SINICA | 2024 / Vol. 11 / Issue 10

Keywords:

Bridges; Monte Carlo methods; Supervised learning; Interference; Games; Deep reinforcement learning; Software; Contract Bridge; reinforcement learning; search; GO; ALGORITHM; GAME;

DOI:

10.1109/JAS.2024.124488

Chinese Library Classification number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812 ;

Abstract:

Contract Bridge, a four-player imperfect-information game, comprises two phases: bidding and playing. While computer programs excel at playing, bidding remains challenging because players must exchange information with their partners while interfering with their opponents' communication. In this work, we introduce a Bridge bidding agent that combines supervised learning, deep reinforcement learning via self-play, and a test-time search approach. Our experiments demonstrate that our agent outperforms WBridge5, a highly regarded computer Bridge program that has won multiple world championships, by 0.98 IMPs (international match points) per deal over 10 000 deals, with a much more cost-effective approach. This performance significantly surpasses the previous state of the art (0.85 IMPs per deal). Note that a gain of 0.1 IMPs per deal is considered a significant improvement in Bridge bidding.

Pages: 2111-2122

Page count: 12

