Bridge Bidding via Deep Reinforcement Learning and Belief Monte Carlo Search

被引：2

作者：

Qiu, Zizhang ^{[1
]}

Wang, Shouguang ^{[1
]}

You, Dan ^{[1
]}

Zhou, MengChu ^{[1
]}

机构：

[1] Zhejiang Gongshang Univ, Sch Informat & Elect Engn, Hangzhou 310018, Peoples R China

来源：

IEEE-CAA JOURNAL OF AUTOMATICA SINICA | 2024年 / 11卷 / 10期

关键词：

Bridges; Monte Carlo methods; Supervised learning; Interference; Games; Deep reinforcement learning; Software; Contract Bridge; reinforcement learning; search; GO; ALGORITHM; GAME;

D O I：

10.1109/JAS.2024.124488

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Contract Bridge, a four-player imperfect information game, comprises two phases: bidding and playing. While computer programs excel at playing, bidding presents a challenging aspect due to the need for information exchange with partners and interference with communication of opponents. In this work, we introduce a Bridge bidding agent that combines supervised learning, deep reinforcement learning via self-play, and a test-time search approach. Our experiments demonstrate that our agent outperforms WBridge5, a highly regarded computer Bridge software that has won multiple world championships, by a performance of 0.98 IMPs (international match points) per deal over 10 000 deals, with a much cost-effective approach. The performance significantly surpasses previous state-of-the-art (0.85 IMPs per deal). Note 0.1 IMPs per deal is a significant improvement in Bridge bidding.

引用

页码：2111 / 2122

页数：12

共 50 条

[1] Deep Reinforcement Learning Using Optimized Monte Carlo Tree Search in EWN
Zhang, Yixian
Li, Zhuoxuan
Cao, Yiding
Zhao, Xuan
Cao, Jinde
IEEE TRANSACTIONS ON GAMES, 2024, 16 (03) : 544 - 555
[2] On Monte Carlo Tree Search and Reinforcement Learning
Vodopivec, Tom
Samothrakis, Spyridon
Ster, Branko
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2017, 60 : 881 - 936
[3] Automatic Bridge Bidding Using Deep Reinforcement Learning
Yeh, Chih-Kuan
Hsieh, Cheng-Yu
Lin, Hsuan-Tien
IEEE TRANSACTIONS ON GAMES, 2018, 10 (04) : 365 - 377
[4] MetroZero: Deep Reinforcement Learning and Monte Carlo Tree Search for Optimized Metro Network Expansion
Alkilane, Khaled
Lee, Der-Horng
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2025, 26 (01) : 810 - 823
[5] Monte Carlo Tree Search With Reinforcement Learning for Motion Planning
Weingertner, Philippe
Ho, Minnie
Timofeev, Andrey
Aubert, Sebastien
Pita-Gil, Guillermo
2020 IEEE 23RD INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2020,
[6] Learning the Fastest RNA Folding Path Based on Reinforcement Learning and Monte Carlo Tree Search
Mao, Kangkun
Xiao, Yi
MOLECULES, 2021, 26 (15):
[7] DeepMCTS: Deep Reinforcement Learning Assisted Monte Carlo Tree Search for MIMO Detection
Mo, Tz-Wei
Chang, Ronald Y.
Kan, Te-Yi
2022 IEEE 95TH VEHICULAR TECHNOLOGY CONFERENCE (VTC2022-SPRING), 2022,
[8] Reinforcement Learning - A Bridge Between Numerical Methods and Monte Carlo
Borkar, Vivek S.
PERSPECTIVES IN MATHEMATICAL SCIENCES I: PROBABILITY AND STATISTICS, 2009, 7 : 71 - 91
[9] Tensor Implementation of Monte-Carlo Tree Search for Model-Based Reinforcement Learning
Balaz, Marek
Tarabek, Peter
APPLIED SCIENCES-BASEL, 2023, 13 (03):
[10] MCTSteg: A Monte Carlo Tree Search-Based Reinforcement Learning Framework for Universal Non-Additive Steganography
Mo, Xianbo
Tan, Shunquan
Li, Bin
Huang, Jiwu
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2021, 16 : 4306 - 4320

← 1 2 3 4 5 →