Bridge Bidding via Deep Reinforcement Learning and Belief Monte Carlo Search

Cited by: 2

Authors:

Qiu, Zizhang [1]

Wang, Shouguang [1]

You, Dan [1]

Zhou, MengChu [1]

Affiliations:

[1] Zhejiang Gongshang University, School of Information and Electronic Engineering, Hangzhou 310018, People's Republic of China

Source:

IEEE-CAA JOURNAL OF AUTOMATICA SINICA | 2024 / Vol. 11 / No. 10

Keywords:

Bridges; Monte Carlo methods; Supervised learning; Interference; Games; Deep reinforcement learning; Software; Contract Bridge; reinforcement learning; search; GO; ALGORITHM; GAME;

DOI:

10.1109/JAS.2024.124488

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Subject classification code:

0812;

Abstract:

Contract Bridge, a four-player imperfect-information game, comprises two phases: bidding and playing. While computer programs excel at playing, bidding remains challenging because it requires exchanging information with one's partner while interfering with the opponents' communication. In this work, we introduce a Bridge bidding agent that combines supervised learning, deep reinforcement learning via self-play, and a test-time search approach. Our experiments demonstrate that our agent outperforms WBridge5, a highly regarded computer Bridge program that has won multiple world championships, by 0.98 IMPs (international match points) per deal over 10 000 deals, with a much more cost-effective approach. This performance significantly surpasses the previous state of the art (0.85 IMPs per deal). Note that 0.1 IMPs per deal is a significant improvement in Bridge bidding.


Pages: 2111 - 2122

Page count: 12

50 items in total

[31] Learning Relevant Questions for Conversational Product Search using Deep Reinforcement Learning
Montazeralghaem, Ali
Allan, James
WSDM'22: PROCEEDINGS OF THE FIFTEENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2022, : 746 - 754
[32] Deep Inverse Reinforcement Learning for Objective Function Identification in Bidding Models
Guo, Hongye
Chen, Qixin
Xia, Qing
Kang, Chongqing
IEEE TRANSACTIONS ON POWER SYSTEMS, 2021, 36 (06) : 5684 - 5696
[33] Autonomous Earthquake Location via Deep Reinforcement Learning
Kuang, Wenhuan
Yuan, Congcong
Zou, Zhihui
Zhang, Jie
Zhang, Wei
SEISMOLOGICAL RESEARCH LETTERS, 2024, 95 (01) : 367 - 377
[34] Road Planning for Slums via Deep Reinforcement Learning
Zheng, Yu
Su, Hongyuan
Ding, Jingtao
Jin, Depeng
Li, Yong
PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 5695 - 5706
[35] Deep Reinforcement Learning for Strategic Bidding in Electricity Markets
Ye, Yujian
Qiu, Dawei
Sun, Mingyang
Papadaskalopoulos, Dimitrios
Strbac, Goran
IEEE TRANSACTIONS ON SMART GRID, 2020, 11 (02) : 1343 - 1355
[36] A Deep Reinforcement Learning Bidding Algorithm on Electricity Market
Jia, Shuai
Gan, Zhongxue
Xi, Yugeng
Li, Dewei
Xue, Shibei
Wang, Limin
JOURNAL OF THERMAL SCIENCE, 2020, 29 (05) : 1125 - 1134
[37] Beyond Trial and Error: Lane Keeping with Monte Carlo Tree Search-Driven Optimization of Reinforcement Learning
Kovari, Balint
Pelenczei, Balint
Knab, Istvan Gellert
Becsi, Tamas
ELECTRONICS, 2024, 13 (11)
[38] A Deep Reinforcement Learning Bidding Algorithm on Electricity Market
Shuai Jia
Zhongxue Gan
Yugeng Xi
Dewei Li
Shibei Xue
Limin Wang
Journal of Thermal Science, 2020, 29 : 1125 - 1134
[39] Bin Packing Optimization via Deep Reinforcement Learning
Wang, Baoying
Lin, Zhaohui
Kong, Weijie
Dong, Huixu
IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (03) : 2542 - 2549
[40] Adaptive Design of Alloys for CO2 Activation and Methanation via Reinforcement Learning Monte Carlo Tree Search Algorithm
Song, Zhilong
Zhou, Qionghua
Lu, Shuaihua
Dieb, Sae
Ling, Chongyi
Wang, Jinlan
JOURNAL OF PHYSICAL CHEMISTRY LETTERS, 2023, 14 (14) : 3594 - 3601
