Bridge Bidding via Deep Reinforcement Learning and Belief Monte Carlo Search

被引:2
|
作者
Qiu, Zizhang [1 ]
Wang, Shouguang [1 ]
You, Dan [1 ]
Zhou, MengChu [1 ]
机构
[1] Zhejiang Gongshang Univ, Sch Informat & Elect Engn, Hangzhou 310018, Peoples R China
关键词
Bridges; Monte Carlo methods; Supervised learning; Interference; Games; Deep reinforcement learning; Software; Contract Bridge; reinforcement learning; search; GO; ALGORITHM; GAME;
D O I
10.1109/JAS.2024.124488
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Contract Bridge, a four-player imperfect information game, comprises two phases: bidding and playing. While computer programs excel at playing, bidding presents a challenging aspect due to the need for information exchange with partners and interference with communication of opponents. In this work, we introduce a Bridge bidding agent that combines supervised learning, deep reinforcement learning via self-play, and a test-time search approach. Our experiments demonstrate that our agent outperforms WBridge5, a highly regarded computer Bridge software that has won multiple world championships, by a performance of 0.98 IMPs (international match points) per deal over 10 000 deals, with a much cost-effective approach. The performance significantly surpasses previous state-of-the-art (0.85 IMPs per deal). Note 0.1 IMPs per deal is a significant improvement in Bridge bidding.
引用
收藏
页码:2111 / 2122
页数:12
相关论文
共 50 条
  • [21] Belief-State Monte Carlo Tree Search for Phantom Go
    Wang, Jiao
    Zhu, Tan
    Li, Hongye
    Hsueh, Chu-Husan
    Wu, I. -Chen
    IEEE TRANSACTIONS ON GAMES, 2018, 10 (02) : 139 - 154
  • [22] Safe Reinforcement Learning for Autonomous Vehicle Using Monte Carlo Tree Search
    Mo, Shuojie
    Pei, Xiaofei
    Wu, Chaoxian
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (07) : 6766 - 6773
  • [23] Reinforcement Learning in Card Game Environments Using Monte Carlo Methods and Artificial Neural Networks
    Baykal, Omer
    Alpaslan, Ferda Nur
    2019 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2019, : 618 - 623
  • [24] Monte Carlo and Temporal Difference Methods in Reinforcement Learning
    Han, Isaac
    Oh, Seungwon
    Jung, Hoyoun
    Chung, Insik
    Kim, Kyung-Joong
    IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE, 2023, 18 (04) : 64 - 65
  • [25] Deep reinforcement learning applied to Monte Carlo power system reliability analysis
    Solheim, Oystein Rognes
    Hoverstad, Boye Annfelt
    Korpas, Magnus
    2023 IEEE BELGRADE POWERTECH, 2023,
  • [26] Exploring the first-move balance point of Go-Moku based on reinforcement learning and Monte Carlo tree search
    Liu, Pengsen
    Zhou, Jizhe
    Lv, Jiancheng
    KNOWLEDGE-BASED SYSTEMS, 2023, 261
  • [27] Developing an Adaptive AI Agent using Supervised and Reinforcement Learning with Monte Carlo Tree Search in FightingICE
    Tomas, John Paul Q.
    Aguas, Nathanael Jhonn R.
    De Villa, Angela N.
    Lim, Jasmin Rose G.
    2021 THE 4TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND INTELLIGENT SYSTEMS, CIIS 2021, 2021, : 31 - 36
  • [28] Reinforcement learning for active distribution network planning based on Monte Carlo tree search
    Zhang, Xi
    Hua, Weiqi
    Liu, Youbo
    Duan, Jiajun
    Tang, Zhiyuan
    Liu, Junyong
    INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2022, 138
  • [29] Monte Carlo Bayesian Hierarchical Reinforcement Learning
    Ngo Anh Vien
    Hung Ngo
    Ertel, Wolfgang
    AAMAS'14: PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2014, : 1551 - 1552
  • [30] A Game Model for Gomoku Based on Deep Learning and Monte Carlo Tree Search
    Li, Xiali
    He, Shuai
    Wu, Licheng
    Chen, Daiyao
    Zhao, Yue
    PROCEEDINGS OF 2019 CHINESE INTELLIGENT AUTOMATION CONFERENCE, 2020, 586 : 88 - 97