Matching in Multi-arm Bandit with Collision

被引:0
|
作者
Zhang, Yirui [1 ]
Wang, Siwei [2 ]
Fang, Zhixuan [1 ,3 ]
机构
[1] Tsinghua Univ, IIIS, Beijing, Peoples R China
[2] Microsoft Res, Redmond, WA USA
[3] Shanghai Qi Zhi Inst, Shanghai, Peoples R China
来源
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022 | 2022年
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we consider the matching of multi-agent multi-armed bandit problem, i.e., while agents prefer arms with higher expected reward, arms also have preferences on agents. In such case, agents pulling the same arm may encounter collisions, which leads to a reward of zero. For this problem, we design a specific communication protocol which uses deliberate collision to transmit information among agents, and propose a layer-based algorithm that helps establish optimal stable matching between agents and arms. With this subtle communication protocol, our algorithm achieves a state-of-the-art O(log T) regret in the decentralized matching market, and outperforms existing baselines in experimental results.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] DawnIK: Decentralized Collision-Aware Inverse Kinematics Solver for Heterogeneous Multi-Arm Systems
    University of Bonn, Humanoid Robots Lab, Germany
    不详
    IEEE-RAS Int. Conf. Humanoid Rob., 2023,
  • [22] Multi-Arm Trajectory Planning for Optimal Collision-Free Pick-and-Place Operations
    Mateu-Gomez, Daniel
    Martinez-Peral, Francisco Jose
    Perez-Vidal, Carlos
    TECHNOLOGIES, 2024, 12 (01)
  • [23] Active learning of the collision distance function for high-DOF multi-arm robot systems
    Kim, Jihwan
    Park, Frank Chongwoo
    ROBOTICA, 2024,
  • [24] Skyline Identification in Multi-Arm Bandits
    Cheu, Albert
    Sundaram, Ravi
    Ullman, Jonathan
    2018 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2018, : 1006 - 1010
  • [25] Manipulability optimization for multi-arm teleoperation
    Kennel-Maushart, Florian
    Poranne, Roi
    Coros, Stelian
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 3956 - 3962
  • [26] Research on Collision Avoidance Control of Multi-Arm Medical Robot Based on C-Space
    Zhao, Honghua
    Zhao, Jian
    Duan, Xingguang
    Gong, Xuyin
    Bian, Guibin
    IEEE ACCESS, 2020, 8 : 93219 - 93229
  • [27] DawnIK: Decentralized Collision-Aware Inverse Kinematics Solver for Heterogeneous Multi-Arm Systems
    Marangoz, Salih
    Menon, Rohit
    Dengler, Nils
    Bennewitz, Maren
    arXiv, 2023,
  • [28] MULTI-ARM MULTI-STAGE RANDOMISED TRIALS
    Parmar, Mahesh
    JOURNAL OF THORACIC ONCOLOGY, 2011, 6 (06) : S91 - S91
  • [29] Design of a multi-arm randomized clinical trial with no control arm
    Magaret, Amalia
    Angus, Derek C.
    Adhikari, Neill K. J.
    Banura, Patrick
    Kissoon, Niranjan
    Lawler, James V.
    Jacob, Shevin T.
    CONTEMPORARY CLINICAL TRIALS, 2016, 46 : 12 - 17
  • [30] Multi-arm covariate-adaptive randomization
    Feifang Hu
    Xiaoqing Ye
    Li-Xin Zhang
    Science China Mathematics, 2023, 66 : 163 - 190