KB-Tree: Learnable and Continuous Monte-Carlo Tree Search for Autonomous Driving Planning

被引:5
作者
Lei, Lanxin [1 ]
Luo, Ruiming [2 ]
Zheng, Renjie [1 ]
Wang, Jingke [1 ]
Zhang, JianWei [1 ]
Qiu, Cong [1 ]
Ma, Liulong [1 ]
Jin, Liyang [1 ]
Zhang, Ping [1 ]
Chen, Junbo [1 ]
机构
[1] Alibaba DAMO Acad, Dept Autonomous Driving Lab, Hangzhou, Peoples R China
[2] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310027, Peoples R China
来源
2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) | 2021年
关键词
OPTIMIZATION; SPACES;
D O I
10.1109/IROS51168.2021.9636442
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we present a novel learnable and continuous Monte-Carlo Tree Search method, named as KB-Tree, for motion planning in autonomous driving. The proposed method utilizes an asymptotical PUCB based on Kernel Regression (KR-AUCB) as a novel UCB variant, to improve the exploitation and exploration performance. In addition, we further optimize the sampling in continuous space by adapting Bayesian Optimization (BO) in the selection process of MCTS. Moreover, we use a customized Graph Neural Network (GNN) as our feature extractor to improve the learning performance. To the best of our knowledge, we are the first to apply the continuous MCTS method in autonomous driving. To validate our method, we conduct extensive experiments under several weakly and strongly interactive scenarios. The results show that our proposed method performs well in all tasks, and outperforms the learning-based continuous MCTS method and the state-of-the-art Reinforcement Learning (RL) baseline.
引用
收藏
页码:4493 / 4500
页数:8
相关论文
共 36 条
  • [1] [Anonymous], 2011, Neural Information Processing Systems
  • [2] [Anonymous], 2008, 22 ANN C NEUR INF PR
  • [3] Anthony T., 2019, POLICY GRADIENT SEAR
  • [4] Finite-time analysis of the multiarmed bandit problem
    Auer, P
    Cesa-Bianchi, N
    Fischer, P
    [J]. MACHINE LEARNING, 2002, 47 (2-3) : 235 - 256
  • [5] Bansal M., 2018, arXiv preprint arXiv:1812.03079
  • [6] Bishop C. M., 1994, Technical Report No. NCRG/94/004.
  • [7] A Survey of Monte Carlo Tree Search Methods
    Browne, Cameron B.
    Powley, Edward
    Whitehouse, Daniel
    Lucas, Simon M.
    Cowling, Peter I.
    Rohlfshagen, Philipp
    Tavener, Stephen
    Perez, Diego
    Samothrakis, Spyridon
    Colton, Simon
    [J]. IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES, 2012, 4 (01) : 1 - 43
  • [8] Bubeck S, 2011, J MACH LEARN RES, V12, P1655
  • [9] Cai P, 2019, ARXIV190512197
  • [10] PROGRESSIVE STRATEGIES FOR MONTE-CARLO TREE SEARCH
    Chaslot, Guillaume M. J-B.
    Winands, Mark H. M.
    Van den Herik, H. Jaap
    Uiterwijk, Jos W. H. M.
    Bouzy, Bruno
    [J]. NEW MATHEMATICS AND NATURAL COMPUTATION, 2008, 4 (03) : 343 - 357