Monte-Carlo Planning for Agile Legged Locomotion

被引:0
作者
Clary, Patrick [1 ]
Morais, Pedro [1 ]
Fern, Alan [1 ]
Hurst, Jonathan [1 ]
机构
[1] Oregon State Univ, Collaborat Robot & Intelligent Syst Inst, Corvallis, OR 97331 USA
来源
TWENTY-EIGHTH INTERNATIONAL CONFERENCE ON AUTOMATED PLANNING AND SCHEDULING (ICAPS 2018) | 2018年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent progress in legged locomotion research has produced robots that can perform agile blind-walking with robustness comparable to a blindfolded human. However, this walking approach has not yet been integrated with planners for high-level activities. In this paper, we take a step towards high-level task planning for these robots by studying a planar simulated biped that captures their essential dynamics. We investigate variants of Monte-Carlo Tree Search (MCTS) for selecting an appropriate blind-walking controller at each decision cycle. In particular, we consider UCT with an intelligently selected rollout policy, which is shown to be capable of guiding the biped through treacherous terrain. In addition, we develop a new MCTS variant, called Monte-Carlo Discrepancy Search (MCDS), which is shown to make more effective use of limited planning time than UCT for this domain. We demonstrate the effectiveness of these planners in both deterministic and stochastic environments across a range of algorithm parameters. In addition, we present results for using these planners to control a full-order 3D simulation of Cassie, an agile bipedal robot, through complex terrain.
引用
收藏
页码:446 / 450
页数:5
相关论文
共 12 条
[1]   A Survey of Monte Carlo Tree Search Methods [J].
Browne, Cameron B. ;
Powley, Edward ;
Whitehouse, Daniel ;
Lucas, Simon M. ;
Cowling, Peter I. ;
Rohlfshagen, Philipp ;
Tavener, Stephen ;
Perez, Diego ;
Samothrakis, Spyridon ;
Colton, Simon .
IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES, 2012, 4 (01) :1-43
[2]   Optimization-based Full Body Control for the DARPA Robotics Challenge [J].
Feng, Siyuan ;
Whitman, Eric ;
Xinjilefu, X. ;
Atkeson, Christopher G. .
JOURNAL OF FIELD ROBOTICS, 2015, 32 (02) :293-312
[3]  
Heess N, 2017, EMERGENCE LOCOMOTION
[4]  
Hubicki C. M., 2017, IEEE ROBOTICS AUTOMA
[5]   Team IHMC's Lessons Learned from the DARPA Robotics Challenge: Finding Data in the Rubble [J].
Johnson, Matthew ;
Shrewsbury, Brandon ;
Bertrand, Sylvain ;
Calvert, Duncan ;
Wu, Tingfan ;
Duran, Daniel ;
Stephen, Douglas ;
Mertins, Nathan ;
Carff, John ;
Rifenburgh, William ;
Smith, Jesper ;
Schmidt-Wetekam, Chris ;
Faconti, Davide ;
Graber-Tilton, Alex ;
Eyssette, Nicolas ;
Meier, Tobias ;
Kalkov, Igor ;
Craig, Travis ;
Payton, Nick ;
McCrory, Stephen ;
Wiedebach, Georg ;
Layton, Brooke ;
Neuhaus, Peter ;
Pratt, Jerry .
JOURNAL OF FIELD ROBOTICS, 2017, 34 (02) :241-261
[6]   Bandit based Monte-Carlo planning [J].
Kocsis, Levente ;
Szepesvari, Csaba .
MACHINE LEARNING: ECML 2006, PROCEEDINGS, 2006, 4212 :282-293
[7]   Optimization-based locomotion planning, estimation, and control design for the atlas humanoid robot [J].
Kuindersma, Scott ;
Deits, Robin ;
Fallon, Maurice ;
Valenzuela, Andres ;
Dai, Hongkai ;
Permenter, Frank ;
Koolen, Twan ;
Marion, Pat ;
Tedrake, Russ .
AUTONOMOUS ROBOTS, 2016, 40 (03) :429-455
[8]   Visual Control of Foot Placement When Walking Over Complex Terrain [J].
Matthis, Jonathan S. ;
Fajen, Brett R. .
JOURNAL OF EXPERIMENTAL PSYCHOLOGY-HUMAN PERCEPTION AND PERFORMANCE, 2014, 40 (01) :106-115
[9]  
Peng X. B., 2017, T GRAPHICS P SIGGRAP
[10]  
Raibert M. H., 1986, LEGGED ROBOTS BALANC, DOI DOI 10.1109/MEX.1986.4307016