Motion Planning With Success Judgement Model Based on Learning From Demonstration

被引:5
作者
Furuta, Daichi [1 ]
Kutsuzawa, Kyo [1 ]
Sakaino, Sho [2 ]
Tsuji, Toshiaki [1 ]
机构
[1] Saitama Univ, Grad Sch Sci & Engn, Saitama 3388570, Japan
[2] Univ Tsukuba, Grad Sch Syst & Informat Engn, Tsukuba, Ibaraki 3058573, Japan
关键词
Task analysis; Robots; Mathematical model; Training; Data models; Optimization; Force control; learning from demonstration; motion planning; robot learning; PREDICTIVE CONTROL; MANIPULATION;
D O I
10.1109/ACCESS.2020.2987604
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A technique named Learning from Demonstration allows robots to learn actions in a human living environment from the demonstrations directly. In a learning method from demonstrations directly, however, teaching actions cannot be reused between situations with different restrictions. In this study, we propose a method for training a success judgment model based on Learning from Demonstration and use this as a differentiable loss function of tasks. By formulating the constraints of the action in a manner in mathematical optimization and combining these constraints with the learned success judgment model into a loss function, an action generation model can be trained by the gradient method. This system was verified with the action of scooping up a pancake.
引用
收藏
页码:73142 / 73150
页数:9
相关论文
共 39 条
  • [1] Abbeel P., 2004, Apprenticeship learning via inverse reinforcement learning. pages, P1, DOI [DOI 10.1145/1015330.1015430, 10.1145/1015330.1015430]
  • [2] Adachi T, 2018, IEEE INT C INT ROBOT, P3648, DOI 10.1109/IROS.2018.8594489
  • [3] [Anonymous], ACTION RECOGNITION D
  • [4] [Anonymous], 2015, P ROBOTICS SCI SYSTE
  • [5] [Anonymous], 1997, Neural Computation
  • [6] [Anonymous], INT C LEARNING REPRE, DOI DOI 10.1145/1830483.1830503
  • [7] [Anonymous], ADV ROBOT
  • [8] [Anonymous], 2004, Open Dynamics Engine
  • [9] A survey of robot learning from demonstration
    Argall, Brenna D.
    Chernova, Sonia
    Veloso, Manuela
    Browning, Brett
    [J]. ROBOTICS AND AUTONOMOUS SYSTEMS, 2009, 57 (05) : 469 - 483
  • [10] Bengio S, 2015, ADV NEUR IN, V28