Behaviour Learning with Adaptive Motif Discovery and Interacting Multiple Model

被引:0
作者
Zhao, Hanqing [1 ]
Manderson, Travis [1 ]
Zhang, Hao [2 ]
Liu, Xue [1 ]
Dudek, Gregory [1 ]
机构
[1] McGill Univ, Sch Comp Sci, Montreal, PQ, Canada
[2] Tsinghua Univ, Dept Elect Engn, Beijing, Peoples R China
来源
2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) | 2022年
基金
加拿大自然科学与工程研究理事会;
关键词
ALGORITHM;
D O I
10.1109/IROS47612.2022.9981588
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We propose an approach that enables simultaneous interpretable learning of a high-level discrete behaviour and its low-level rhythmic sub-behaviour. We do this though a unified reward function, where a reward function that only describes low-level behaviour, with less impact on learning of other behaviours is recovered from few-shot motion demonstrations. To this end, we first extract local behaviour motifs from state-only human demonstrations and random driving samples using an adaptive motif discovery approach derived from the Matrix Profile algorithm. We then optimize parameters for motif discovery by maximizing the sum and entropy over motif sizes. Interacting Multiple Model (IMM) estimators are constructed on top of linear-Gaussian dynamics of discovered motifs, the cumulative distributions over motifs estimated by IMMs serve as the basis of the reward function. By combining the recovered reward with the terrain type signal gathered from the environment, we are able to train a dual-objective off-road vehicle controller that demonstrates both terrain selection and human-like driving behaviours. Compared with related approaches across 10 people, our rhythmic behaviour reward recovery approach enables the controller to produce higher preference over human driving demonstrations. In addition to performing more stable across different people with 87% less variance than the best baseline in rhythmic behaviour indicator, our method reduces the negative effects on higher-level behaviour learning while maintaining high interpretability at all stages of the algorithm.
引用
收藏
页码:10788 / 10794
页数:7
相关论文
共 50 条
  • [21] A Robust Interacting Multiple Model Smoother with Heavy-Tailed Measurement Noises
    Cui, Shuai
    Li, Zhi
    Yang, Yanbo
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 3574 - 3578
  • [22] Interacting Multiple Model Estimator for Networked Control Systems: Stability, Convergence, and Performance
    Lin, Hong
    Lam, James
    Chen, Michael Z. Q.
    Shu, Zhan
    Wu, Zheng-Guang
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2019, 64 (03) : 928 - 943
  • [23] Robust multiple model adaptive estimation for spacecraft autonomous navigation
    Xiong, K.
    Wei, C. L.
    Liu, L. D.
    AEROSPACE SCIENCE AND TECHNOLOGY, 2015, 42 : 249 - 258
  • [24] An adaptive sampling method for Kriging surrogate model with multiple outputs
    Zhai, Zhangming
    Li, Haiyang
    Wang, Xugang
    ENGINEERING WITH COMPUTERS, 2022, 38 (SUPPL 1) : 277 - 295
  • [25] Kernel-Based Adaptive Multiple Model Target Tracking
    Ghoshal, Debarshi Patanjali
    Gopalakrishnan, Kumar
    Michalska, Hannah
    2017 IEEE CONFERENCE ON CONTROL TECHNOLOGY AND APPLICATIONS (CCTA 2017), 2017, : 1338 - 1343
  • [26] Multiple Model Adaptive Estimator for Nonlinear System with Unknown Disturbance
    Xiong, Kai
    Wei, Chunling
    ASIAN JOURNAL OF CONTROL, 2015, 17 (06) : 2252 - 2262
  • [27] Small UAV Localization Based Strong Tracking Filters Augmented with Interacting Multiple Model
    Elzoghby, Mostafa
    Li, Fu
    Arif, Usman
    Arafa, Ibrahim I.
    PROCEEDINGS OF 2018 15TH INTERNATIONAL BHURBAN CONFERENCE ON APPLIED SCIENCES AND TECHNOLOGY (IBCAST), 2018, : 310 - 317
  • [28] Spacecraft autonomous navigation using multiple model adaptive estimator
    Xiong, Kai
    Wei, Chunling
    Liu, Liangdong
    AIRCRAFT ENGINEERING AND AEROSPACE TECHNOLOGY, 2015, 87 (05) : 465 - 475
  • [29] Fuzzy-logic-assisted interacting multiple model (FLAIMM) for mobile robot localization
    Lee, Hyoungki
    Jung, Jongdae
    Choi, Kiwan
    Park, Jiyoung
    Myung, Hyun
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2012, 60 (12) : 1592 - 1606
  • [30] Window based Multiple Model Adaptive Estimation for Navigational Framework
    Kottath, Rahul
    Poddar, Shashi
    Das, Amitava
    Kumar, Vipan
    AEROSPACE SCIENCE AND TECHNOLOGY, 2016, 50 : 88 - 95