Behaviour Learning with Adaptive Motif Discovery and Interacting Multiple Model

被引:0
作者
Zhao, Hanqing [1 ]
Manderson, Travis [1 ]
Zhang, Hao [2 ]
Liu, Xue [1 ]
Dudek, Gregory [1 ]
机构
[1] McGill Univ, Sch Comp Sci, Montreal, PQ, Canada
[2] Tsinghua Univ, Dept Elect Engn, Beijing, Peoples R China
来源
2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) | 2022年
基金
加拿大自然科学与工程研究理事会;
关键词
ALGORITHM;
D O I
10.1109/IROS47612.2022.9981588
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We propose an approach that enables simultaneous interpretable learning of a high-level discrete behaviour and its low-level rhythmic sub-behaviour. We do this though a unified reward function, where a reward function that only describes low-level behaviour, with less impact on learning of other behaviours is recovered from few-shot motion demonstrations. To this end, we first extract local behaviour motifs from state-only human demonstrations and random driving samples using an adaptive motif discovery approach derived from the Matrix Profile algorithm. We then optimize parameters for motif discovery by maximizing the sum and entropy over motif sizes. Interacting Multiple Model (IMM) estimators are constructed on top of linear-Gaussian dynamics of discovered motifs, the cumulative distributions over motifs estimated by IMMs serve as the basis of the reward function. By combining the recovered reward with the terrain type signal gathered from the environment, we are able to train a dual-objective off-road vehicle controller that demonstrates both terrain selection and human-like driving behaviours. Compared with related approaches across 10 people, our rhythmic behaviour reward recovery approach enables the controller to produce higher preference over human driving demonstrations. In addition to performing more stable across different people with 87% less variance than the best baseline in rhythmic behaviour indicator, our method reduces the negative effects on higher-level behaviour learning while maintaining high interpretability at all stages of the algorithm.
引用
收藏
页码:10788 / 10794
页数:7
相关论文
共 50 条
  • [31] A Hybrid Model and Learning-Based Adaptive Navigation Filter
    Or, Barak
    Klein, Itzik
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
  • [32] Adaptive multiple graph regularized semi-supervised extreme learning machine
    Yi, Yugen
    Qiao, Shaojie
    Zhou, Wei
    Zheng, Caixia
    Liu, Qinghua
    Wang, Jianzhong
    SOFT COMPUTING, 2018, 22 (11) : 3545 - 3562
  • [33] A Novel Fault-Prognostic Approach Based on Interacting Multiple Model Filters and Fuzzy Systems
    Cosme, Luciana Balieiro
    Caminhas, Walmir Matos
    Silveira Vasconcelos D'Angelo, Marcos Flavio
    Palhares, Reinaldo Martinez
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2019, 66 (01) : 519 - 528
  • [34] Energy-efficient model using optimal route discovery based on adaptive spider monkey optimization model
    John, Jean Justus
    Muthukrishnan, Anuradha
    Soundaram, Jothi
    INTERNATIONAL JOURNAL OF COMMUNICATION SYSTEMS, 2022, 35 (14)
  • [35] Improper Complex-Valued Multiple-Model Adaptive Estimation
    Mohammadi, Arash
    Plataniotis, Konstantinos N.
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2015, 63 (06) : 1528 - 1542
  • [36] Distributed interacting multiple model H∞ filtering fusion for multiplatform maneuvering target tracking in clutter
    Li, Wenling
    Jia, Yingmin
    SIGNAL PROCESSING, 2010, 90 (05) : 1655 - 1668
  • [37] Spherical Simplex Unscented Kalman Filter-Based Jumping and Static Interacting Multiple Model
    Pan, Yi
    Ye, Hui
    He, Keke
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2018, 32 (04)
  • [38] Interacting Multiple Model Unscented Filter for Tracking a Ballistic Missile during Its Boost Phase
    Battistini, Simone
    Menegaz, Henrique M. T.
    2017 IEEE AEROSPACE CONFERENCE, 2017,
  • [39] Fuzzy-Logic-Assisted Interacting Multiple Model (FLAIMM) for Mobile Robot Slip Compensation
    Jung, Jongdae
    Lee, Hyoung-Ki
    Myung, Hyun
    2012 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2012,
  • [40] Investigation of comfort temperature, adaptive model and the window-opening behaviour in Japanese houses
    Rijal, Hom B.
    Honjo, Miho
    Kobayashi, Ryota
    Nakaya, Takashi
    ARCHITECTURAL SCIENCE REVIEW, 2013, 56 (01) : 54 - 69