Behaviour Learning with Adaptive Motif Discovery and Interacting Multiple Model

被引:0
作者
Zhao, Hanqing [1 ]
Manderson, Travis [1 ]
Zhang, Hao [2 ]
Liu, Xue [1 ]
Dudek, Gregory [1 ]
机构
[1] McGill Univ, Sch Comp Sci, Montreal, PQ, Canada
[2] Tsinghua Univ, Dept Elect Engn, Beijing, Peoples R China
来源
2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) | 2022年
基金
加拿大自然科学与工程研究理事会;
关键词
ALGORITHM;
D O I
10.1109/IROS47612.2022.9981588
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We propose an approach that enables simultaneous interpretable learning of a high-level discrete behaviour and its low-level rhythmic sub-behaviour. We do this though a unified reward function, where a reward function that only describes low-level behaviour, with less impact on learning of other behaviours is recovered from few-shot motion demonstrations. To this end, we first extract local behaviour motifs from state-only human demonstrations and random driving samples using an adaptive motif discovery approach derived from the Matrix Profile algorithm. We then optimize parameters for motif discovery by maximizing the sum and entropy over motif sizes. Interacting Multiple Model (IMM) estimators are constructed on top of linear-Gaussian dynamics of discovered motifs, the cumulative distributions over motifs estimated by IMMs serve as the basis of the reward function. By combining the recovered reward with the terrain type signal gathered from the environment, we are able to train a dual-objective off-road vehicle controller that demonstrates both terrain selection and human-like driving behaviours. Compared with related approaches across 10 people, our rhythmic behaviour reward recovery approach enables the controller to produce higher preference over human driving demonstrations. In addition to performing more stable across different people with 87% less variance than the best baseline in rhythmic behaviour indicator, our method reduces the negative effects on higher-level behaviour learning while maintaining high interpretability at all stages of the algorithm.
引用
收藏
页码:10788 / 10794
页数:7
相关论文
共 50 条
  • [41] An adaptive bio-inspired optimisation model based on the foraging behaviour of a social spider
    Otor, Samera Uga
    Akinyemi, Bodunde Odunola
    Aladesanmi, Temitope Adegboye
    Aderounmu, Ganiyu Adesola
    Kamagate, B. H.
    COGENT ENGINEERING, 2019, 6 (01):
  • [42] Mobile Target Tracking and Data Fusion Using Dual-Interacting Multiple Model System
    Wann, Chin-Der
    Shiu, Jia-Yu
    2014 IEEE NINTH INTERNATIONAL CONFERENCE ON INTELLIGENT SENSORS, SENSOR NETWORKS AND INFORMATION PROCESSING (IEEE ISSNIP 2014), 2014,
  • [43] Speaker tracking based on distributed particle filter and interacting multiple model in distributed microphone networks
    Wang, Ruifang
    Chen, Zhe
    Yin, Fuliang
    APPLIED ACOUSTICS, 2021, 174
  • [44] Adaptive Active Noise Suppression Using Multiple Model Switching Strategy
    Huang, Quanzhen
    Chen, Suxia
    Huang, Mingming
    Guo, Zhuangzhi
    SHOCK AND VIBRATION, 2017, 2017
  • [45] Multi-Step Learning and Adaptive Search for Learning Complex Model Transformations from Examples
    Baki, Islem
    Sahraoui, Houari
    ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2016, 25 (03)
  • [46] A-iLearn: An adaptive incremental learning model for spoof fingerprint detection
    Agarwal, Shivang
    Rattani, Ajita
    Chowdary, C. Ravindranath
    MACHINE LEARNING WITH APPLICATIONS, 2022, 7
  • [47] Hierarchical linear and nonlinear adaptive learning model for system identification and prediction
    Abu Jami'in, Mohammad
    Anam, Khairul
    Rulaningtyas, Riries
    Mudjiono, Urip
    Adianto, Adianto
    Wee, Hui-Ming
    APPLIED INTELLIGENCE, 2020, 50 (06) : 1699 - 1710
  • [48] Maximum correntropy quadrature Kalman filter based interacting multiple model approach for maneuvering target tracking
    Liu, Bao
    Wu, Ziwei
    SIGNAL IMAGE AND VIDEO PROCESSING, 2025, 19 (01)
  • [49] Reliable flight performance assessment of multirotor based on interacting multiple model particle filter and health degree
    Zhao, Zhiyao
    Yao, Peng
    Wang, Xiaoyi
    Xu, Jiping
    Wang, Li
    Yu, Jiabin
    CHINESE JOURNAL OF AERONAUTICS, 2019, 32 (02) : 444 - 453
  • [50] Runtime Optimization in Interacting Multiple Model Filtering with Down-Sampling and Out-of-Sequence Measurements
    Ketterer, Pascal
    Hoher, Patrick
    Reuter, Johannes
    2024 27TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION, FUSION 2024, 2024,