Behaviour Learning with Adaptive Motif Discovery and Interacting Multiple Model

被引：0

作者：

Zhao, Hanqing ^{[1
]}

Manderson, Travis ^{[1
]}

Zhang, Hao ^{[2
]}

Liu, Xue ^{[1
]}

Dudek, Gregory ^{[1
]}

机构：

[1] McGill Univ, Sch Comp Sci, Montreal, PQ, Canada

[2] Tsinghua Univ, Dept Elect Engn, Beijing, Peoples R China

来源：

2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) | 2022年

基金：

加拿大自然科学与工程研究理事会;

关键词：

ALGORITHM;

D O I：

10.1109/IROS47612.2022.9981588

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

We propose an approach that enables simultaneous interpretable learning of a high-level discrete behaviour and its low-level rhythmic sub-behaviour. We do this though a unified reward function, where a reward function that only describes low-level behaviour, with less impact on learning of other behaviours is recovered from few-shot motion demonstrations. To this end, we first extract local behaviour motifs from state-only human demonstrations and random driving samples using an adaptive motif discovery approach derived from the Matrix Profile algorithm. We then optimize parameters for motif discovery by maximizing the sum and entropy over motif sizes. Interacting Multiple Model (IMM) estimators are constructed on top of linear-Gaussian dynamics of discovered motifs, the cumulative distributions over motifs estimated by IMMs serve as the basis of the reward function. By combining the recovered reward with the terrain type signal gathered from the environment, we are able to train a dual-objective off-road vehicle controller that demonstrates both terrain selection and human-like driving behaviours. Compared with related approaches across 10 people, our rhythmic behaviour reward recovery approach enables the controller to produce higher preference over human driving demonstrations. In addition to performing more stable across different people with 87% less variance than the best baseline in rhythmic behaviour indicator, our method reduces the negative effects on higher-level behaviour learning while maintaining high interpretability at all stages of the algorithm.

引用

页码：10788 / 10794

页数：7

共 50 条

[41] An adaptive bio-inspired optimisation model based on the foraging behaviour of a social spider
Otor, Samera Uga
Akinyemi, Bodunde Odunola
Aladesanmi, Temitope Adegboye
Aderounmu, Ganiyu Adesola
Kamagate, B. H.
COGENT ENGINEERING, 2019, 6 (01):
[42] Mobile Target Tracking and Data Fusion Using Dual-Interacting Multiple Model System
Wann, Chin-Der
Shiu, Jia-Yu
2014 IEEE NINTH INTERNATIONAL CONFERENCE ON INTELLIGENT SENSORS, SENSOR NETWORKS AND INFORMATION PROCESSING (IEEE ISSNIP 2014), 2014,
[43] Speaker tracking based on distributed particle filter and interacting multiple model in distributed microphone networks
Wang, Ruifang
Chen, Zhe
Yin, Fuliang
APPLIED ACOUSTICS, 2021, 174
[44] Adaptive Active Noise Suppression Using Multiple Model Switching Strategy
Huang, Quanzhen
Chen, Suxia
Huang, Mingming
Guo, Zhuangzhi
SHOCK AND VIBRATION, 2017, 2017
[45] Multi-Step Learning and Adaptive Search for Learning Complex Model Transformations from Examples
Baki, Islem
Sahraoui, Houari
ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2016, 25 (03)
[46] A-iLearn: An adaptive incremental learning model for spoof fingerprint detection
Agarwal, Shivang
Rattani, Ajita
Chowdary, C. Ravindranath
MACHINE LEARNING WITH APPLICATIONS, 2022, 7
[47] Hierarchical linear and nonlinear adaptive learning model for system identification and prediction
Abu Jami'in, Mohammad
Anam, Khairul
Rulaningtyas, Riries
Mudjiono, Urip
Adianto, Adianto
Wee, Hui-Ming
APPLIED INTELLIGENCE, 2020, 50 (06) : 1699 - 1710
[48] Maximum correntropy quadrature Kalman filter based interacting multiple model approach for maneuvering target tracking
Liu, Bao
Wu, Ziwei
SIGNAL IMAGE AND VIDEO PROCESSING, 2025, 19 (01)
[49] Reliable flight performance assessment of multirotor based on interacting multiple model particle filter and health degree
Zhao, Zhiyao
Yao, Peng
Wang, Xiaoyi
Xu, Jiping
Wang, Li
Yu, Jiabin
CHINESE JOURNAL OF AERONAUTICS, 2019, 32 (02) : 444 - 453
[50] Runtime Optimization in Interacting Multiple Model Filtering with Down-Sampling and Out-of-Sequence Measurements
Ketterer, Pascal
Hoher, Patrick
Reuter, Johannes
2024 27TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION, FUSION 2024, 2024,

← 1 2 3 4 5 →