Dynamic Preference Learning for Complex Cobotic Tasks

被引:0
作者
Vella, Elena M. [1 ]
Chapman, Airlie [1 ]
机构
[1] Univ Melbourne, Dept Mech Engn, Melbourne, Vic 3010, Australia
来源
IFAC PAPERSONLINE | 2023年 / 56卷 / 02期
关键词
Human-Robot Interaction; User Preferences; Control Design; Preference Learning;
D O I
10.1016/j.ifacol.2023.10.789
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In complex tasks requiring robots to collaborate with multiple human users for successful human-robot interaction, it is essential to capture user preferences in the control design. This work considers both the subjective judgements of a users preferences, to formulate a preference measure that can be used in control design. Novel to the work, is the consideration of the temporal impact that current preferences have on previously provided preferences. We formulate a weighted set-wise preference learning problem that considers the historical preferences of an individual, capturing the impact of memory on human user's preferences. We address the challenge of how to choose the best preference features as the individuals set-wise preference comparisons are sequentially presented by designing a weighted sequential preference estimator. We build upon the concept of an individual's preferences to a group of users' preferences, modelling group preferences as a normal distribution. We further extend the sequential individual preference estimation approach to multiple users' preference through the derivation of a stochastic gradient descent group preference estimator. An experimental study with 43 participants is conducted to support the proposed approaches. Copyright (c) 2023 The Authors.
引用
收藏
页码:6308 / 6313
页数:6
相关论文
共 24 条
  • [1] Agarwal Shivani, 2006, ACM International Conference Proceeding Series, P25, DOI [DOI 10.1145/1143844.1143848.2,4,9, 10.1145/1143844.1143848, DOI 10.1145/1143844.1143848]
  • [2] Akash K, 2017, P AMER CONTR CONF, P1542, DOI 10.23919/ACC.2017.7963172
  • [3] Alami R., 2006, 5 INT COGNITIVE ROBO
  • [4] [Anonymous], 2010, Advances in neural information processing systems
  • [5] Atkinson R.C., 1968, Psychology of learning and motivation, V2, P89, DOI [DOI 10.1016/S0079-7421, 10.1016/S0079-7421(08)60422-3, 10.1016/S0079-7421]
  • [6] The influence of affect on higher level cognition: A review of research on interpretation, judgement, decision making and reasoning
    Blanchette, Isabelle
    Richards, Anne
    [J]. COGNITION & EMOTION, 2010, 24 (04) : 561 - 595
  • [7] Warehousing in the e-commerce era: A survey
    Boysen, Nils
    de Koster, Rene
    Weidinger, Felix
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2019, 277 (02) : 396 - 411
  • [8] SOME TESTS OF THE DECAY THEORY OF IMMEDIATE MEMORY
    BROWN, J
    [J]. QUARTERLY JOURNAL OF EXPERIMENTAL PSYCHOLOGY, 1958, 10 (01): : 12 - 21
  • [9] Support vector ordinal regression
    Chu, Wei
    Keerthi, S. Sathiya
    [J]. NEURAL COMPUTATION, 2007, 19 (03) : 792 - 815
  • [10] Chu Wei, 2005, INT C MACH LEARN, P137