Reinforcement Learning Requires Human-in-the-Loop Framing and Approaches

被引:0
作者
Taylor, Matthew E. [1 ,2 ,3 ]
机构
[1] Univ Alberta, Edmonton, AB, Canada
[2] Alberta Machine Intelligence Inst, Edmonton, AB, Canada
[3] AI Redefined, Montreal, PQ, Canada
来源
HHAI 2023: AUGMENTING HUMAN INTELLECT | 2023年 / 368卷
基金
加拿大自然科学与工程研究理事会;
关键词
Reinforcement Learning; Human-Agent Interaction; Human in the Loop; Interactive Machine Learning;
D O I
10.3233/FAIA230098
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reinforcement learning (RL) is typically framed as a machine learning paradigm where agents learn to act autonomously in complex environments. This paper argues instead that RL is fundamentally human in the loop (HitL). The reward functions (and other components) of a Markov decision process are defined by humans. The decisions to tackle a certain problem, and deploy a learned solution, are taken by humans. Humans can also play a critical role in providing information to the agent throughout its life cycle to better succeed at the problem in question. We end by highlighting a set of critical HitL research questions, which, if ignored, could cause RL to fail to live up to its full potential.
引用
收藏
页码:351 / 360
页数:10
相关论文
共 25 条
  • [1] A survey of robot learning from demonstration
    Argall, Brenna D.
    Chernova, Sonia
    Veloso, Manuela
    Browning, Brett
    [J]. ROBOTICS AND AUTONOMOUS SYSTEMS, 2009, 57 (05) : 469 - 483
  • [2] Bacon PL, 2017, AAAI CONF ARTIF INTE, P1726
  • [3] Bagaria A, 2020, INT C LEARNING REPRE
  • [4] A conceptual framework for externally-influenced agents: an assisted reinforcement learning review
    Bignold, Adam
    Cruz, Francisco
    Taylor, Matthew E.
    Brys, Tim
    Dazeley, Richard
    Vamplew, Peter
    Foale, Cameron
    [J]. JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 14 (4) : 3621 - 3644
  • [5] Chen C, 2022, PR MACH LEARN RES
  • [6] Christiano PF, 2017, ADV NEUR IN, V30
  • [7] Clark J., 2016, Faulty reward functions in the wild
  • [8] Dulac-Arnold G., 2019, CHALLENGES REAL WORL
  • [9] AutoML: A survey of the state-of-the-art
    He, Xin
    Zhao, Kaiyong
    Chu, Xiaowen
    [J]. KNOWLEDGE-BASED SYSTEMS, 2021, 212
  • [10] Hu Y, 2020, P 34 INT C NEUR INF