Learner-aware Teaching: Inverse Reinforcement Learning with Preferences and Constraints

被引:0
|
作者
Tschiatschek, Sebastian [1 ]
Ghosh, Ahana [2 ]
Haug, Luis [3 ]
Devidze, Rati [2 ]
Singla, Adish [2 ]
机构
[1] Microsoft Res, Redmond, WA 98052 USA
[2] MPI SWS, New York, NY USA
[3] Swiss Fed Inst Technol, Zurich, Switzerland
关键词
MAXIMUM CAUSAL ENTROPY;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Inverse reinforcement learning (IRL) enables an agent to learn complex behavior by observing demonstrations from a (near-)optimal policy. The typical assumption is that the learner's goal is to match the teacher's demonstrated behavior. In this paper, we consider the setting where the learner has its own preferences that it additionally takes into consideration. These preferences can for example capture behavioral biases, mismatched worldviews, or physical constraints. We study two teaching approaches: learner-agnosticteaching, where the teacher provides demonstrations from an optimal policy ignoring the learner's preferences, and learner-awareteaching, where the teacher accounts for the learner's preferences. We design learner-aware teaching algorithms and show that significant performance improvements can be achieved over learner-agnostic teaching.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] TEACHING LEARNING PREFERENCES OF THE ADULT LEARNER
    CHECK, JF
    EDUCATION, 1984, 105 (01): : 107 - 112
  • [2] LASIL: Learner-Aware Supervised Imitation Learning For Long-term Microscopic Traffic Simulation
    Guo, Ke
    Miao, Zhenwei
    Jing, Wei
    Liu, Weiwei
    Li, Weizi
    Hao, Dayang
    Pan, Jia
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 15386 - 15395
  • [3] DistHD: A Learner-Aware Dynamic Encoding Method for Hyperdimensional Classification
    Wang, Junyao
    Huang, Sitao
    Imani, Mohsen
    2023 60TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC, 2023,
  • [4] Inverse-Inverse Reinforcement Learning. How to Hide Strategy from an Adversarial Inverse Reinforcement Learner
    Pattanayak, Kunal
    Krishnamurthy, Vikram
    Berry, Christopher
    2022 IEEE 61ST CONFERENCE ON DECISION AND CONTROL (CDC), 2022, : 3631 - 3636
  • [5] Inverse Reinforcement Learning from a Gradient-based Learner
    Ramponi, Giorgia
    Drappo, Gianluca
    Restelli, Marcello
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [6] Machine Teaching for Human Inverse Reinforcement Learning
    Lee, Michael S.
    Admoni, Henny
    Simmons, Reid
    FRONTIERS IN ROBOTICS AND AI, 2021, 8
  • [7] Interactive Teaching Algorithms for Inverse Reinforcement Learning
    Kamalaruban, Parameswaran
    Devidze, Rati
    Cevher, Volkan
    Singla, Adish
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 2692 - 2700
  • [8] Machine Teaching for Inverse Reinforcement Learning: Algorithms and Applications
    Brown, Daniel S.
    Niekum, Scott
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 7749 - 7758
  • [9] Transaction-aware inverse reinforcement learning for trading in stock markets
    Sun, Qizhou
    Gong, Xueyuan
    Si, Yain-Whar
    APPLIED INTELLIGENCE, 2023, 53 (23) : 28186 - 28206
  • [10] Option-Aware Adversarial Inverse Reinforcement Learning for Robotic Control
    Chen, Jiayu
    Lan, Tian
    Aggarwal, Vaneet
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 5902 - 5908