Zero-Shot Assistance in Sequential Decision Problems

Cited by: 0
Authors
De Peuter, Sebastiaan [1]
Kaski, Samuel [1, 2]
Affiliations
[1] Aalto Univ, Dept Comp Sci, Espoo, Finland
[2] Univ Manchester, Dept Comp Sci, Manchester, Lancs, England
Source
THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 10 | 2023
Keywords
MODEL
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We consider the problem of creating assistants that can help agents solve new sequential decision problems, assuming the agent is not able to specify the reward function explicitly to the assistant. Instead of acting in place of the agent as in current automation-based approaches, we give the assistant an advisory role and keep the agent in the loop as the main decision maker. The difficulty is that we must account for potential biases of the agent which may cause it to seemingly irrationally reject advice. To do this we introduce a novel formalization of assistance that models these biases, allowing the assistant to infer and adapt to them. We then introduce a new method for planning the assistant's actions which can scale to large decision making problems. We show experimentally that our approach adapts to these agent biases, and results in higher cumulative reward for the agent than automation-based alternatives. Lastly, we show that an approach combining advice and automation outperforms advice alone at the cost of losing some safety guarantees.
Pages: 11551-11559
Number of pages: 9