Regret Analysis of Repeated Delegated Choice

被引:0
作者
Hajiaghayi, MohammadTaghi [1 ]
Mahdavi, Mohammad [1 ]
Rezaei, Keivan [1 ]
Shin, Suho [1 ]
机构
[1] Univ Maryland, College Pk, MD 20742 USA
来源
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 9 | 2024年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a study on a repeated delegated choice problem, which is the first to consider an online learning variant of Kleinberg and Kleinberg, EC'18. In this model, a principal interacts repeatedly with an agent who possesses an exogenous set of solutions to search for efficient ones. Each solution can yield varying utility for both the principal and the agent, and the agent may propose a solution to maximize its own utility in a selfish manner. To mitigate this behavior, the principal announces an eligible set which screens out a certain set of solutions. The principal, however, does not have any information on the distribution of solutions nor the number of solutions in advance. Therefore, the principal dynamically announces various eligible sets to efficiently learn the distribution. The principal's objective is to minimize cumulative regret compared to the optimal eligible set in hindsight. We explore two dimensions of the problem setup, whether the agent behaves myopically or strategizes across the rounds, and whether the solutions yield deterministic or stochastic utility. We obtain sublinear regret upper bounds in various regimes, and derive corresponding lower bounds which implies the tightness of the results. Overall, we bridge a well-known problem in economics to the evolving area of online learning, and present a comprehensive study in this problem.
引用
收藏
页码:9757 / 9764
页数:8
相关论文
共 25 条
[1]   Optimal delegation [J].
Alonso, Ricardo ;
Matouschek, Niko .
REVIEW OF ECONOMIC STUDIES, 2008, 75 (01) :259-293
[2]   The Theory of Optimal Delegation With an Application to Tariff Caps [J].
Amador, Manuel ;
Bagwell, Kyle .
ECONOMETRICA, 2013, 81 (04) :1541-1599
[3]  
Amin Kareem, 2013, Advances in Neural Information Processing Systems, V26
[4]  
[Anonymous], 2012, P 11 INT C AUT AG MU
[5]   A Model of Delegated Project Choice [J].
Armstrong, Mark ;
Vickers, John .
ECONOMETRICA, 2010, 78 (01) :213-244
[6]   Posting prices with unknown distributions [J].
Babaioff, Moshe ;
Blumrosen, Liad ;
Dughmi, Shaddin ;
Singer, Yaron .
ACM Transactions on Economics and Computation, 2017, 5 (02)
[7]  
Bai Yutong, 2021, Advances in Neural Information Processing Systems, V34
[8]   Theories of delegation [J].
Bendor, J ;
Glazer, A ;
Hammond, T .
ANNUAL REVIEW OF POLITICAL SCIENCE, 2001, 4 :235-269
[9]  
Birmpas Georgios, 2020, Advances in Neural Information Processing Systems, V33, P20624
[10]  
Guo Y., 2021, Dynamic allocation without money