Post-hoc Estimators for Learning to Defer to an Expert

被引:0
作者
Narasimhan, Harikrishna [1 ]
Jitkrittum, Wittawat [2 ]
Menon, Aditya Krishna [2 ]
Rawat, Ankit Singh [2 ]
Kumar, Sanjiv [2 ]
机构
[1] Google Res, Mountain View, CA 94043 USA
[2] Google Res, New York, NY USA
来源
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022) | 2022年
关键词
CLASSIFICATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many practical settings allow a classifier to defer predictions to one or more costly experts. For example, the learning to defer paradigm allows a classifier to defer to a human expert, at some monetary cost. Similarly, the adaptive inference paradigm allows a base model to defer to one or more large models, at some computational cost. The goal in these settings is to learn classification and deferral mechanisms to optimise a suitable accuracy-cost tradeoff. To achieve this, a central issue studied in prior work is the design of a coherent loss function for both mechanisms. In this work, we demonstrate that existing losses can underfit the training set when there is a non-trivial deferral cost, owing to an implicit application of a high level of label smoothing. To resolve this, we propose two post-hoc estimators that fit a deferral function on top of a base model, either by threshold correction, or by learning when the base model's error rate exceeds the cost of deferring to the expert. Both approaches are equipped with theoretical guarantees, and empirically yield effective accuracy-cost tradeoffs on learning to defer and adaptive inference benchmarks.
引用
收藏
页数:13
相关论文
共 38 条
  • [11] Elkan, 2001, P 17 INT JOINT C ART, V17, P973
  • [12] Goyal P., 2017, CORR, Vabs/1706.02677, P1
  • [13] Han Yizeng, 2021, ABS210204906 CORR
  • [14] He K., 2016, 2016 IEEE C COMP VIS, DOI [DOI 10.1109/CVPR.2016.90, 10.1109/CVPR.2016.90]
  • [15] Huang G., 2018, INT C LEARN REPR ICL, P1
  • [16] Krizhevsky A., 2009, Technical report, P1
  • [17] Multicategory support vector machines: Theory and application to the classification of microarray data and satellite radiance data
    Lee, YK
    Lin, Y
    Wahba, G
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2004, 99 (465) : 67 - 81
  • [18] Madras D., 2018, Advances in Neural Information Processing Systems, P6150
  • [19] Mozannar H., 2020, ICML, P7076
  • [20] Narasimhan H, 2015, PR MACH LEARN RES, V37, P2398