Active Bayesian Assessment of Black-Box Classifiers

被引:0
作者
Ji, Disi [1 ]
Logan, Robert L. [1 ]
Smyth, Padhraic [1 ]
Steyvers, Mark [2 ]
机构
[1] Univ Calif Irvine, Dept Comp Sci, Irvine, CA 92697 USA
[2] Univ Calif Irvine, Dept Cognit Sci, Irvine, CA 92717 USA
来源
THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE | 2021年 / 35卷
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent advances in machine learning have led to increased deployment of black-box classifiers across a wide variety of applications. In many such situations there is a critical need to both reliably assess the performance of these pre-trained models and to perform this assessment in a label-efficient manner (given that labels may be scarce and costly to collect). In this paper, we introduce an active Bayesian approach for assessment of classifier performance to satisfy the desiderata of both reliability and label-efficiency. We begin by developing inference strategies to quantify uncertainty for common assessment metrics such as accuracy, misclassification cost, and calibration error. We then propose a general framework for active Bayesian assessment using inferred uncertainty to guide efficient selection of instances for labeling, enabling better performance assessment with fewer labels. We demonstrate significant gains from our proposed active Bayesian approach via a series of systematic empirical experiments assessing the performance of modern neural classifiers (e.g., ResNet and BERT) on several standard image and text classification datasets.
引用
收藏
页码:7935 / 7944
页数:10
相关论文
共 50 条
[41]   An Evolutionary-Based Black-Box Attack to Deep Neural Network Classifiers [J].
Yutian Zhou ;
Yu-an Tan ;
Quanxin Zhang ;
Xiaohui Kuang ;
Yahong Han ;
Jingjing Hu .
Mobile Networks and Applications, 2021, 26 :1616-1629
[42]   Post-hoc explanation of black-box classifiers using confident itemsets [J].
Moradi, Milad ;
Samwald, Matthias .
EXPERT SYSTEMS WITH APPLICATIONS, 2021, 165
[43]   Black-box Generation of Adversarial Text Sequences to Evade Deep Learning Classifiers [J].
Gao, Ji ;
Lanchantin, Jack ;
Soffa, Mary Lou ;
Qi, Yanjun .
2018 IEEE SYMPOSIUM ON SECURITY AND PRIVACY WORKSHOPS (SPW 2018), 2018, :50-56
[44]   THE MATHEMATICAL WORLD IN THE BLACK-BOX - SIGNIFICANCE OF THE BLACK-BOX AS A MEDIUM OF MATHEMATIZING [J].
MAASS, J ;
SCHLOGLMANN, W .
CYBERNETICS AND SYSTEMS, 1988, 19 (04) :295-309
[45]   WebRTC Quality Assessment: Dangers of Black-box Testing [J].
Cinar, Yusuf ;
Melvin, Hugh .
2014 10TH INTERNATIONAL CONFERENCE ON DIGITAL TECHNOLOGIES (DT), 2014, :31-35
[46]   INSIDE THE BLACK-BOX [J].
HORGAN, J .
IEEE SPECTRUM, 1986, 23 (11) :65-65
[47]   DORMANCY - THE BLACK-BOX [J].
SEELEY, SD .
HORTSCIENCE, 1994, 29 (11) :1248-1248
[48]   THE TRAGEDY OF THE BLACK-BOX [J].
DUNTEMANN, J .
DR DOBBS JOURNAL, 1991, 16 (12) :123-+
[49]   INSIDE BLACK-BOX [J].
DEAN, DS .
NON-DESTRUCTIVE TESTING, 1970, 3 (03) :181-&
[50]   BLACK-BOX BLUES [J].
SNYDER, EL .
DISCOVER, 1984, 5 (08) :6-6