Leveraging Inter-rater Agreement for Classification in the Presence of Noisy Labels

被引：6

作者：

Bucarelli, Maria Sofia ^{[2
]}

Cassanol, Lucas ^{[1
]}

Siciliano, Federico ^{[2
]}

Mantrachl, Amin ^{[1
]}

Silvestri, Fabrizio ^{[2
,3
]}

机构：

[1] Amazon, Buenos Aires, DF, Argentina

[2] Sapienza Univ Rome, Rome, Italy

[3] CNR, ISTI, Pisa, Italy

来源：

2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR | 2023年

基金：

欧盟地平线“2020”;

关键词：

D O I：

10.1109/CVPR52729.2023.00335

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In practical settings, classification datasets are obtained through a labelling process that is usually done by humans. Labels can be noisy as they are obtained by aggregating the different individual labels assigned to the same sample by multiple, and possibly disagreeing, annotators. The inter-rater agreement on these datasets can be measured while the underlying noise distribution to which the labels are subject is assumed to be unknown. In this work, we: (i) show how to leverage the inter-annotator statistics to estimate the noise distribution to which labels are subject; (ii) introduce methods that use the estimate of the noise distribution to learn from the noisy dataset; and (iii) establish generalization bounds in the empirical risk minimization framework that depend on the estimated quantities. We conclude the paper by providing experiments that illustrate our findings.

引用

页码：3439 / 3448

页数：10

共 31 条

[1] A COEFFICIENT OF AGREEMENT FOR NOMINAL SCALES
COHEN, J
[J]. EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1960, 20 (01) : 37 - 46
[2] Collins K. M., 2022, ELICITING LEARNING S, P2
[3] Feng L, 2020, PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P2206
[4] FLEISS JL, 1971, PSYCHOL BULL, V76, P378, DOI 10.1037/h0031619
[5] Ghosh A, 2017, AAAI CONF ARTIF INTE, P1919
[6] Making risk minimization tolerant to label noise
Ghosh, Aritra
Manwani, Naresh
Sastry, P. S.
[J]. NEUROCOMPUTING, 2015, 160 : 93 - 107
[7] Hendrycks D, 2018, ADV NEUR IN, V31
[8] Horn R.A., 2012, Matrix Analysis, DOI 10.1017/CBO9780511810817
[9] Khetan A., 2017, LEARNING NOISY SINGL
[10] L5 Dawid AP, 1979, Journal of the Royal Statistical Society (JRSS), V28, P20

← 1 2 3 4 →