Leveraging Inter-rater Agreement for Classification in the Presence of Noisy Labels

被引:6
作者
Bucarelli, Maria Sofia [2 ]
Cassanol, Lucas [1 ]
Siciliano, Federico [2 ]
Mantrachl, Amin [1 ]
Silvestri, Fabrizio [2 ,3 ]
机构
[1] Amazon, Buenos Aires, DF, Argentina
[2] Sapienza Univ Rome, Rome, Italy
[3] CNR, ISTI, Pisa, Italy
来源
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR | 2023年
基金
欧盟地平线“2020”;
关键词
D O I
10.1109/CVPR52729.2023.00335
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In practical settings, classification datasets are obtained through a labelling process that is usually done by humans. Labels can be noisy as they are obtained by aggregating the different individual labels assigned to the same sample by multiple, and possibly disagreeing, annotators. The inter-rater agreement on these datasets can be measured while the underlying noise distribution to which the labels are subject is assumed to be unknown. In this work, we: (i) show how to leverage the inter-annotator statistics to estimate the noise distribution to which labels are subject; (ii) introduce methods that use the estimate of the noise distribution to learn from the noisy dataset; and (iii) establish generalization bounds in the empirical risk minimization framework that depend on the estimated quantities. We conclude the paper by providing experiments that illustrate our findings.
引用
收藏
页码:3439 / 3448
页数:10
相关论文
共 31 条
  • [1] A COEFFICIENT OF AGREEMENT FOR NOMINAL SCALES
    COHEN, J
    [J]. EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1960, 20 (01) : 37 - 46
  • [2] Collins K. M., 2022, ELICITING LEARNING S, P2
  • [3] Feng L, 2020, PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P2206
  • [4] FLEISS JL, 1971, PSYCHOL BULL, V76, P378, DOI 10.1037/h0031619
  • [5] Ghosh A, 2017, AAAI CONF ARTIF INTE, P1919
  • [6] Making risk minimization tolerant to label noise
    Ghosh, Aritra
    Manwani, Naresh
    Sastry, P. S.
    [J]. NEUROCOMPUTING, 2015, 160 : 93 - 107
  • [7] Hendrycks D, 2018, ADV NEUR IN, V31
  • [8] Horn R.A., 2012, Matrix Analysis, DOI 10.1017/CBO9780511810817
  • [9] Khetan A., 2017, LEARNING NOISY SINGL
  • [10] L5 Dawid AP, 1979, Journal of the Royal Statistical Society (JRSS), V28, P20