Certainty weighted voting-based noise correction for crowdsourcing

被引:4
|
作者
Li, Huiru [1 ]
Jiang, Liangxiao [1 ]
Li, Chaoqun [2 ]
机构
[1] China Univ Geosci, Sch Comp Sci, Wuhan 430074, Peoples R China
[2] China Univ Geosci, Sch Math & Phys, Wuhan 430074, Peoples R China
基金
中国国家自然科学基金;
关键词
Crowdsourcing; Noise correction; Certainty; Class-dependent; Instance-dependent; Weighted voting; MODEL QUALITY; IMPROVING DATA; TOOL;
D O I
10.1016/j.patcog.2024.110325
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In crowdsourcing scenarios, we can obtain each instance's multiple noisy label set from different workers and then use a ground truth inference algorithm to infer its integrated label. Despite the effectiveness of ground truth inference algorithms, there is still a certain level of noise in integrated labels. To reduce the impact of noise, many noise correction algorithms have been proposed in recent years. To the best of our knowledge, almost all these algorithms assume that workers have the same labeling certainty on different classes and instances. However, it is rarely true in reality due to the differences in workers' individual preferences and cognitive abilities. In this paper, we argue that the labeling certainty of a worker should be class -dependent and instance -dependent. Based on this premise, we propose a certainty weighted voting -based noise correction (CWVNC) algorithm. At first, we use the consistency between worker -labeled labels and integrated labels on different classes to estimate the class -dependent certainty. Then, we train a probability -based classifier on the instances labeled by each worker separately and use it to estimate the instance -dependent certainty. Finally, we correct the integrated label of each instance by weighted voting based on class -dependent certainty and instance -dependent certainty. When the proposed algorithm CWVNC is examined, the average noise ratio of CWVNC on 34 simulated datasets is equal to 15.08%, and on two real -world datasets "Income"and "Music_genre"the noise ratio is equal to 25.77% and 26.94%, respectively. The results show that CWVNC significantly outperforms all other state-of-the-art noise correction algorithms used for comparison.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Certainty weighted voting-based noise correction for crowdsourcing
    Li, Huiru
    Jiang, Liangxiao
    Li, Chaoqun
    Pattern Recognition, 2024, 150
  • [2] Neighborhood Weighted Voting-Based Noise Correction for Crowdsourcing
    Li, Huiru
    Jiang, Liangxiao
    Xue, Siqing
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2023, 17 (07)
  • [3] A Weighted Voting-Based Associative Classification Algorithm
    Zhu, Xiaoyan
    Song, Qinbao
    Jia, Zihan
    COMPUTER JOURNAL, 2010, 53 (06): : 786 - 801
  • [4] Weighted voting-based robust image thresholding
    Rahnamayan, Shahryar
    Tizhoosh, Hanfid R.
    Salama, Magdy M. A.
    2006 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP 2006, PROCEEDINGS, 2006, : 1129 - +
  • [5] Weighted voting-based consensus clustering for chemical structure databases
    Saeed, Faisal
    Ahmed, Ali
    Shamsir, Mohd Shahir
    Salim, Naomie
    JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2014, 28 (06) : 675 - 684
  • [6] Weighted voting-based consensus clustering for chemical structure databases
    Faisal Saeed
    Ali Ahmed
    Mohd Shahir Shamsir
    Naomie Salim
    Journal of Computer-Aided Molecular Design, 2014, 28 : 675 - 684
  • [7] A Voting-based Intra Deinterlacing Method for Directional Error Correction
    Cho, Hye-Jeong
    Lee, Yeo-Song
    Oh, Sye-Hoon
    Oh, Seoung-Jun
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2010, 56 (03) : 1713 - 1721
  • [8] A Voting-based Intra Deinterlacing Method For Directional Error Correction
    Oh, Sye-Hoon
    Lee, Yeo-Song
    Oh, Seoung-Jun
    2010 DIGEST OF TECHNICAL PAPERS INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS ICCE, 2010,
  • [9] Differential Evolution-Based Weighted Majority Voting for Crowdsourcing
    Zhang, Hao
    Jiang, Liangxiao
    Xu, Wenqiang
    PRICAI 2018: TRENDS IN ARTIFICIAL INTELLIGENCE, PT II, 2018, 11013 : 228 - 236
  • [10] Heterogeneous Weighted Voting-Based Ensemble (HWVE) for Root-Cause Analysis
    Selvam, Blessy
    Ravimaran, S.
    Selvam, Sheba
    JOURNAL OF INFORMATION TECHNOLOGY RESEARCH, 2020, 13 (04) : 63 - 74