Certainty weighted voting-based noise correction for crowdsourcing

Cited by: 5
Authors
Li, Huiru [1]
Jiang, Liangxiao [1]
Li, Chaoqun [2]
Affiliations
[1] China Univ Geosci, Sch Comp Sci, Wuhan 430074, Peoples R China
[2] China Univ Geosci, Sch Math & Phys, Wuhan 430074, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Crowdsourcing; Noise correction; Certainty; Class-dependent; Instance-dependent; Weighted voting; MODEL QUALITY; IMPROVING DATA; TOOL
DOI
10.1016/j.patcog.2024.110325
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In crowdsourcing scenarios, we can obtain a multiple noisy label set for each instance from different workers and then use a ground truth inference algorithm to infer its integrated label. Despite the effectiveness of ground truth inference algorithms, a certain level of noise remains in the integrated labels. To reduce the impact of this noise, many noise correction algorithms have been proposed in recent years. To the best of our knowledge, almost all of these algorithms assume that workers have the same labeling certainty on different classes and instances. However, this is rarely true in reality because workers differ in their individual preferences and cognitive abilities. In this paper, we argue that the labeling certainty of a worker should be class-dependent and instance-dependent. Based on this premise, we propose a certainty weighted voting-based noise correction (CWVNC) algorithm. First, we use the consistency between worker-assigned labels and integrated labels on different classes to estimate the class-dependent certainty. Then, we train a probability-based classifier on the instances labeled by each worker separately and use it to estimate the instance-dependent certainty. Finally, we correct the integrated label of each instance by weighted voting based on the class-dependent certainty and the instance-dependent certainty. In experiments, the average noise ratio of CWVNC on 34 simulated datasets is 15.08%, and on the two real-world datasets "Income" and "Music_genre" the noise ratios are 25.77% and 26.94%, respectively. The results show that CWVNC significantly outperforms all other state-of-the-art noise correction algorithms used for comparison.
Pages: 9
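
The three steps named in the abstract (class-dependent certainty, instance-dependent certainty, weighted voting) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract gives no formulas, so the sketch assumes that class-dependent certainty is the fraction of a worker's labels of a given class that agree with the integrated label, that instance-dependent certainty is the probability a per-worker classifier assigns to the label that worker gave, and that the two are combined by multiplication. The function names and the `instance_proba` input (precomputed probabilities from any classifier that outputs class probabilities) are hypothetical.

```python
from collections import defaultdict


def class_certainty(worker_labels, integrated):
    """Assumed estimate: per worker and class, the share of that worker's
    labels of the class that match the integrated label.

    worker_labels: {worker: {instance: label}}
    integrated:    {instance: label}
    """
    certainty = {}
    for worker, labels in worker_labels.items():
        agree = defaultdict(int)
        total = defaultdict(int)
        for inst, lab in labels.items():
            total[lab] += 1
            if integrated[inst] == lab:
                agree[lab] += 1
        certainty[worker] = {c: agree[c] / total[c] for c in total}
    return certainty


def correct_labels(worker_labels, integrated, instance_proba):
    """Re-vote each integrated label with certainty-weighted worker votes.

    instance_proba: {worker: {instance: {label: probability}}}, assumed to
    come from a probabilistic classifier trained on that worker's labels.
    """
    cls_cert = class_certainty(worker_labels, integrated)
    corrected = {}
    for inst, current in integrated.items():
        scores = defaultdict(float)
        for worker, labels in worker_labels.items():
            if inst not in labels:
                continue  # this worker did not label the instance
            lab = labels[inst]
            inst_cert = instance_proba[worker][inst].get(lab, 0.0)
            # Assumption: the vote weight is the product of both certainties.
            scores[lab] += cls_cert[worker][lab] * inst_cert
        # Keep the current integrated label if no worker voted.
        corrected[inst] = max(scores, key=scores.get) if scores else current
    return corrected
```

Under these assumptions, a worker whose labels for a class often disagree with the integrated labels contributes less to votes for that class, and a vote counts for less on instances where the worker's own classifier assigns low probability to the label the worker gave.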