Classifying vaguely labeled data based on evidential fusion

被引:44
作者
Song, Moxian [1 ,2 ]
Sun, Chenxi [1 ,2 ]
Cai, Derun [1 ,2 ]
Hong, Shenda [3 ,4 ]
Li, Hongyan [1 ,2 ]
机构
[1] Peking Univ, Sch Elect Engn & Comp Sci, Beijing 100871, Peoples R China
[2] Peking Univ, Key Lab Machine Percept, Minist Educ, Beijing 100871, Peoples R China
[3] Peking Univ, Natl Inst Hlth Data Sci, Beijing 100191, Peoples R China
[4] Peking Univ, Inst Med Technol, Hlth Sci Ctr, Beijing 100191, Peoples R China
基金
中国国家自然科学基金;
关键词
Classification; Dempster-Shafer theory; Evidence theory; Uncertainty; Evidential fusion; Vague label; BELIEF FUNCTIONS; DECISION-MAKING; CLASSIFICATION;
D O I
10.1016/j.ins.2021.11.005
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Classification is one of the fundamental supervised learning tasks which learns classifiers from the given training data and related labels. The quality of labels is important in classification tasks. However, in many real-world scenarios, data annotation is often corrupted, especially when the annotation process is done by humans. Vaguely labeled data is one of the common problems caused by limited domain knowledge or partial data observation. In this paper, a novel method is proposed to classify vaguely labeled data based on evidential fusion. Vaguely labeled data are divided into several small data groups by the proposed valid label-set cover assignment algorithm. Evidence theory is applied to vaguely labeled data classification by regarding each base classifier on a small data group as one piece of evidence. This gives the chance of classifying unseen precise labeled data from related vague labels. Note that our approach is not restricted to any specific classifiers. It can be generalized to any off-the-shelf classification methods with probabilistic outputs. Finally, experiments are conducted on both synthetic data and real-world data with different base classifiers. Experimental results show that the proposed method achieves superior performance against compared methods. (c) 2021 Elsevier Inc. All rights reserved.
引用
收藏
页码:159 / 173
页数:15
相关论文
共 50 条
[1]  
[Anonymous], 1976, A Mathematical Theory of Evidence, DOI 10.1515/9780691214696
[2]  
[Anonymous], 1986, AI Mag., DOI DOI 10.1609/AIMAG.V7I2.542
[3]  
[Anonymous], 2013, Pattern Recognition and Machine Learning, DOI [DOI 10.18637/JSS.V017.B05, 10.1117/1.2819119]
[4]   Energy and entropy: Path from game theory to statistical mechanics [J].
Babajanyan, S. G. ;
Allahverdyan, A. E. ;
Cheong, Kang Hao .
PHYSICAL REVIEW RESEARCH, 2020, 2 (04)
[5]   Learning multi-label scene classification [J].
Boutell, MR ;
Luo, JB ;
Shen, XP ;
Brown, CM .
PATTERN RECOGNITION, 2004, 37 (09) :1757-1771
[6]   Probability transformation of mass function: A weighted network method based on the ordered visibility graph [J].
Chen, Luyuan ;
Deng, Yong ;
Cheong, Kang Hao .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 105
[7]   UPPER AND LOWER PROBABILITIES INDUCED BY A MULTIVALUED MAPPING [J].
DEMPSTER, AP .
ANNALS OF MATHEMATICAL STATISTICS, 1967, 38 (02) :325-&
[8]   Combining conflicting evidence based on Pearson correlation coefficient and weighted graph [J].
Deng, Jixiang ;
Deng, Yong ;
Cheong, Kang Hao .
INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2021, 36 (12) :7443-7460
[9]   Information Volume of Mass Function [J].
Deng, Y. .
INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL, 2020, 15 (06) :1-13
[10]   Uncertainty measure in evidence theory [J].
Deng, Yong .
SCIENCE CHINA-INFORMATION SCIENCES, 2020, 63 (11)