A label noise filtering and label missing supplement framework based on game theory

被引:4
作者
Liu, Yuwen [1 ]
Yao, Rongju [2 ]
Jia, Song [3 ]
Wang, Fan [6 ]
Wang, Ruili [4 ]
Ma, Rui [5 ]
Qi, Lianyong [1 ]
机构
[1] China Univ Petr East China, Coll Comp Sci & Technol, Qingdao 266580, Peoples R China
[2] Weifang Univ Sci & Technol, Weifang Key Lab Blockchain Agr Vegetables, Shouguang, Peoples R China
[3] China Unicom Taian Branch, Tai An, Peoples R China
[4] Massey Univ, Sch Nat & Computat Sci, Auckland, New Zealand
[5] Shandong First Med Univ, Shandong Acad Med Sci, Gen Educ Dept, Tai An 271000, Peoples R China
[6] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Label noise; FastText; Cosine similarity; Game theory; LSTM; CLASSIFICATION;
D O I
10.1016/j.dcan.2021.12.008
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
Labeled data is widely used in various classification tasks. However, there is a huge challenge that labels are often added artificially. Wrong labels added by malicious users will affect the training effect of the model. The unreliability of labeled data has hindered the research. In order to solve the above problems, we propose a framework of Label Noise Filtering and Missing Label Supplement (LNFS). And we take location labels in Location-Based Social Networks (LBSN) as an example to implement our framework. For the problem of label noise filtering, we first use FastText to transform the restaurant's labels into vectors, and then based on the assumption that the label most similar to all other labels in the location is most representative. We use cosine similarity to judge and select the most representative label. For the problem of label missing, we use simple common word similarity to judge the similarity of users' comments, and then use the label of the similar restaurant to supplement the missing labels. To optimize the performance of the model, we introduce game theory into our model to simulate the game between the malicious users and the model to improve the reliability of the model. Finally, a case study is given to illustrate the effectiveness and reliability of LNFS.
引用
收藏
页码:887 / 895
页数:9
相关论文
共 50 条
[31]   An Ensemble and Iterative Recovery Strategy Based kGNN Method to Edit Data with Label Noise [J].
Chen, Baiyun ;
Huang, Longhai ;
Chen, Zizhong ;
Wang, Guoyin .
MATHEMATICS, 2022, 10 (15)
[32]   Neighborhood repartition-based oversampling algorithm for multiclass imbalanced data with label noise [J].
Shen, Shiyi ;
Li, Zhixin ;
Huan, Zhan ;
Shang, Fanqi ;
Wang, Yongsong ;
Chen, Ying .
NEUROCOMPUTING, 2024, 600
[33]   VPGB: A granular-ball based model for attribute reduction and classification with label noise [J].
Peng, Xiaoli ;
Wang, Ping ;
Xia, Shuyin ;
Wang, Cheng ;
Chen, Weiqi .
INFORMATION SCIENCES, 2022, 611 :504-521
[34]   SVM based multi-label learning with missing labels for image annotation [J].
Liu, Yang ;
Wen, Kaiwen ;
Gao, Quanxue ;
Gao, Xinbo ;
Nie, Feiping .
PATTERN RECOGNITION, 2018, 78 :307-317
[35]   CCMN: A General Framework for Learning With Class-Conditional Multi-Label Noise [J].
Xie, Ming-Kun ;
Huang, Sheng-Jun .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (01) :154-166
[36]   A Belief Theory Based Instance Selection Scheme for Label Noise and Outlier Detection from Breast Cancer Data [J].
Faziludeen, Shameer ;
Sankaran, Praveen .
COMPUTER VISION AND IMAGE PROCESSING, CVIP 2023, PT II, 2024, 2010 :67-77
[37]   A Self-training-based Label Noise Correction Algorithm for Crowdsourcing [J].
Yang Y. ;
Jiang L.-X. ;
Li C.-Q. .
Zidonghua Xuebao/Acta Automatica Sinica, 2023, 49 (04) :830-844
[38]   Robust Learning Against Label Noise Based on Activation Trend Tracking [J].
Wang, Yilin ;
Zhang, Yulong ;
Jiang, Zhiqiang ;
Zheng, Li ;
Chen, Jinshui ;
Lu, Jiangang .
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
[39]   A mathematical programming approach to SVM-based classification with label noise [J].
Blanco, Victor ;
Japon, Alberto ;
Puerto, Justo .
COMPUTERS & INDUSTRIAL ENGINEERING, 2022, 172
[40]   Noisy feature decomposition-based multi-label learning with missing labels [J].
Ding, Jiaman ;
Zhang, Yihang ;
Jia, Lianyin ;
Fu, Xiaodong ;
Jiang, Ying .
INFORMATION SCIENCES, 2024, 662