A label noise filtering and label missing supplement framework based on game theory

被引:4
作者
Liu, Yuwen [1 ]
Yao, Rongju [2 ]
Jia, Song [3 ]
Wang, Fan [6 ]
Wang, Ruili [4 ]
Ma, Rui [5 ]
Qi, Lianyong [1 ]
机构
[1] China Univ Petr East China, Coll Comp Sci & Technol, Qingdao 266580, Peoples R China
[2] Weifang Univ Sci & Technol, Weifang Key Lab Blockchain Agr Vegetables, Shouguang, Peoples R China
[3] China Unicom Taian Branch, Tai An, Peoples R China
[4] Massey Univ, Sch Nat & Computat Sci, Auckland, New Zealand
[5] Shandong First Med Univ, Shandong Acad Med Sci, Gen Educ Dept, Tai An 271000, Peoples R China
[6] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Label noise; FastText; Cosine similarity; Game theory; LSTM; CLASSIFICATION;
D O I
10.1016/j.dcan.2021.12.008
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
Labeled data is widely used in various classification tasks. However, there is a huge challenge that labels are often added artificially. Wrong labels added by malicious users will affect the training effect of the model. The unreliability of labeled data has hindered the research. In order to solve the above problems, we propose a framework of Label Noise Filtering and Missing Label Supplement (LNFS). And we take location labels in Location-Based Social Networks (LBSN) as an example to implement our framework. For the problem of label noise filtering, we first use FastText to transform the restaurant's labels into vectors, and then based on the assumption that the label most similar to all other labels in the location is most representative. We use cosine similarity to judge and select the most representative label. For the problem of label missing, we use simple common word similarity to judge the similarity of users' comments, and then use the label of the similar restaurant to supplement the missing labels. To optimize the performance of the model, we introduce game theory into our model to simulate the game between the malicious users and the model to improve the reliability of the model. Finally, a case study is given to illustrate the effectiveness and reliability of LNFS.
引用
收藏
页码:887 / 895
页数:9
相关论文
共 50 条
[21]   Euclidean Distance Based Label Noise Cleaning [J].
Malik, Muhammad Ammar ;
Kang, Moonsoo .
2017 NINTH INTERNATIONAL CONFERENCE ON UBIQUITOUS AND FUTURE NETWORKS (ICUFN 2017), 2017, :237-239
[22]   Self-Filtering: A Noise-Aware Sample Selection for Label Noise with Confidence Penalization [J].
Wei, Qi ;
Sun, Haoliang ;
Lu, Xiankai ;
Yin, Yilong .
COMPUTER VISION - ECCV 2022, PT XXX, 2022, 13690 :516-532
[23]   A Robust Multilabel Method Integrating Rule-Based Transparent Model, Soft Label Correlation Learning and Label Noise Resistance [J].
Lou, Qiongdan ;
Deng, Zhaohong ;
Sang, Qingbing ;
Xiao, Zhiyong ;
Choi, Kup-Sze ;
Wang, Shitong .
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 8 (01) :454-473
[24]   A Four-Stage Algorithm for Community Detection Based on Label Propagation and Game Theory in Social Networks [J].
Torkaman, Atefeh ;
Badie, Kambiz ;
Salajegheh, Afshin ;
Bokaei, Mohammad Hadi ;
Ardestani, Seyed Farshad Fatemi .
AI, 2023, 4 (01) :255-269
[25]   Analysis of label noise in graph-based semi-supervised learning [J].
de Aquino Afonso, Bruno Klaus ;
Berton, Lilian .
PROCEEDINGS OF THE 35TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING (SAC'20), 2020, :1127-1134
[26]   The processing for label noise based on attribute reduction and two-step method [J].
Wu, Xingyu ;
Zhu, Ping .
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2025, 16 (7-8) :4833-4847
[27]   Robust TSK Fuzzy System Based on Semisupervised Learning for Label Noise Data [J].
Zhang, Te ;
Deng, Zhaohong ;
Ishibuchi, Hisao ;
Pang, Lie Meng .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2021, 29 (08) :2145-2157
[28]   Label-Weighted Graph-Based Learning for Semi-Supervised Classification Under Label Noise [J].
Liang, Naiyao ;
Yang, Zuyuan ;
Chen, Junhang ;
Li, Zhenni ;
Xie, Shengli .
IEEE TRANSACTIONS ON BIG DATA, 2024, 10 (01) :55-65
[29]   Co-LDL: A Co-Training-Based Label Distribution Learning Method for Tackling Label Noise [J].
Sun, Zeren ;
Liu, Huafeng ;
Wang, Qiong ;
Zhou, Tianfei ;
Wu, Qi ;
Tang, Zhenmin .
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 :1093-1104
[30]   ML-LRC: Low-rank-constraint-based Multi-label Learning with Label Noise [J].
Wang, Xiaoying ;
Xie, Jun ;
Yu, Lu ;
Tao, Xingliu .
PROCEEDINGS OF 2020 IEEE 4TH INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC 2020), 2020, :129-136