A label noise filtering and label missing supplement framework based on game theory

Cited by: 4
Authors
Liu, Yuwen [1]
Yao, Rongju [2]
Jia, Song [3]
Wang, Fan [6]
Wang, Ruili [4]
Ma, Rui [5]
Qi, Lianyong [1]
Affiliations
[1] China Univ Petr East China, Coll Comp Sci & Technol, Qingdao 266580, Peoples R China
[2] Weifang Univ Sci & Technol, Weifang Key Lab Blockchain Agr Vegetables, Shouguang, Peoples R China
[3] China Unicom Taian Branch, Tai An, Peoples R China
[4] Massey Univ, Sch Nat & Computat Sci, Auckland, New Zealand
[5] Shandong First Med Univ, Shandong Acad Med Sci, Gen Educ Dept, Tai An 271000, Peoples R China
[6] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Label noise; FastText; Cosine similarity; Game theory; LSTM; CLASSIFICATION;
DOI
10.1016/j.dcan.2021.12.008
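Chinese Library Classification number
TN [Electronic technology, communication technology];
Discipline classification code
0809;
Abstract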
Labeled data is widely used in classification tasks. However, labels are often added manually, and wrong labels added by malicious users degrade model training. The unreliability of labeled data has hindered research. To address these problems, we propose a framework for Label Noise Filtering and Missing Label Supplement (LNFS), and we take location labels in Location-Based Social Networks (LBSN) as an example to implement it. For label noise filtering, we first use FastText to transform a restaurant's labels into vectors; then, on the assumption that the label most similar to all other labels of the location is the most representative, we use cosine similarity to select the most representative label. For missing labels, we use a simple common-word similarity to judge the similarity of users' comments, and then use the labels of similar restaurants to supplement the missing labels. To improve the reliability of the model, we introduce game theory to simulate the game between malicious users and the model. Finally, a case study illustrates the effectiveness and reliability of LNFS.
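The two similarity steps named in the abstract can be sketched as follows. This is only an illustration under stated assumptions, not the authors' implementation: the label vectors are small hand-written stand-ins for FastText embeddings, the function names are hypothetical, and Jaccard overlap is assumed as one possible reading of "common word similarity", which the abstract does not define. The game-theoretic component is not covered.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def most_representative_label(label_vectors):
    """Return the label whose mean cosine similarity to all other labels is highest.

    label_vectors maps a label string to its embedding (hypothetical input;
    in the paper these vectors come from FastText).
    """
    best_label, best_score = None, float("-inf")
    for label, vec in label_vectors.items():
        others = [v for other, v in label_vectors.items() if other != label]
        if not others:
            return label  # a single label is trivially representative
        score = sum(cosine(vec, v) for v in others) / len(others)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


def common_word_similarity(comment_a, comment_b):
    """Jaccard overlap of the word sets of two comments (an assumed definition)."""
    words_a = set(comment_a.lower().split())
    words_b = set(comment_b.lower().split())
    if not words_a or not words_b:
        return 0.0
    return len(words_a & words_b) / len(words_a | words_b)


# Toy stand-ins for FastText label vectors of one restaurant (made-up numbers).
toy_vectors = {
    "pizza": [0.9, 0.1, 0.0],
    "italian": [0.8, 0.2, 0.1],
    "pasta": [0.85, 0.15, 0.05],
    "car wash": [0.0, 0.1, 0.9],  # plays the role of a noisy label
}
representative = most_representative_label(toy_vectors)
overlap = common_word_similarity("great pizza here", "great pasta here")
```

In this toy setting the outlying label scores poorly against the other three, so it is never selected as representative. A comment-overlap score above some chosen threshold would then mark two restaurants as similar enough to share labels; that threshold is not given in the abstract.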
Pages: 887-895
Page count: 9
Related papers
50 in total
  • [1] A framework for label noise filters
    Chen, Qingqiang
    Jiang, Gaoxia
    Cao, Fuyuan
    Men, Changqian
    Wang, Wenjian
    PATTERN RECOGNITION, 2024, 147
  • [2] Label noise filtering based on the data distribution
    Chen Q.
    Wang W.
    Jiang G.
    Qinghua Daxue Xuebao/Journal of Tsinghua University, 2019, 59 (04): 262 - 269
  • [3] A Label Noise Filtering Method Based on Relative Outlier Factor
    Hou S.-Y.
    Jiang G.-X.
    Wang W.-J.
    Zidonghua Xuebao/Acta Automatica Sinica, 2024, 50 (01): 154 - 168
  • [4] Cluster Validation Measures for Label Noise Filtering
    Boeva, Veselka
    Lundberg, Lars
    Angelova, Milena
    Kohstall, Jan
    2018 9TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS (IS), 2018, : 109 - 116
  • [5] A reconstruction error-based framework for label noise detection
    Salekshahrezaee, Zahra
    Leevy, Joffrey L.
    Khoshgoftaar, Taghi M.
    JOURNAL OF BIG DATA, 2021, 8 (01)
  • [6] Improving Label Noise Filtering by Exploiting Unlabeled Data
    Guan, Donghai
    Wei, Hongqiang
    Yuan, Weiwei
    Han, Guangjie
    Tian, Yuan
    Al-Dhelaan, Mohanmmed
    Al-Dhelaan, Abdullah
    IEEE ACCESS, 2018, 6 : 11154 - 11165
  • [7] A label noise filtering method for regression based on adaptive threshold and noise score
    Li, Chuang
    Mao, Zhizhong
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 228
  • [8] mCRF and mRD: Two Classification Methods Based on a Novel Multiclass Label Noise Filtering Learning Framework
    Xia, Shuyin
    Chen, Baiyun
    Wang, Guoyin
    Zheng, Yong
    Gao, Xinbo
    Giem, Elisabeth
    Chen, Zizhong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (07) : 2916 - 2930
  • [9] Enhanced Label Noise Filtering with Multiple Voting
    Guan, Donghai
    Hussain, Maqbool
    Yuan, Weiwei
    Khattak, Asad Masood
    Fahim, Muhammad
    Khan, Wajahat Ali
    APPLIED SCIENCES-BASEL, 2019, 9 (23):
  • [10] Classification with label noise: a Markov chain sampling framework
    Zhao, Zijin
    Chu, Lingyang
    Tao, Dacheng
    Pei, Jian
    DATA MINING AND KNOWLEDGE DISCOVERY, 2019, 33 (05) : 1468 - 1504