On the use of text augmentation for stance and fake news detection

被引:9
|
作者
Salah, Ilhem [1 ,2 ]
Jouini, Khaled [1 ]
Korbaa, Ouajdi [1 ]
机构
[1] Univ Sousse, MARS Res Lab LR17ES05, ISITCom, Hosp Sousse, Sousse, Tunisia
[2] Univ Sousse, Hosp Sousse, MARS Res Lab LR17ES05, ISITCom, Sousse 4011, Tunisia
关键词
Stance and fake news detection; text augmentation; ensemble learning; class imbalance;
D O I
10.1080/24751839.2023.2198820
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data Augmentation (DA) aims at synthesizing new training instances by applying transformations to available ones. DA has several well-known benefits such as: (i) increasing generalization ability; (ii) preventing data scarcity; and (iii) helping resolve class imbalance issues. In this work, we investigate the use of DA for stance and fake news detection. In the first part of our work, we explore the effect of various DA techniques on the performance of common classification algorithms. Our study reveals that the motto 'the more, the better' is the wrong approach regarding text augmentation and that there is no one-size-fits-all text augmentation technique. The second part of our work leverages the results of our study to propose a novel augmentation-based, ensemble learning approach. The proposed approach leverages text augmentation to enhance base learners' diversity and accuracy, ergo the predictive performance of the ensemble. The third part of our work experimentally investigates the use of DA to cope with the class imbalance problem. Class imbalance is very common in stance and fake news detection and often results in biased models. In this work we show how and to what extent text augmentation can help resolving moderate and severe imbalance.
引用
收藏
页码:359 / 375
页数:17
相关论文
共 50 条
  • [41] Albanian Fake News Detection
    Canhasi, Ercan
    Shijaku, Rexhep
    Berisha, Erblin
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2022, 21 (05)
  • [42] A Tool for Fake News Detection
    Al Asaad, Bashar
    Erascu, Madalina
    2018 20TH INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING (SYNASC 2018), 2019, : 379 - 386
  • [43] Transfer Learning from Transformers to Fake News Challenge Stance Detection (FNC-1) Task
    Slovikovskaya, Valeriya
    Attardi, Giuseppe
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 1211 - 1218
  • [44] Fake news detection on Twitter
    Sharma, Srishti
    Saraswat, Mala
    Dubey, Anil Kumar
    INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS, 2022, 18 (5/6) : 388 - 412
  • [45] Fake News Detection Using Stance Extracted Multimodal Fusion-Based Hybrid Neural Network
    Sengan, Sudhakar
    Vairavasundaram, Subramaniyaswamy
    Ravi, Logesh
    AlHamad, Ahmad Qasim Mohammad
    Alkhazaleh, Hamzah Ali
    Alharbi, Meshal
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (04): : 5146 - 5157
  • [46] Feature analysis of fake news: improving fake news detection in social media
    Leung, Johnathan
    Vatsalan, Dinusha
    Arachchilage, Nalin
    Journal of Cyber Security Technology, 2023, 7 (04) : 224 - 241
  • [47] Fake news detection for epidemic emergencies via deep correlations between text and images
    Zeng, Jiangfeng
    Zhang, Yin
    Ma, Xiao
    Sustainable Cities and Society, 2021, 66
  • [48] TieFake: Title-Text Similarity and Emotion-Aware Fake News Detection
    Guo, Quanjiang
    Kang, Zhao
    Tian, Ling
    Chen, Zhouguo
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [49] Fake news detection for epidemic emergencies via deep correlations between text and images
    Zeng, Jiangfeng
    Zhang, Yin
    Ma, Xiao
    SUSTAINABLE CITIES AND SOCIETY, 2021, 66
  • [50] Combining Neural, Statistical and External Features for Fake News Stance Identification
    Bhatt, Gaurav
    Sharma, Aman
    Sharma, Shivam
    Nagpal, Ankush
    Raman, Balasubramanian
    Mittal, Ankush
    COMPANION PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2018 (WWW 2018), 2018, : 1353 - 1357