Cross-Language Fake News Detection

被引:0
作者
Chu S.K.W. [1 ]
Xie R. [1 ]
Wang Y. [1 ]
机构
[1] University of Hong Kong, Hong Kong
关键词
cross-language study; fake news detection; information detection;
D O I
10.2478/dim-2020-0025
中图分类号
学科分类号
摘要
With increasing globalization, news from different countries, and even in different languages, has become readily available and has become a way for many people to learn about other cultures. As people around the world become more reliant on social media, the impact of fake news on public society also increases. However, most of the fake news detection research focuses only on English. In this work, we compared the difference between textual features of different languages (Chinese and English) and their effect on detecting fake news. We also explored the cross-language transmissibility of fake news detection models. We found that Chinese textual features in fake news are more complex compared with English textual features. Our results also illustrated that the bidirectional encoder representations from transformers (BERT) model outperformed other algorithms for within-language data sets. As for detection in cross-language data sets, our findings demonstrated that fake news monitoring across languages is potentially feasible, while models trained with data from a more inclusive language would perform better in cross-language detection. © 2021 Samuel Kai Wah Chu et al., published by Sciendo
引用
收藏
页码:100 / 109
页数:9
相关论文
共 50 条
  • [1] Fake news article detection datasets for Hindi language
    Kumar, Sujit
    Shankhdhar, Anant
    Singal, Divyam
    Aggarwal, Bhuvan
    Malhotra, Ahaan Sameer
    Ranbir Singh, Sanasam
    LANGUAGE RESOURCES AND EVALUATION, 2024,
  • [2] A Review of Fake News Detection Techniques for Arabic Language
    Alotaibi, Taghreed
    Al-Dossari, Hmood
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (01) : 392 - 407
  • [3] A Survey on Natural Language Processing for Fake News Detection
    Oshikawa, Ray
    Qian, Jing
    Wang, William Yang
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 6086 - 6093
  • [4] Cross-Domain Failures of Fake News Detection
    Janicka, Maria
    Pszona, Maria
    Wawer, Aleksander
    COMPUTACION Y SISTEMAS, 2019, 23 (03): : 1089 - 1097
  • [5] Fake news detection in Urdu language using machine learning
    Farooq, Muhammad Shoaib
    Naseem, Ansar
    Rustam, Furqan
    Ashraf, Imran
    PEERJ COMPUTER SCIENCE, 2023, 9
  • [6] MMCSC:A Cross-Modal Approach to Fake News Detection
    Zhao Y.
    Hao K.
    Zhao J.
    Xin J.-C.
    Dongbei Daxue Xuebao/Journal of Northeastern University, 2024, 45 (01): : 18 - 25
  • [7] Text Data Augmentation Techniques for Fake News Detection in the Romanian Language
    Bucos, Marian
    Tucudean, Georgiana
    APPLIED SCIENCES-BASEL, 2023, 13 (13):
  • [8] Cross-modal Ambiguity Learning for Multimodal Fake News Detection
    Chen, Yixuan
    Li, Dongsheng
    Zhang, Peng
    Sui, Jie
    Lv, Qin
    Lu, Tun
    Shang, Li
    PROCEEDINGS OF THE ACM WEB CONFERENCE 2022 (WWW'22), 2022, : 2897 - 2905
  • [9] Cross-modal Contrastive Learning for Multimodal Fake News Detection
    Wang, Longzheng
    Zhang, Chuang
    Xu, Hongbo
    Xu, Yongxiu
    Xu, Xiaohan
    Wang, Siqi
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 5696 - 5704
  • [10] Pre-Trained Language Model Ensemble for Arabic Fake News Detection
    Al-Zahrani, Lama
    Al-Yahya, Maha
    MATHEMATICS, 2024, 12 (18)