Multilingual Fake News Detection: A Study on Various Models and Training Scenarios

被引:0
|
作者
Chalehchaleh, Razieh [1 ]
Farahbakhsh, Reza [1 ]
Crespi, Noel [1 ]
机构
[1] Inst Polytech Paris, Telecom SudParis, Palaiseau, France
关键词
Fake news detection; Multilingual; Low-resource; Cross-lingual; Zero-shot; Transfer-learning; EMBEDDINGS;
D O I
10.1007/978-3-031-66428-1_5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Amidst the surge in global online news consumption, tackling the escalating challenge of fake news requires a multilingual approach. While extensive research has explored fake news detection from various perspectives, a notable gap persists-the majority of studies concentrate on the English language. This highlights, the need for more research focusing on other languages, especially considering the scarcity of available non-English fake news datasets, particularly in low-resource settings. Focused on mBERT, XLM-RoBERTa, and LASER embeddings, this study addresses three key questions. Firstly, it evaluates the efficacy of several multilingual models across languages, highlighting the robust performance of mBERT and XLM-RoBERTa. Secondly, it examines the impact of multilingual and cross-lingual training data, demonstrating the effectiveness of multilingual training, including its potential in zero-shot and transfer learning scenarios. Thirdly, it compares multilingual models with translation-based strategies, revealing the superior performance of the former in multilingual fake news detection. Leveraging two datasets encompassing news in English, Spanish, French, Portuguese, Italian, Hindi, Indonesian, Swahili, and Vietnamese, our research underscores the effectiveness of multilingual approaches offering valuable insights for future research to combat the global problem of fake news more effectively.
引用
收藏
页码:73 / 89
页数:17
相关论文
共 50 条
  • [1] Multiverse: Multilingual Evidence for Fake News Detection
    Dementieva, Daryna
    Kuimov, Mikhail
    Panchenko, Alexander
    JOURNAL OF IMAGING, 2023, 9 (04)
  • [2] Fake News Detection using Multilingual Evidence
    Dementieva, Daryna
    Panchenko, Alexander
    2020 IEEE 7TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA 2020), 2020, : 775 - 776
  • [3] Evidence-Aware Multilingual Fake News Detection
    Hammouchi, Hicham
    Ghogho, Mounir
    IEEE Access, 2022, 10 : 116808 - 116818
  • [4] Evidence-Aware Multilingual Fake News Detection
    Hammouchi, Hicham
    Ghogho, Mounir
    IEEE ACCESS, 2022, 10 : 116808 - 116818
  • [5] Comparison of Various Machine Learning Models for Accurate Detection of Fake News
    Poddar, Karishnu
    Amali, Geraldine Bessie D.
    Umadevi, K. S.
    2019 INNOVATIONS IN POWER AND ADVANCED COMPUTING TECHNOLOGIES (I-PACT), 2019,
  • [6] Mul-FaD: attention based detection of multiLingual fake news
    Ahuja N.
    Kumar S.
    Journal of Ambient Intelligence and Humanized Computing, 2023, 14 (03) : 2481 - 2491
  • [7] A benchmark study of machine learning models for online fake news detection
    Khan, Junaed Younus
    Khondaker, Md. Tawkat Islam
    Afroz, Sadia
    Uddin, Gias
    Iqbal, Anindya
    MACHINE LEARNING WITH APPLICATIONS, 2021, 4
  • [8] A Comparative Study in Large Language Models Usage for Fake News Detection
    Emil, Repede Stefan
    Brad, Remus
    ADVANCES IN ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING, 2024, 4 (04): : 2810 - 2823
  • [9] A study on the Detection of Fake news in Spanish
    Galvez, Alba Maribel Sanchez
    Albores, Francisco Javier
    Gonzalez, Ricardo Alvarez
    Conde, Said Gonzalez
    Galvez, Sully Sanchez
    INTERNATIONAL JOURNAL OF COMBINATORIAL OPTIMIZATION PROBLEMS AND INFORMATICS, 2024, 15 (02): : 85 - 94
  • [10] Benchmarking Hook and Bait Urdu news dataset for domain-agnostic and multilingual fake news detection using large language models
    Sheetal Harris
    Jinshuo Liu
    Hassan Jalil Hadi
    Naveed Ahmad
    Mohammed Ali Alshara
    Scientific Reports, 15 (1)