Multilingual Fake News Detection: A Study on Various Models and Training Scenarios

被引:0
|
作者
Chalehchaleh, Razieh [1 ]
Farahbakhsh, Reza [1 ]
Crespi, Noel [1 ]
机构
[1] Inst Polytech Paris, Telecom SudParis, Palaiseau, France
关键词
Fake news detection; Multilingual; Low-resource; Cross-lingual; Zero-shot; Transfer-learning; EMBEDDINGS;
D O I
10.1007/978-3-031-66428-1_5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Amidst the surge in global online news consumption, tackling the escalating challenge of fake news requires a multilingual approach. While extensive research has explored fake news detection from various perspectives, a notable gap persists-the majority of studies concentrate on the English language. This highlights, the need for more research focusing on other languages, especially considering the scarcity of available non-English fake news datasets, particularly in low-resource settings. Focused on mBERT, XLM-RoBERTa, and LASER embeddings, this study addresses three key questions. Firstly, it evaluates the efficacy of several multilingual models across languages, highlighting the robust performance of mBERT and XLM-RoBERTa. Secondly, it examines the impact of multilingual and cross-lingual training data, demonstrating the effectiveness of multilingual training, including its potential in zero-shot and transfer learning scenarios. Thirdly, it compares multilingual models with translation-based strategies, revealing the superior performance of the former in multilingual fake news detection. Leveraging two datasets encompassing news in English, Spanish, French, Portuguese, Italian, Hindi, Indonesian, Swahili, and Vietnamese, our research underscores the effectiveness of multilingual approaches offering valuable insights for future research to combat the global problem of fake news more effectively.
引用
收藏
页码:73 / 89
页数:17
相关论文
共 50 条
  • [31] Albanian Fake News Detection
    Canhasi, Ercan
    Shijaku, Rexhep
    Berisha, Erblin
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2022, 21 (05)
  • [32] Detecting Fake News Spreaders on Twitter from a Multilingual Perspective
    Vogel, Inna
    Meghana, Meghana
    2020 IEEE 7TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA 2020), 2020, : 599 - 606
  • [33] A Tool for Fake News Detection
    Al Asaad, Bashar
    Erascu, Madalina
    2018 20TH INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING (SYNASC 2018), 2019, : 379 - 386
  • [34] Fake news detection on Twitter
    Sharma, Srishti
    Saraswat, Mala
    Dubey, Anil Kumar
    INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS, 2022, 18 (5/6) : 388 - 412
  • [35] Arabic fake news detection based on deep contextualized embedding models
    Ali Bou Nassif
    Ashraf Elnagar
    Omar Elgendy
    Yaman Afadar
    Neural Computing and Applications, 2022, 34 : 16019 - 16032
  • [36] Exploring Generalizability of Fine-Tuned Models for Fake News Detection
    Suprem, Abhijit
    Vaidya, Sanjyot
    Pu, Calton
    2022 IEEE 8TH INTERNATIONAL CONFERENCE ON COLLABORATION AND INTERNET COMPUTING, CIC, 2022, : 82 - 88
  • [37] Adversarial Training for Fake News Classification
    Tariq, Abdullah
    Mehmood, Abid
    Elhadef, Mourad
    Khan, Muhammad Usman Ghani
    IEEE ACCESS, 2022, 10 : 82706 - 82715
  • [38] Fake news detection using deep learning models: A novel approach
    Kumar, Sachin
    Asthana, Rohan
    Upadhyay, Shashwat
    Upreti, Nidhi
    Akbar, Mohammad
    TRANSACTIONS ON EMERGING TELECOMMUNICATIONS TECHNOLOGIES, 2020, 31 (02)
  • [39] Arabic fake news detection based on deep contextualized embedding models
    Nassif, Ali Bou
    Elnagar, Ashraf
    Elgendy, Omar
    Afadar, Yaman
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (18): : 16019 - 16032
  • [40] Feature analysis of fake news: improving fake news detection in social media
    Leung, Johnathan
    Vatsalan, Dinusha
    Arachchilage, Nalin
    Journal of Cyber Security Technology, 2023, 7 (04) : 224 - 241