Hindi fake news detection using transformer ensembles

被引:16
|
作者
Praseed, Amit [1 ]
Rodrigues, Jelwin [2 ]
Thilagam, P. Santhi [2 ]
机构
[1] Indian Inst Informat Technol Sri City, Dept Comp Sci & Engn, Chittoor, India
[2] Natl Inst Technol Karnataka, Dept Comp Sci & Engn, Surathkal, India
关键词
Fake news; Transformer; Hindi fake news; mBERT; ELECTRA; XLM-RoBERTa; Ensemble;
D O I
10.1016/j.engappai.2022.105731
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the past few decades, due to the growth of social networking sites such as Whatsapp and Facebook, information distribution has been at a level never seen before. Knowing the integrity of information has been a long-standing problem, even more so for the regional languages. Regional languages, such as Hindi, raise challenging problems for fake news detection as they tend to be resource constrained. This limits the amount of data available to efficiently train models for these languages. Most of the existing techniques to detect fake news is targeted towards the English language or involves the manual translation of the language to the English language and then proceeding with Deep Learning methods. Pre-trained transformer based models such as BERT are fine-tuned for the task of fake news detection and are commonly employed for detecting fake news. Other pre-trained transformer models, such as ELECTRA and RoBERTa have also been shown to be able to detect fake news in multiple languages after suitable fine-tuning. In this work, we propose a method for detecting fake news in resource constrained languages such as Hindi more efficiently by using an ensemble of pre-trained transformer models, all of which are individually fine-tuned for the task of fake news detection. We demonstrate that the use of such a transformer ensemble consisting of XLM-RoBERTa, mBERT and ELECTRA is able to improve the efficiency of fake news detection in Hindi by overcoming the drawbacks of individual transformer models.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Fake News Detection using Multilingual Evidence
    Dementieva, Daryna
    Panchenko, Alexander
    2020 IEEE 7TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA 2020), 2020, : 775 - 776
  • [22] Fake News Detection using Bayesian Inference
    Najar, Fatma
    Zamzami, Nuha
    Bouguila, Nizar
    2019 IEEE 20TH INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION FOR DATA SCIENCE (IRI 2019), 2019, : 389 - 394
  • [23] Fake News Detection Using Enhanced BERT
    Aljawarneh, Shadi A.
    Swedat, Safa Ahmad
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (04): : 4843 - 4850
  • [24] Fake news detection using ensemble techniques
    Pooja Malhotra
    S. K. Malik
    Multimedia Tools and Applications, 2024, 83 : 42037 - 42062
  • [25] Fake News Detection using Deep Learning
    Kong, Sheng How
    Tan, Li Mei
    Gan, Keng Hoon
    Samsudin, Nur Hana
    IEEE 10TH SYMPOSIUM ON COMPUTER APPLICATIONS AND INDUSTRIAL ELECTRONICS (ISCAIE 2020), 2020, : 102 - 107
  • [26] Fake news detection using ensemble techniques
    Malhotra, Pooja
    Malik, S. K.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (14) : 42037 - 42062
  • [27] Fake news detection based on news content and social contexts: a transformer-based approach
    Raza, Shaina
    Ding, Chen
    INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2022, 13 (04) : 335 - 362
  • [28] Fake news detection based on news content and social contexts: a transformer-based approach
    Shaina Raza
    Chen Ding
    International Journal of Data Science and Analytics, 2022, 13 : 335 - 362
  • [30] Multi-modal transformer using two-level visual features for fake news detection
    Wang, Bin
    Feng, Yong
    Xiong, Xian-cai
    Wang, Yong-heng
    Qiang, Bao-hua
    APPLIED INTELLIGENCE, 2023, 53 (09) : 10429 - 10443