Hindi fake news detection using transformer ensembles

被引:16
|
作者
Praseed, Amit [1 ]
Rodrigues, Jelwin [2 ]
Thilagam, P. Santhi [2 ]
机构
[1] Indian Inst Informat Technol Sri City, Dept Comp Sci & Engn, Chittoor, India
[2] Natl Inst Technol Karnataka, Dept Comp Sci & Engn, Surathkal, India
关键词
Fake news; Transformer; Hindi fake news; mBERT; ELECTRA; XLM-RoBERTa; Ensemble;
D O I
10.1016/j.engappai.2022.105731
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the past few decades, due to the growth of social networking sites such as Whatsapp and Facebook, information distribution has been at a level never seen before. Knowing the integrity of information has been a long-standing problem, even more so for the regional languages. Regional languages, such as Hindi, raise challenging problems for fake news detection as they tend to be resource constrained. This limits the amount of data available to efficiently train models for these languages. Most of the existing techniques to detect fake news is targeted towards the English language or involves the manual translation of the language to the English language and then proceeding with Deep Learning methods. Pre-trained transformer based models such as BERT are fine-tuned for the task of fake news detection and are commonly employed for detecting fake news. Other pre-trained transformer models, such as ELECTRA and RoBERTa have also been shown to be able to detect fake news in multiple languages after suitable fine-tuning. In this work, we propose a method for detecting fake news in resource constrained languages such as Hindi more efficiently by using an ensemble of pre-trained transformer models, all of which are individually fine-tuned for the task of fake news detection. We demonstrate that the use of such a transformer ensemble consisting of XLM-RoBERTa, mBERT and ELECTRA is able to improve the efficiency of fake news detection in Hindi by overcoming the drawbacks of individual transformer models.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] Multi-modal transformer using two-level visual features for fake news detection
    Bin Wang
    Yong Feng
    Xian-cai Xiong
    Yong-heng Wang
    Bao-hua Qiang
    Applied Intelligence, 2023, 53 : 10429 - 10443
  • [32] EMET: EMBEDDINGS FROM MULTILINGUAL-ENCODER TRANSFORMER FOR FAKE NEWS DETECTION
    Schwarz, Stephane
    Theophilo, Antonto
    Rocha, Anderson
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 2777 - 2781
  • [33] SCATE: Shared Cross Attention Transformer Encoders for Multimodal Fake News Detection
    Sachan, Tanmay
    Pinnaparaju, Nikhil
    Gupta, Manish
    Varma, Vasudeva
    PROCEEDINGS OF THE 2021 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING, ASONAM 2021, 2021, : 399 - 406
  • [34] TRANSFAKE: Multi-task Transformer for Multimodal Enhanced Fake News Detection
    Jing, Quanliang
    Yao, Di
    Fan, Xinxin
    Wang, Baoli
    Tan, Haining
    Bu, Xiangpeng
    Bi, Jingping
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [35] Automatic Fake News Detection in Political Platforms - A Transformer-based Approach
    Raza, Shaina
    CASE 2021: THE 4TH WORKSHOP ON CHALLENGES AND APPLICATIONS OF AUTOMATED EXTRACTION OF SOCIO-POLITICAL EVENTS FROM TEXT (CASE), 2021, : 68 - 78
  • [36] Fake News Classification using transformer based enhanced LSTM and BERT
    Rai N.
    Kumar D.
    Kaushik N.
    Raj C.
    Ali A.
    International Journal of Cognitive Computing in Engineering, 2022, 3 : 98 - 105
  • [37] Knowledge augmented transformer for adversarial multidomain multiclassification multimodal fake news detection
    Song, Chenguang
    Ning, Nianwen
    Zhang, Yunlei
    Wu, Bin
    NEUROCOMPUTING, 2021, 462 : 88 - 100
  • [38] AI and Fake News: A Conceptual Framework for Fake News Detection
    Ameli, Leila
    Chowdhury, Md Shah Alam
    Farid, Farnaz
    Bello, Abubakar
    Sabrina, Fariza
    Maurushat, Alana
    PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON CYBER SECURITY, CSW 2022, 2022, : 34 - 39
  • [39] Fake News Detection on Fake.Br Using Hierarchical Attention Networks
    Okano, Emerson Yoshiaki
    Liu, Zebin
    Ji, Donghong
    Ruiz, Evandro Eduardo Seron
    COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROPOR 2020, 2020, 12037 : 143 - 152
  • [40] Fake News Detection on Twitter Using Propagation Structures
    Meyers, Marion
    Weiss, Gerhard
    Spanakis, Gerasimos
    DISINFORMATION IN OPEN ONLINE MEDIA, MISDOOM 2020, 2020, 12259 : 138 - 158