Hindi fake news detection using transformer ensembles

被引：16

作者：

Praseed, Amit ^{[1
]}

Rodrigues, Jelwin ^{[2
]}

Thilagam, P. Santhi ^{[2
]}

机构：

[1] Indian Inst Informat Technol Sri City, Dept Comp Sci & Engn, Chittoor, India

[2] Natl Inst Technol Karnataka, Dept Comp Sci & Engn, Surathkal, India

来源：

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE | 2023年 / 119卷

关键词：

Fake news; Transformer; Hindi fake news; mBERT; ELECTRA; XLM-RoBERTa; Ensemble;

D O I：

10.1016/j.engappai.2022.105731

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In the past few decades, due to the growth of social networking sites such as Whatsapp and Facebook, information distribution has been at a level never seen before. Knowing the integrity of information has been a long-standing problem, even more so for the regional languages. Regional languages, such as Hindi, raise challenging problems for fake news detection as they tend to be resource constrained. This limits the amount of data available to efficiently train models for these languages. Most of the existing techniques to detect fake news is targeted towards the English language or involves the manual translation of the language to the English language and then proceeding with Deep Learning methods. Pre-trained transformer based models such as BERT are fine-tuned for the task of fake news detection and are commonly employed for detecting fake news. Other pre-trained transformer models, such as ELECTRA and RoBERTa have also been shown to be able to detect fake news in multiple languages after suitable fine-tuning. In this work, we propose a method for detecting fake news in resource constrained languages such as Hindi more efficiently by using an ensemble of pre-trained transformer models, all of which are individually fine-tuned for the task of fake news detection. We demonstrate that the use of such a transformer ensemble consisting of XLM-RoBERTa, mBERT and ELECTRA is able to improve the efficiency of fake news detection in Hindi by overcoming the drawbacks of individual transformer models.

引用

页数：11

共 50 条

[31] Multi-modal transformer using two-level visual features for fake news detection
Bin Wang
Yong Feng
Xian-cai Xiong
Yong-heng Wang
Bao-hua Qiang
Applied Intelligence, 2023, 53 : 10429 - 10443
[32] EMET: EMBEDDINGS FROM MULTILINGUAL-ENCODER TRANSFORMER FOR FAKE NEWS DETECTION
Schwarz, Stephane
Theophilo, Antonto
Rocha, Anderson
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 2777 - 2781
[33] SCATE: Shared Cross Attention Transformer Encoders for Multimodal Fake News Detection
Sachan, Tanmay
Pinnaparaju, Nikhil
Gupta, Manish
Varma, Vasudeva
PROCEEDINGS OF THE 2021 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING, ASONAM 2021, 2021, : 399 - 406
[34] TRANSFAKE: Multi-task Transformer for Multimodal Enhanced Fake News Detection
Jing, Quanliang
Yao, Di
Fan, Xinxin
Wang, Baoli
Tan, Haining
Bu, Xiangpeng
Bi, Jingping
2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
[35] Automatic Fake News Detection in Political Platforms - A Transformer-based Approach
Raza, Shaina
CASE 2021: THE 4TH WORKSHOP ON CHALLENGES AND APPLICATIONS OF AUTOMATED EXTRACTION OF SOCIO-POLITICAL EVENTS FROM TEXT (CASE), 2021, : 68 - 78
[36] Fake News Classification using transformer based enhanced LSTM and BERT
Rai N.
Kumar D.
Kaushik N.
Raj C.
Ali A.
International Journal of Cognitive Computing in Engineering, 2022, 3 : 98 - 105
[37] Knowledge augmented transformer for adversarial multidomain multiclassification multimodal fake news detection
Song, Chenguang
Ning, Nianwen
Zhang, Yunlei
Wu, Bin
NEUROCOMPUTING, 2021, 462 : 88 - 100
[38] AI and Fake News: A Conceptual Framework for Fake News Detection
Ameli, Leila
Chowdhury, Md Shah Alam
Farid, Farnaz
Bello, Abubakar
Sabrina, Fariza
Maurushat, Alana
PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON CYBER SECURITY, CSW 2022, 2022, : 34 - 39
[39] Fake News Detection on Fake.Br Using Hierarchical Attention Networks
Okano, Emerson Yoshiaki
Liu, Zebin
Ji, Donghong
Ruiz, Evandro Eduardo Seron
COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROPOR 2020, 2020, 12037 : 143 - 152
[40] Fake News Detection on Twitter Using Propagation Structures
Meyers, Marion
Weiss, Gerhard
Spanakis, Gerasimos
DISINFORMATION IN OPEN ONLINE MEDIA, MISDOOM 2020, 2020, 12259 : 138 - 158

← 1 2 3 4 5 →