FakeBERT: Fake news detection in social media with a BERT-based deep learning approach

Cited by: 354
Authors
Kaliyar, Rohit Kumar [1]
Goswami, Anurag [1]
Narang, Pratik [2]
Affiliations
[1] Bennett Univ, Dept Comp Sci Engn, Greater Noida, India
[2] BITS Pilani, Dept CSIS, Pilani, Rajasthan, India
Keywords
Fake news; Neural network; Social media; Deep learning; BERT; Convolutional neural network; Representations; Classification
DOI
10.1007/s11042-020-10183-2
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In the modern era of computing, the news ecosystem has shifted from traditional print media to social media outlets. Social media platforms allow us to consume news much faster, but their less restrictive editing results in the spread of fake news at an incredible pace and scale. In recent research, many useful methods for fake news detection employ sequential neural networks to encode news content and social context-level information, analyzing the text sequence in a unidirectional way. A bidirectional training approach is therefore a priority for modelling the relevant information in fake news: it can improve classification performance by capturing semantic and long-distance dependencies in sentences. In this paper, we propose a BERT-based (Bidirectional Encoder Representations from Transformers) deep learning approach (FakeBERT) that combines BERT with parallel blocks of a single-layer deep Convolutional Neural Network (CNN), each block having different kernel sizes and filters. Such a combination is useful for handling ambiguity, which is the greatest challenge in natural language understanding. Classification results demonstrate that our proposed model (FakeBERT) outperforms existing models, with an accuracy of 98.90%.
Pages: 11765-11788
Number of pages: 24
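
The abstract describes parallel convolution blocks with different kernel sizes applied on top of BERT. The following is a minimal sketch of that general idea only, not the authors' implementation: the kernel sizes (3, 4, 5), the number of filters per size, the random weights, and the toy 4-dimensional token embeddings are all assumptions chosen for illustration, and the helper names are hypothetical. In a real system the embeddings would come from BERT and the weights would be learned.

```python
import random


def conv1d_max_pool(embeddings, kernels):
    """Slide each kernel over the token sequence, apply ReLU, then max-pool over time."""
    features = []
    for kernel in kernels:
        width = len(kernel)  # kernel size = number of consecutive tokens covered
        best = 0.0  # starting at 0 has the same effect as ReLU followed by max-pooling
        for start in range(len(embeddings) - width + 1):
            window = embeddings[start:start + width]
            activation = sum(
                w * x
                for kernel_row, token in zip(kernel, window)
                for w, x in zip(kernel_row, token)
            )
            best = max(best, activation)
        features.append(best)
    return features


def parallel_cnn_features(embeddings, kernel_sizes=(3, 4, 5), filters_per_size=2, seed=0):
    """Run one convolution block per kernel size and concatenate the pooled outputs.

    kernel_sizes and filters_per_size are illustrative assumptions; the weights
    are random stand-ins for trained parameters.
    """
    rng = random.Random(seed)
    dim = len(embeddings[0])
    features = []
    for size in kernel_sizes:
        kernels = [
            [[rng.uniform(-1, 1) for _ in range(dim)] for _ in range(size)]
            for _ in range(filters_per_size)
        ]
        features.extend(conv1d_max_pool(embeddings, kernels))
    return features


# Toy stand-in for contextual token embeddings: 8 tokens, 4 dimensions each.
toy_rng = random.Random(1)
token_embeddings = [[toy_rng.uniform(-1, 1) for _ in range(4)] for _ in range(8)]

features = parallel_cnn_features(token_embeddings)
print(len(features))  # one pooled value per filter: 3 kernel sizes x 2 filters = 6
```

Each kernel size looks at a different span of neighbouring tokens, so concatenating the pooled outputs gives a downstream classifier features at several n-gram scales at once.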