BDANN: BERT-Based Domain Adaptation Neural Network for Multi-Modal Fake News Detection

被引:73
作者
Zhang, Tong [1 ,2 ]
Wang, Di [2 ,3 ]
Chen, Huanhuan [3 ,4 ]
Zeng, Zhiwei [2 ]
Guo, Wei [5 ,6 ]
Miaoz, Chunyan [2 ,3 ,7 ]
Cui, Lizhen [5 ,6 ]
机构
[1] Shandong Univ, Sch Comp Sci & Technol, Jinan, Shandong, Peoples R China
[2] Nanyang Technol Univ, Joint NTU UBC Res Ctr Excellence Act Living Elder, Singapore, Singapore
[3] Nanyang Technol Univ, Joint NTU WeBank Res Ctr Fintech, Singapore, Singapore
[4] Univ Sci & Technol China, Sch Comp Sci & Technol, Hefei, Anhui, Peoples R China
[5] Shandong Univ, Sch Software, Jinan, Shandong, Peoples R China
[6] Shandong Univ, Joint SDU NTU Ctr Artificial Intelligence Res C F, Jinan, Shandong, Peoples R China
[7] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore, Singapore
来源
2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2020年
基金
新加坡国家研究基金会;
关键词
Fake news detection; Multimedia; Natural language processing; Data mining; Deep learning;
D O I
10.1109/ijcnn48605.2020.9206973
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Nowadays, with the rapid growth of microblogging networks for news propagation, there are increasingly more people accessing news through such emerging social media. In the meantime, fake news now spreads at a faster pace and affects a larger population than ever before. Compared with traditional text news, the news posted on microblog often has attached images in the context. So how to correctly and autonomously detect fakes news in a multi-modal manner becomes a prominent challenge to be addressed. In this paper, we propose an end-to-end model, named BERT-based domain adaptation neural network for multi-modal fake news detection (BDANN). BDANN comprises three main modules: a multi-modal feature extractor, a domain classifier and a fake news detector. Specifically, the multi-modal feature extractor employs the pretrained BERT model to extract text features and the pretrained VGG-19 model to extract image features. The extracted features are then concatenated and fed to the detector to distinguish fake news. The role of the domain classifier is mainly to map the multi-modal features of different events to the same feature space. To assess the performance of BDANN, we conduct extensive experiments on two multimedia datasets: Twitter and Weibo. The experimental results show that BDANN outperforms the state-of-the-art models. Moreover, we further discuss the existence of noisy images in the Weibo dataset that may affect the results.
引用
收藏
页数:8
相关论文
共 39 条
[1]  
[Anonymous], 2016, CBCNEWS NOV
[2]  
[Anonymous], 2019, TWITTER NUMBER MONTH
[3]  
[Anonymous], 2014, UNSUPERVISED DOMAIN
[4]  
[Anonymous], 2012, P 50 ANN M ASS COMPU
[5]  
[Anonymous], 2015, WORK NOT P MEDIAEVAL
[6]   VQA: Visual Question Answering [J].
Antol, Stanislaw ;
Agrawal, Aishwarya ;
Lu, Jiasen ;
Mitchell, Margaret ;
Batra, Dhruv ;
Zitnick, C. Lawrence ;
Parikh, Devi .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :2425-2433
[7]   Learning Deep Architectures for AI [J].
Bengio, Yoshua .
FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2009, 2 (01) :1-127
[8]  
Castillo Carlos, 2011, P 20 INT C WORLD WID, P675, DOI 10.1145/1963405.1963500
[9]  
Chang J., 2016, ABC NEWS
[10]   Modeling and Reasoning about Software Systems Containing Uncertainty and Variability [J].
Chechik, Marsha ;
Famelis, M. ;
Rubin, J. ;
Salay, R. ;
Czarnecki, K. .
PROCEEDINGS 2015 INTERNATIONAL SYMPOSIUM ON THEORETICAL ASPECTS OF SOFTWARE ENGINEERING, 2015, :1-1