Human Cognition-Based Consistency Inference Networks for Multi-Modal Fake News Detection

被引:45
作者
Wu, Lianwei [1 ,2 ,3 ]
Liu, Pusheng [1 ]
Zhao, Yongqiang [4 ]
Wang, Peng [1 ]
Zhang, Yangning [1 ]
机构
[1] Northwestern Polytech Univ, Sch Comp Sci, Natl Engn Lab Integrated Aerosp Ground Ocean Big D, Xian 710129, Peoples R China
[2] Northwestern Polytech Univ Shenzhen, Res & Dev Inst, Shenzhen 518057, Peoples R China
[3] Northwestern Polytech Univ, Chongqing Sci & Technol Innovat Ctr, Xian 710129, Peoples R China
[4] Peking Univ, Sch Comp Sci, Beijing 100871, Peoples R China
基金
中国国家自然科学基金;
关键词
Fake news detection; social media analysis; multi-modal fusion; network content security;
D O I
10.1109/TKDE.2023.3280555
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The existing models for multi-modal fake news detection focus mainly on capturing common similar semantics between different modalities to improve detection performance. However, they ignore the extraction of inconsistent features between these modalities. The intuitive cognition way people identify a piece of fake news is generally to discover if there are inconsistent semantics among news content itself and its comments, which could be abstracted as "comparing news image-text consistency - finding valuable comments - reasoning in-/consistency between news and comments". Inspired by the cognitive process, we propose Human Cognition-based Consistency Inference Networks (HCCIN) to comprehensively explore consistent and inconsistent semantics for multi-modal fake news detection. Specifically, we first design cross-modal alignment layer to learn consistent semantics between textual and visual information within the multi-modal news, and then the comment clue discovery layer is devoted to ascertaining the most-concerned semantics by audiences between comments. Finally, we develop collaborative inference layer to drive news consistent semantics and the most-concerned semantics to reason and discover consistent and inconsistent information between them. Experiments on three public datasets, including Weibo, Twitter, and PHEME, reveal the superiority of our HCCIN.
引用
收藏
页码:211 / 225
页数:15
相关论文
共 68 条
[1]  
Alam F., 2021, arXiv
[2]  
Barry A.M., 1997, VISUAL INTELLIGENCE
[3]  
Boididou Christina., 2015, MediaEval, V3, P7
[4]  
Cai YT, 2019, 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), P2506
[5]  
Castillo C., 2011, Proceedings of the 20th international conference on World wide web, WWW'11, (New York, NY, USA), P675, DOI [10.1145/1963405.1963500, DOI 10.1145/1963405.1963500]
[6]   COMMUNICATION MODALITY AS A DETERMINANT OF MESSAGE PERSUASIVENESS AND MESSAGE COMPREHENSIBILITY [J].
CHAIKEN, S ;
EAGLY, AH .
JOURNAL OF PERSONALITY AND SOCIAL PSYCHOLOGY, 1976, 34 (04) :605-614
[7]  
Chen JJ, 2022, AAAI CONF ARTIF INTE, P10482
[8]  
Devlin J, 2019, Arxiv, DOI arXiv:1810.04805
[9]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[10]  
Fleming N, 2020, NATURE, V583, P155, DOI 10.1038/d41586-020-01834-3