Hierarchical Semantic Enhancement Network for Multimodal Fake News Detection

被引：7

作者：

Zhang, Qiang ^{[1
]}

Liu, Jiawei ^{[1
]}

Zhang, Fanrui ^{[1
]}

Xie, Jingyi ^{[1
]}

Zha, Zheng-Jun ^{[1
]}

机构：

[1] Univ Sci & Technol China, Hefei, Anhui, Peoples R China

来源：

PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023 | 2023年

基金：

国家重点研发计划; 中国国家自然科学基金;

关键词：

Fake news detection; Semantic information; Multimodal; Entity; ATTENTION;

D O I：

10.1145/3581783.3612423

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The explosion of multimodal fake news content on social media has sparked widespread concern. Existing multimodal fake news detection methods have made significant contributions to the development of this field, but fail to adequately exploit the potential semantic information of images and ignore the noise embedded in news entities, which severely limits the performance of the models. In this paper, we propose a novel Hierarchical Semantic Enhancement Network (HSEN) for multimodal fake news detection by learning text-related image semantic and precise news high-order knowledge semantic information. Specifically, to complement the image semantic information, HSEN utilizes textual entities as the prompt subject vocabulary and applies reinforcement learning to discover the optimal prompt format for generating image captions specific to the corresponding textual entities, which contain multi-level cross-modal correlation information. Moreover, HSEN extracts visual and textual entities from image and text, and identifies additional visual entities from image captions to extend image semantic knowledge. Based on that, HSEN exploits an adaptive hard attention mechanism to automatically select strongly related news entities and remove irrelevant noise entities to obtain precise high-order knowledge semantic information, while generating attention mask for guiding cross-modal knowledge interaction. Extensive experiments show that our method outperforms state-of-the-art methods.

引用

页码：3424 / 3433

页数：10

共 54 条

[21]

Khoo LMS, 2020, AAAI CONF ARTIF INTE, V34, P8783

[22] Edge-aware Regional Message Passing Controller for Image Forgery Localization [J].

Li, Dong ;

Zhu, Jiaying ;

Wang, Menglu ;

Liu, Jiawei ;

Fu, Xueyang ;

Zha, Zheng-Jun .

2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, :8222-8232

[23] Entity-Oriented Multi-Modal Alignment and Fusion Network for Fake News Detection [J].

Li, Peiguang ;

Sun, Xian ;

Yu, Hongfeng ;

Tian, Yu ;

Yao, Fanglong ;

Xu, Guangluan .

IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 24 :3455-3468

[24]

Liu Y, 2018, AAAI CONF ARTIF INTE, P354

[25]

Mishra Rahul, 2020, P IEEE CVF C COMP VI, P652

[26] DeSI: Deepfake Source Identifier for Social Media [J].

Narayan, Kartik ;

Agarwal, Harsh ;

Mittal, Surbhi ;

Thakral, Kartik ;

Kundu, Suman ;

Vatsa, Mayank ;

Singh, Richa .

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, :2857-2866

[27] Improving Fake News Detection by Using an Entity-enhanced Framework to Fuse Diverse Multimodal Clues [J].

Qi, Peng ;

Cao, Juan ;

Li, Xirong ;

Liu, Huan ;

Sheng, Qiang ;

Mi, Xiaoyue ;

He, Qin ;

Lv, Yongbiao ;

Guo, Chenyang ;

Yu, Yingchao .

PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, :1212-1220

[28]

Qian F, 2018, PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P3834

[29]

Redmon J, 2018, Arxiv, DOI [arXiv:1804.02767, 10.48550/arXiv.1804.02767]

[30] Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks [J].

Ren, Shaoqing ;

He, Kaiming ;

Girshick, Ross ;

Sun, Jian .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (06) :1137-1149

← 1 2 3 4 5 6 →