News Image-Text Matching With News Knowledge Graph

Cited by: 6
Authors
Zhao Yumeng [1, 2]
Yun Jing [1]
Gao Shuo [1, 2]
Liu Limin [1, 2]
Affiliations
[1] Inner Mongolia Univ Technol, Coll Data Sci & Applicat, Hohhot 010080, Peoples R China
[2] Inner Mongolia Autonomous Reg Engn & Technol Res, Hohhot 010080, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Semantics; Task analysis; Licenses; Buildings; Visualization; Knowledge engineering; Image edge detection; Image-text matching; named entity; indirect relations of named entities;
DOI
10.1109/ACCESS.2021.3093650
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Image-text matching based on image captioning has made great progress. However, news text contains many named entities, and existing approaches cannot directly generate named entities in news image captions, which leads to a semantic gap between the text and the news image caption. Moreover, existing methods do not analyze the indirect relations between named entities, so they are prone to relation errors when generating news image captions. To generate news image captions that contain named entities by analyzing the indirect relations between them, we propose a novel model. Specifically, we introduce the TopNews dataset of related news, which aims to capture the relations between named entities as broadly as possible. We then build a news knowledge graph by extracting named entities from the TopNews dataset. Furthermore, we propose the News Knowledge Driven Graph Neural Network (NKD-GNN), which we use to analyze the full set of entity relations in the news knowledge graph. In this way, we generate news image captions with named entities. Extensive experiments on the TopNews dataset and a common dataset demonstrate that our approach is effective in detecting the consistency of news images and text.
Pages: 108017-108027
Number of pages: 11