Radial Graph Convolutional Network for Visual Question Generation

Cited by: 42

Authors:

Xu, Xing [1,2]

Wang, Tan [1,2]

Yang, Yang [1,2]

Hanjalic, Alan [3]

Shen, Heng Tao [1,2]

Affiliations:

[1] Univ Elect Sci & Technol China, Ctr Future Multimedia, Chengdu 610051, Peoples R China

[2] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 610051, Peoples R China

[3] Delft Univ Technol, Sch Informat & Software Engn, NL-2628 CD Delft, Netherlands

Source:

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2021 / Vol. 32 / No. 04

Funding:

National Natural Science Foundation of China;

Keywords:

Task analysis; Visualization; Training; Data models; Semantics; Convolution; Cross-media understanding; graph convolutional network (GCN); visual question generation (VQG);

DOI:

10.1109/TNNLS.2020.2986029

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In this article, we address the problem of visual question generation (VQG), a challenge in which a computer is required to generate meaningful questions about an image that target a given answer. Existing approaches typically treat the VQG task as a reversed visual question answering (VQA) task, which requires an exhaustive match between all the image regions and the given answer. To reduce this complexity, we propose an answer-centric approach termed radial graph convolutional network (Radial-GCN) that focuses on the relevant image regions only. Our Radial-GCN method can quickly find the core answer area in an image by matching the latent answer with the semantic labels learned from all image regions. Then, a novel sparse graph with a radial structure is built to capture the associations between the core node (i.e., the answer area) and the peripheral nodes (i.e., the other areas); graph attention is subsequently adopted to steer the convolutional propagation toward potentially more relevant nodes for the final question generation. Extensive experiments on three benchmark data sets show the superiority of our approach compared with the reference methods. Even in the unexplored and challenging zero-shot VQA task, the questions synthesized by our method remarkably boost the performance of several state-of-the-art VQA methods from 0% to over 40%. The implementation code of our proposed method and the successfully generated questions are available at https://github.com/Wangt-CN/VQG-GCN.
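
The three steps named in the abstract (locating the core answer area, building a radial graph around it, and attention-weighted propagation) can be illustrated with a small, dependency-free sketch. This is not the authors' implementation, which is in the repository linked above. Every function name, the cosine-similarity matching, the dot-product attention scores, and the additive update rule are assumptions chosen only to make the idea concrete.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def find_core(answer_vec, region_label_vecs):
    """Assumed matching rule: the region whose label embedding is most
    similar to the answer embedding becomes the core node."""
    scores = [cosine(answer_vec, r) for r in region_label_vecs]
    return max(range(len(scores)), key=scores.__getitem__)


def radial_edges(num_nodes, core):
    """Radial (star-shaped) sparse graph: the core is linked to every
    peripheral node, and peripheral nodes are not linked to each other."""
    return [(core, j) for j in range(num_nodes) if j != core]


def softmax(xs):
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def propagate(features, core, edges):
    """One hypothetical propagation step along the radial edges.

    Attention weights are a softmax over core/peripheral dot products
    (an assumption). The core aggregates weighted peripheral features,
    and each peripheral node receives the weighted core feature.
    """
    updated = [list(f) for f in features]
    if not edges:
        return updated, []
    core_feat = features[core]
    scores = [sum(a * b for a, b in zip(core_feat, features[j])) for _, j in edges]
    weights = softmax(scores)
    new_core = list(core_feat)
    for (_, j), w in zip(edges, weights):
        for k, value in enumerate(features[j]):
            new_core[k] += w * value
        updated[j] = [f + w * c for f, c in zip(features[j], core_feat)]
    updated[core] = new_core
    return updated, weights


if __name__ == "__main__":
    # Toy 2-D embeddings, invented purely for illustration.
    answer = [1.0, 0.0]
    labels = [[0.0, 1.0], [1.0, 0.1], [0.5, 0.5]]
    core = find_core(answer, labels)
    edges = radial_edges(len(labels), core)
    new_feats, attn = propagate(labels, core, edges)
    print(core, edges, attn)
```

With N regions, the radial graph has only N - 1 edges, whereas a fully connected region graph would have N(N - 1)/2. That sparsity is what the abstract's answer-centric design relies on to avoid exhaustive matching.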


Pages: 1654-1667

Page count: 14

