Radial Graph Convolutional Network for Visual Question Generation

被引:42
|
作者
Xu, Xing [1 ,2 ]
Wang, Tan [1 ,2 ]
Yang, Yang [1 ,2 ]
Hanjalic, Alan [3 ]
Shen, Heng Tao [1 ,2 ]
机构
[1] Univ Elect Sci & Technol China, Ctr Future Multimedia, Chengdu 610051, Peoples R China
[2] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 610051, Peoples R China
[3] Delft Univ Technol, Sch Informat & Software Engn, NL-2628 CD Delft, Netherlands
基金
中国国家自然科学基金;
关键词
Task analysis; Visualization; Training; Data models; Semantics; Convolution; Cross-media understanding; graph convolutional network (GCN); visual question generation (VQG);
D O I
10.1109/TNNLS.2020.2986029
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this article, we address the problem of visual question generation (VQG), a challenge in which a computer is required to generate meaningful questions about an image targeting a given answer. The existing approaches typically treat the VQG task as a reversed visual question answer (VQA) task, requiring the exhaustive match among all the image regions and the given answer. To reduce the complexity, we propose an innovative answer-centric approach termed radial graph convolutional network (Radial-GCN) to focus on the relevant image regions only. Our Radial-GCN method can quickly find the core answer area in an image by matching the latent answer with the semantic labels learned from all image regions. Then, a novel sparse graph of the radial structure is naturally built to capture the associations between the core node (i.e., answer area) and peripheral nodes (i.e., other areas); the graphic attention is subsequently adopted to steer the convolutional propagation toward potentially more relevant nodes for final question generation. Extensive experiments on three benchmark data sets show the superiority of our approach compared with the reference methods. Even in the unexplored challenging zero-shot VQA task, the synthesized questions by our method remarkably boost the performance of several state-of-the-art VQA methods from 0% to over 40%. The implementation code of our proposed method and the successfully generated questions are available at https://github.com/Wangt-CN/VQG-GCN.
引用
收藏
页码:1654 / 1667
页数:14
相关论文
共 50 条
  • [1] Scene Graph Refinement Network for Visual Question Answering
    Qian, Tianwen
    Chen, Jingjing
    Chen, Shaoxiang
    Wu, Bo
    Jiang, Yu-Gang
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 3950 - 3961
  • [2] Bilinear Graph Networks for Visual Question Answering
    Guo, Dalu
    Xu, Chang
    Tao, Dacheng
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (02) : 1023 - 1034
  • [3] Learning Graph Convolutional Network for Blind Mesh Visual Quality Assessment
    Abouelaziz, Ilyass
    Chetouani, Aladine
    El Hassouni, Mohammed
    Cherifi, Hocine
    Latecki, Longin Jan
    IEEE ACCESS, 2021, 9 : 108200 - 108211
  • [4] A Multichannel Convolutional Decoding Network for Graph Classification
    Guang, Mingjian
    Yan, Chungang
    Xu, Yuhua
    Wang, Junli
    Jiang, Changjun
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (10) : 13206 - 13216
  • [5] Semantic-Interactive Graph Convolutional Network for Multilabel Image Recognition
    Chen, Bingzhi
    Zhang, Zheng
    Lu, Yao
    Chen, Fanglin
    Lu, Guangming
    Zhang, David
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (08): : 4887 - 4899
  • [6] Graph Convolutional Network Hashing
    Zhou, Xiang
    Shen, Fumin
    Liu, Li
    Liu, Wei
    Nie, Liqiang
    Yang, Yang
    Shen, Heng Tao
    IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (04) : 1460 - 1472
  • [7] Graph Convolutional Network With Local and Global Feature Fusion for Hyperspectral Image Classification
    Wang, Yufan
    Yu, Xiaodong
    Dong, Hongbin
    Zang, Shuying
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [8] Semisupervised Change Detection Using Graph Convolutional Network
    Saha, Sudipan
    Mou, Lichao
    Zhu, Xiao Xiang
    Bovolo, Francesca
    Bruzzone, Lorenzo
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2021, 18 (04) : 607 - 611
  • [9] Modulation Recognition With Graph Convolutional Network
    Liu, Yabo
    Liu, Yi
    Yang, Cheng
    IEEE WIRELESS COMMUNICATIONS LETTERS, 2020, 9 (05) : 624 - 627
  • [10] Hierarchical Multimodality Graph Reasoning for Remote Sensing Visual Question Answering
    Zhang, Han
    Wang, Keming
    Zhang, Laixian
    Wang, Bingshu
    Li, Xuelong
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62