Dual-Branch Collaborative Learning for Visual Question Answering

被引:0
|
作者
Tian, Weidong [1 ,2 ]
Zhao, Junxiang [1 ]
Xu, Wenzheng [1 ]
Zhao, Zhongqiu [1 ,2 ,3 ]
机构
[1] Hefei Univ Technol, Sch Comp Sci & Informat Engn, Hefei, Peoples R China
[2] HFUT, Intelligent Mfg Inst, Hefei, Peoples R China
[3] Guangxi Acad Sci, Nanning, Guangxi, Peoples R China
基金
中国国家自然科学基金;
关键词
VQA; Relational Reasoning; Attention; Collaborative learning;
D O I
10.1007/978-981-97-5588-2_9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Good visual question answering models can reason about the underlying relationships in the context of images and questions. Recently, some works have used graph-based methods for visual reasoning, but graph-based methods cannot perform better reasoning when the connection between the question statement and the visual object is unclear. In this paper, we design a dual-branch network based on collaborative learning that can simultaneously focus on relational reasoning and attention-based deep alignment between images and questions. The question-aware enhancement module we designed can better utilize question information, and the joint prediction module we designed can fully integrate the performance of the two branches. Extensive experimental results demonstrate that our proposed method outperforms the current state-of-the-art methods in terms of performance.
引用
收藏
页码:96 / 107
页数:12
相关论文
共 50 条
  • [1] Text-Guided Dual-Branch Attention Network for Visual Question Answering
    Li, Mengfei
    Gu, Li
    Ji, Yi
    Liu, Chunping
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT III, 2018, 11166 : 750 - 760
  • [2] Dual-branch collaborative Siamese network for visual tracking
    Zhou, Wenjun
    Liu, Yao
    Wang, Nan
    Wang, Yifan
    Peng, Bo
    DIGITAL SIGNAL PROCESSING, 2024, 155
  • [3] Dual-branch collaborative learning network for crop disease identification
    Zhang, Weidong
    Sun, Xuewei
    Zhou, Ling
    Xie, Xiwang
    Zhao, Wenyi
    Liang, Zheng
    Zhuang, Peixian
    FRONTIERS IN PLANT SCIENCE, 2023, 14
  • [4] CoSiNet: Dual-Branch Collaborative Siamese Network for Visual Object Tracking
    Zhou, Wenjun
    Liu, Yao
    Wang, Nan
    Wang, Yifan
    Peng, Bo
    PROCEEDINGS OF THE 2024 27 TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, CSCWD 2024, 2024, : 1675 - 1680
  • [5] Dual-branch collaborative transformer for effective
    Qi, Xuanyu
    Song, Tianyu
    Dong, Haobo
    Jin, Jiyu
    Jin, Guiyue
    Li, Pengpeng
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 100
  • [6] Visual Question Generation as Dual Task of Visual Question Answering
    Li, Yikang
    Duan, Nan
    Zhou, Bolei
    Chu, Xiao
    Ouyang, Wanli
    Wang, Xiaogang
    Zhou, Ming
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 6116 - 6124
  • [7] Dual-Branch Collaborative Transformer for Virtual Try-On
    Fenocchi, Emanuele
    Morelli, Davide
    Cornia, Marcella
    Baraldi, Lorenzo
    Cesari, Fabio
    Cucchiara, Rita
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 2246 - 2250
  • [8] Learning a dual-branch classifier for class incremental learning
    Guo, Lei
    Xie, Gang
    Qu, Youyang
    Yan, Gaowei
    Cui, Lei
    APPLIED INTELLIGENCE, 2023, 53 (04) : 4316 - 4326
  • [9] Dual-feature collaborative relation-attention networks for visual question answering
    Lu Yao
    You Yang
    Juntao Hu
    International Journal of Multimedia Information Retrieval, 2023, 12
  • [10] Dual-feature collaborative relation-attention networks for visual question answering
    Yao, Lu
    Yang, You
    Hu, Juntao
    INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2023, 12 (02)