Overcoming language priors in visual question answering with cumulative learning strategy

被引:0
|
作者
Mao, Aihua [1 ]
Chen, Feng [1 ]
Ma, Ziying [1 ]
Lin, Ken [1 ]
机构
[1] School of Computer Science and Engineering, South China University of Technology, Guangzhou,511400, China
关键词
Compendex;
D O I
10.1016/j.neucom.2024.128419
中图分类号
学科分类号
摘要
Contrastive Learning
引用
收藏
相关论文
共 50 条
  • [31] MMQL: Multi-Question Learning for Medical Visual Question Answering
    Chen, Qishen
    Bian, Minjie
    Xu, Huahu
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT V, 2024, 15005 : 480 - 489
  • [32] Learning to Specialize with Knowledge Distillation for Visual Question Answering
    Mun, Jonghwan
    Lee, Kimin
    Shin, Jinwoo
    Han, Bohyung
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [33] Bidirectional Contrastive Split Learning for Visual Question Answering
    Sun, Yuwei
    Ochiai, Hideya
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 19, 2024, : 21602 - 21609
  • [34] Learning the Meanings of Function Words From Grounded Language Using a Visual Question Answering Model
    Portelance, Eva
    Frank, Michael C.
    Jurafsky, Dan
    COGNITIVE SCIENCE, 2024, 48 (05)
  • [35] Adversarial Learning with Bidirectional Attention for Visual Question Answering
    Li, Qifeng
    Tang, Xinyi
    Jian, Yi
    SENSORS, 2021, 21 (21)
  • [36] Multiple Context Learning Networks for Visual Question Answering
    Zhang, Pufen
    Lan, Hong
    Khan, Muhammad Asim
    SCIENTIFIC PROGRAMMING, 2022, 2022
  • [37] Visual Question Answering
    Nada, Ahmed
    Chen, Min
    2024 INTERNATIONAL CONFERENCE ON COMPUTING, NETWORKING AND COMMUNICATIONS, ICNC, 2024, : 6 - 10
  • [38] Quantifying and Alleviating the Language Prior Problem in Visual Question Answering
    Guo, Yangyang
    Cheng, Zhiyong
    Nie, Liqiang
    Liu, Yibing
    Wang, Yinglong
    Kankanhalli, Mohan
    PROCEEDINGS OF THE 42ND INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '19), 2019, : 75 - 84
  • [39] Learning Visual Question Answering by Bootstrapping Hard Attention
    Malinowski, Mateusz
    Doersch, Carl
    Santoro, Adam
    Battaglia, Peter
    COMPUTER VISION - ECCV 2018, PT VI, 2018, 11210 : 3 - 20
  • [40] Modal Feature Contribution Distribution Strategy in Visual Question Answering
    Dong F.
    Wang X.
    Oad A.
    Khoso M.N.
    Journal of Engineering Science and Technology Review, 2022, 15 (01) : 8 - 15