A Question-Centric Model for Visual Question Answering in Medical Imaging

被引:40
|
作者
Vu, Minh H. [1 ]
Lofstedt, Tommy [1 ]
Nyholm, Tufve [1 ]
Sznitman, Raphael [2 ]
机构
[1] Umea Univ, Dept Radiat Sci, S-90187 Umea, Sweden
[2] Univ Bern, ARTORG Ctr, CH-3008 Bern, Switzerland
关键词
Feature extraction; Visualization; Predictive models; Knowledge discovery; Task analysis; Medical diagnostic imaging; Visual question answering; deep learning; medical images; medical questions and answers;
D O I
10.1109/TMI.2020.2978284
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Deep learning methods have proven extremely effective at performing a variety of medical image analysis tasks. With their potential use in clinical routine, their lack of transparency has however been one of their few weak points, raising concerns regarding their behavior and failure modes. While most research to infer model behavior has focused on indirect strategies that estimate prediction uncertainties and visualize model support in the input image space, the ability to explicitly query a prediction model regarding its image content offers a more direct way to determine the behavior of trained models. To this end, we present a novel Visual Question Answering approach that allows an image to be queried by means of a written question. Experiments on a variety of medical and natural image datasets show that by fusing image and question features in a novel way, the proposed approach achieves an equal or higher accuracy compared to current methods.
引用
收藏
页码:2856 / 2868
页数:13
相关论文
共 50 条
  • [1] Semantic Technology and the Question-Centric Curriculum
    Fost J.
    Innovative Higher Education, 2013, 38 (1) : 31 - 44
  • [2] Consistency-Preserving Visual Question Answering in Medical Imaging
    Tascon-Morales, Sergio
    Marquez-Neila, Pablo
    Sznitman, Raphael
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT VIII, 2022, 13438 : 386 - 395
  • [3] Medical visual question answering: A survey
    Lin, Zhihong
    Zhang, Donghao
    Tao, Qingyi
    Shi, Danli
    Haffari, Gholamreza
    Wu, Qi
    He, Mingguang
    Ge, Zongyuan
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2023, 143
  • [4] A Transformer-based Medical Visual Question Answering Model
    Liu, Lei
    Su, Xiangdong
    Guo, Hui
    Zhu, Daobin
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 1712 - 1718
  • [5] MMQL: Multi-Question Learning for Medical Visual Question Answering
    Chen, Qishen
    Bian, Minjie
    Xu, Huahu
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT V, 2024, 15005 : 480 - 489
  • [6] MVQAS: A Medical Visual Question Answering System
    Bai, Haoyue
    Shan, Xiaoyan
    Huang, Yefan
    Wang, Xiaoli
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 4675 - 4679
  • [7] Localized Questions in Medical Visual Question Answering
    Tascon-Morales, Sergio
    Marquez-Neila, Pablo
    Sznitman, Raphael
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT II, 2023, 14221 : 361 - 370
  • [8] Question Modifiers in Visual Question Answering
    Britton, William
    Sarkhel, Somdeb
    Venugopal, Deepak
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 1472 - 1479
  • [9] CGMVQA: A New Classification and Generative Model for Medical Visual Question Answering
    Ren, Fuji
    Zhou, Yangyang
    IEEE ACCESS, 2020, 8 : 50626 - 50636
  • [10] Vision-Language Model for Visual Question Answering in Medical Imagery
    Bazi, Yakoub
    Al Rahhal, Mohamad Mahmoud
    Bashmal, Laila
    Zuair, Mansour
    BIOENGINEERING-BASEL, 2023, 10 (03):