A Question-Centric Model for Visual Question Answering in Medical Imaging

Cited by: 40

Authors:

Vu, Minh H. [1]

Lofstedt, Tommy [1]

Nyholm, Tufve [1]

Sznitman, Raphael [2]

Affiliations:

[1] Umea Univ, Dept Radiat Sci, S-90187 Umea, Sweden

[2] Univ Bern, ARTORG Ctr, CH-3008 Bern, Switzerland

Source:

IEEE TRANSACTIONS ON MEDICAL IMAGING | 2020 / Vol. 39 / No. 09

Keywords:

Feature extraction; Visualization; Predictive models; Knowledge discovery; Task analysis; Medical diagnostic imaging; Visual question answering; deep learning; medical images; medical questions and answers;

DOI:

10.1109/TMI.2020.2978284

Chinese Library Classification:

TP39 [Computer applications];

Discipline classification codes:

081203 ; 0835 ;

Abstract:

Deep learning methods have proven extremely effective at performing a variety of medical image analysis tasks. With their potential use in clinical routine, their lack of transparency has however been one of their few weak points, raising concerns regarding their behavior and failure modes. While most research to infer model behavior has focused on indirect strategies that estimate prediction uncertainties and visualize model support in the input image space, the ability to explicitly query a prediction model regarding its image content offers a more direct way to determine the behavior of trained models. To this end, we present a novel Visual Question Answering approach that allows an image to be queried by means of a written question. Experiments on a variety of medical and natural image datasets show that by fusing image and question features in a novel way, the proposed approach achieves an equal or higher accuracy compared to current methods.


Pages: 2856-2868

Page count: 13

