Medical knowledge-based network for Patient-oriented Visual Question Answering

被引:19
|
作者
Jian, Huang [1 ]
Chen, Yihao [2 ,3 ]
Yong, Li [4 ]
Yang, Zhenguo [1 ]
Gong, Xuehao [2 ,3 ]
Lee, Wang Fu [5 ]
Xu, Xiaohong [2 ,6 ]
Liu, Wenyin [1 ]
机构
[1] Guangdong Univ Technol, Sch Comp Sci & Technol, Guangzhou, Peoples R China
[2] Guangdong Med Univ, Clin Med Coll 1, Zhanjiang, Peoples R China
[3] Shenzhen Univ, Shenzhen Peoples Hosp 2, Affiliated Hosp 1, Dept Ultrasound, Shenzhen, Peoples R China
[4] South China Normal Univ, Sch Comp Sci & Technol, Guangzhou, Peoples R China
[5] Hong Kong Metropolitan Univ, Sch Sci & Technol, Hong Kong, Peoples R China
[6] Guangdong Med Univ, Affiliated Hosp, Dept Ultrasound, Zhanjiang, Peoples R China
关键词
Medical visual question answering; Medical knowledge graph; P-VQA;
D O I
10.1016/j.ipm.2022.103241
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Visual Question Answering (VQA) systems have achieved great success in general scenarios. In medical domain, VQA systems are still in their infancy as the datasets are limited by scale and application scenarios. Current medical VQA datasets are designed to conduct basic analyses of medical imaging such as modalities, planes, organ systems, abnormalities, etc., aiming to provide constructive medical suggestions for doctors, containing a large number of professional terms with limited help for patients. In this paper, we introduce a new Patient-oriented Visual Question Answering (P-VQA) dataset, which builds a VQA system for patients by covering an entire treatment process including medical consultation, imaging diagnosis, clinical diagnosis, treatment advice, review, etc. P-VQA covers 20 common diseases with 2,169 medical images, 24,800 question-answering pairs, and a medical knowledge graph containing 419 entities. In terms of methodology, we propose a Medical Knowledge-based VQA Network (MKBN) to answer questions according to the images and a medical knowledge graph in our P-VQA. MKBN learns two cluster embeddings (disease-related and relation-related embeddings) according to structural characteristics of the medical knowledge graph and learns three different interactive features (image-question, image-disease, and question-relation) according to characteristics of diagnosis. For comparisons, we evaluate several state-of-the-art baselines on the P-VQA dataset as benchmarks. Experimental results on P-VQA demonstrate that MKBN achieves the state-of-the-art performance compared with baseline methods. The dataset is available at https: //github.com/cs-jerhuang/P-VQA.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Rich Visual Knowledge-Based Augmentation Network for Visual Question Answering
    Zhang, Liyang
    Liu, Shuaicheng
    Liu, Donghao
    Zeng, Pengpeng
    Li, Xiangpeng
    Song, Jingkuan
    Gao, Lianli
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (10) : 4362 - 4373
  • [2] Explicit Knowledge-based Reasoning for Visual Question Answering
    Wang, Peng
    Wu, Qi
    Shen, Chunhua
    Dick, Anthony
    van den Hengel, Anton
    PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 1290 - 1296
  • [3] A Simple Baseline for Knowledge-Based Visual Question Answering
    Xenos, Alexandros
    Stafylakis, Themos
    Patras, Ioannis
    Tzimiropoulos, Georgios
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), 2023, : 14871 - 14877
  • [4] Knowledge-based question answering
    Rinaldi, F
    Dowdall, J
    Hess, M
    Mollá, D
    Schwitter, R
    Kaljurand, K
    KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 1, PROCEEDINGS, 2003, 2773 : 785 - 792
  • [5] Knowledge-based question answering
    Hermjakob, U
    Hovy, EH
    Lin, CY
    6TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL XVI, PROCEEDINGS: COMPUTER SCIENCE III, 2002, : 66 - 71
  • [6] Knowledge enhancement and scene understanding for knowledge-based visual question answering
    Zhenqiang Su
    Gang Gou
    Knowledge and Information Systems, 2024, 66 : 2193 - 2208
  • [7] Knowledge enhancement and scene understanding for knowledge-based visual question answering
    Su, Zhenqiang
    Gou, Gang
    KNOWLEDGE AND INFORMATION SYSTEMS, 2024, 66 (03) : 2193 - 2208
  • [8] Explainable Knowledge-Based Learning for Online Medical Question Answering
    Cui, Menglin
    Li, Xiang
    Qin, Peng
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT V, KSEM 2024, 2024, 14888 : 294 - 304
  • [9] REVIVE: Regional Visual Representation Matters in Knowledge-Based Visual Question Answering
    Lin, Yuanze
    Xie, Yujia
    Chen, Dongdong
    Xu, Yichong
    Zhu, Chenguang
    Yuan, Lu
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [10] VIG: Visual Information-Guided Knowledge-Based Visual Question Answering
    Liu, Heng
    Wang, Boyue
    Sun, Yanfeng
    Li, Xiaoyan
    Hu, Yongli
    Yin, Baocai
    PROCEEDINGS OF THE 2024 27 TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, CSCWD 2024, 2024, : 1086 - 1091