Vision-knowledge fusion model for multi-domain medical report generation

Cited by: 12
Authors
Xu, Dexuan [1, 2]
Zhu, Huashi [1, 2]
Huang, Yu [1]
Jin, Zhi [3]
Ding, Weiping [4]
Li, Hang [5, 6]
Ran, Menglong [5, 6]
Affiliations
[1] Peking Univ, Natl Engn Res Ctr Software Engn, Beijing 100871, Peoples R China
[2] Peking Univ, Sch Software & Microelect, Beijing 100871, Peoples R China
[3] Peking Univ, Key Lab High Confidence Software Technol, Beijing 100871, Peoples R China
[4] Nantong Univ, Sch Informat Sci & Technol, Nantong 226019, Peoples R China
[5] Peking Univ, Dept Dermatol, Hosp 1, Beijing 100034, Peoples R China
[6] Natl Clin Res Ctr Skin & Immune Dis, Beijing 100034, Peoples R China
Keywords
Medical report generation; Knowledge graph; Multi-modal fusion; Graph neural network
DOI
10.1016/j.inffus.2023.101817
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Medical report generation with knowledge graphs is an essential task in the medical field. Although existing knowledge graphs contain many entities, their semantics are insufficient because it is difficult to uniformly extract and fuse expert knowledge across different diseases. It is therefore necessary to construct domain-specific knowledge graphs automatically. In this paper, we propose a vision-knowledge fusion model based on medical images and knowledge graphs that makes full use of high-quality data from different diseases and languages. First, we give a general method for automatically constructing a knowledge graph for each domain from medical standards. Second, we design a knowledge-based attention mechanism to fuse images and knowledge effectively. Third, we build a triples restoration module to obtain fine-grained knowledge, and we propose, for the first time, knowledge-based evaluation metrics that are more reasonable and measure quality along different dimensions. Finally, we verify the effectiveness of our model in experiments on datasets for two different diseases: the public IU-Xray chest radiograph dataset and NCRC-DS, a dataset of Chinese dermoscopy reports that we compiled. Our model outperforms previous benchmark methods and achieves excellent evaluation scores on both datasets. In addition, the interpretability and clinical usefulness of the model are validated, and our method can be generalized to multiple domains and different diseases.
Pages: 12
Related papers
44 in total
  • [31] Multi-source knowledge fusion model for aspect-based sentiment analysis
    Han, Hu
    Hao, Jun
    Zhang, Qiankun
    Zhao, Qitao
    Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 2024, 50 (09): : 2688 - 2695
  • [32] Multi-view contrastive learning and symptom extraction insights for medical report generation
    Qi Bai
    Xiaodi Zou
    Ahmad Alhaskawi
    Yanzhao Dong
    Haiying Zhou
    Sohaib Hasan Abdullah Ezzi
    Vishnu Goutham Kota
    Mohamed Hasan Hasan AbdullaAbdulla
    Sahar Ahmed Abdalbary
    Xianliang Hu
    Hui Lu
    Scientific Reports, 15 (1)
  • [33] From Observation to Concept: A Flexible Multi-View Paradigm for Medical Report Generation
    Liu, Zhizhe
    Zhu, Zhenfeng
    Zheng, Shuai
    Zhao, Yawei
    He, Kunlun
    Zhao, Yao
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 5987 - 5995
  • [34] Multi-Hop Question Generation with Knowledge Graph-Enhanced Language Model
    Li, Zhenping
    Cao, Zhen
    Li, Pengfei
    Zhong, Yong
    Li, Shaobo
    APPLIED SCIENCES-BASEL, 2023, 13 (09):
  • [35] EAGS: An extracting auxiliary knowledge graph model in multi-turn dialogue generation
    Ning, Bo
    Zhao, Deji
    Liu, Xinyi
    Li, Guanyu
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2023, 26 (04): : 1545 - 1566
  • [36] Head and Tail Entity Fusion Model in Medical Knowledge Graph Construction: Case Study for Pituitary Adenoma
    Fang, An
    Lou, Pei
    Hu, Jiahui
    Zhao, Wanqing
    Feng, Ming
    Ren, Huiling
    Chen, Xianlai
    JMIR MEDICAL INFORMATICS, 2021, 9 (07)
  • [37] EAGS: An extracting auxiliary knowledge graph model in multi-turn dialogue generation
    Bo Ning
    Deji Zhao
    Xinyi Liu
    Guanyu Li
    World Wide Web, 2023, 26 : 1545 - 1566
  • [38] Research on Cross-domain Fake News detection based on Multi-space Fusion and Knowledge Graph Embedding
    Liu, Chao
    Song, Junlong
    PROCEEDINGS OF 2024 3RD INTERNATIONAL CONFERENCE ON CYBER SECURITY, ARTIFICIAL INTELLIGENCE AND DIGITAL ECONOMY, CSAIDE 2024, 2024, : 113 - 117
  • [39] Aspect-based sentiment analysis model based on multi-dependency graph and knowledge fusion
    He Y.
    Han H.
    Kong B.
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2024, 58 (04): : 737 - 747 and 837
  • [40] RKC-H: A Rich Knowledge Based Model for Multi-turn Dialogue Generation
    Xu, Feifei
    Ding, Guanqun
    Zhang, Wenkai
    Audrey
    2020 INTERNATIONAL SYMPOSIUM ON THEORETICAL ASPECTS OF SOFTWARE ENGINEERING (TASE 2020), 2020, : 25 - 32