Chest radiology report generation based on cross-modal multi-scale feature fusion

被引:4
作者
Pan, Yu [1 ]
Liu, Li -Jun [1 ,2 ,3 ]
Yang, Xiao-Bing [1 ]
Peng, Wei [1 ]
Huang, Qing-Song [1 ]
机构
[1] Kunming Univ Sci & Technol, Sch Informat Engn & Automat, Kunming, Peoples R China
[2] Yunnan Key Lab Comp Technol Applicat, Kunming, Peoples R China
[3] Kunming Univ Sci & Technol, Sch Informat Engn & Automat, Wujiaying St, Kunming, Yunnan, Peoples R China
基金
中国国家自然科学基金;
关键词
Report generation; Cross; -modal; Multi; -scale; Medical image; Attention mechanism; Deep learning;
D O I
10.1016/j.jrras.2024.100823
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Chest radiology imaging plays a crucial role in the early screening, diagnosis, and treatment of chest diseases. The accurate interpretation of radiological images and the automatic generation of radiology reports not only save the doctor's time but also mitigate the risk of errors in diagnosis. The core objective of automatic radiology report generation is to achieve precise mapping of visual features and lesion descriptions at multi-scale and finegrained levels. Existing methods typically combine global visual features and textual features to generate radiology reports. However, these approaches may ignore the key lesion areas and lack sensitivity to crucial lesion location information. Furthermore, achieving multi-scale characterization and fine-grained alignment of medical visual features and report text features proves challenging, leading to a reduction in the quality of radiology report generation. Addressing these issues, we propose a method for chest radiology report generation based on cross-modal multi-scale feature fusion. First, an auxiliary labeling module is designed to guide the model to focus on the lesion region of the radiological image. Second, a channel attention network is employed to enhance the characterization of location information and disease features. Finally, a cross-modal features fusion module is constructed by combining memory matrices, facilitating fine-grained alignment between multi-scale visual features and reporting text features on corresponding scales. The proposed method is experimentally evaluated on two publicly available radiological image datasets. The results demonstrate superior performance based on BLEU and ROUGE metrics compared to existing methods. Particularly, there are improvements of 4.8% in the ROUGE metric and 9.4% in the METEOR metric on the IU X-Ray dataset. Moreover, there is a 7.4% enhancement in BLEU-1 and a 7.6% improvement in the BLEU-2 on the MIMIC-CXR dataset.
引用
收藏
页数:9
相关论文
共 50 条
  • [41] A multi-scale feature fusion target detection algorithm
    Dong, Chong
    Li, Jingmei
    Wang, Jiaxiang
    2018 INTERNATIONAL CONFERENCE ON IMAGE AND VIDEO PROCESSING, AND ARTIFICIAL INTELLIGENCE, 2018, 10836
  • [42] Multi-Scale Attentive Feature Fusion Network for Single Image Dehazing
    Zhang, Chenxi
    Wu, Chunming
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [43] Monocular depth estimation with multi-scale feature fusion
    Wang Q.
    Zhang S.
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2020, 48 (05): : 7 - 12
  • [44] Adaptive feature fusion with attention mechanism for multi-scale target detection
    Moran Ju
    Jiangning Luo
    Zhongbo Wang
    Haibo Luo
    Neural Computing and Applications, 2021, 33 : 2769 - 2781
  • [45] Underwater Image Enhancement Based on Multi-Scale Feature Fusion and Attention Network
    Liu Y.
    Liu M.
    Lin S.
    Tao Z.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2023, 35 (05): : 685 - 695
  • [46] Multi-scale Remote Sensing Image Classification Based on Weighted Feature Fusion
    Cheng Yinzhu
    Liu Song
    Wang Nan
    Shi Yuetian
    Zhang Geng
    ACTA PHOTONICA SINICA, 2023, 52 (11)
  • [47] Liver segmentation network based on detail enhancement and multi-scale feature fusion
    Lu, Tinglan
    Qin, Jun
    Qin, Guihe
    Shi, Weili
    Zhang, Wentao
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [48] Point Cloud Semantic Segmentation Network Based on Multi-Scale Feature Fusion
    Du, Jing
    Jiang, Zuning
    Huang, Shangfeng
    Wang, Zongyue
    Su, Jinhe
    Su, Songjian
    Wu, Yundong
    Cai, Guorong
    SENSORS, 2021, 21 (05) : 1 - 20
  • [49] Multi-scale Feature Fusion Single Shot Object Detector Based on DenseNet
    Zhai, Minghao
    Liu, Junchen
    Zhang, Wei
    Liu, Chen
    Li, Wei
    Cao, Yi
    INTELLIGENT ROBOTICS AND APPLICATIONS, ICIRA 2019, PT V, 2019, 11744 : 450 - 460
  • [50] A Robust Vehicle Detection Model Based on Attention and Multi-scale Feature Fusion
    Zhu, Yuxin
    Liu, Wenbo
    Yan, Fei
    Li, Jun
    2022 14TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING, WCSP, 2022, : 143 - 148