Co-Occurrence Relationship Driven Hierarchical Attention Network for Brain CT Report Generation

被引:0
|
作者
Zhang, Xiaodan [1 ]
Dou, Shixin [1 ]
Ji, Junzhong [1 ]
Liu, Ying [2 ]
Wang, Zheng [2 ]
机构
[1] Beijing Univ Technol, Coll Comp Sci, Beijing 100021, Peoples R China
[2] Peking Univ Third Hosp, Dept Radiol, Beijing 100191, Peoples R China
来源
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE | 2024年 / 8卷 / 05期
基金
中国国家自然科学基金;
关键词
Biomedical imaging; Pathology; Semantics; Visualization; Feature extraction; Computed tomography; Medical diagnostic imaging; Co-occurrence relationship; hierarchical attention mechanism; medical report generation; Brain CT;
D O I
10.1109/TETCI.2024.3413002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic generation of medical reports for Brain Computed Tomography (CT) imaging is crucial for helping radiologists make more accurate clinical diagnoses efficiently. Brain CT imaging typically contains rich pathological information, including common pathologies that often co-occur in one report and rare pathologies that appear in medical reports with lower frequency. However, current research ignores the potential co-occurrence between common pathologies and pays insufficient attention to rare pathologies, severely restricting the accuracy and diversity of the generated medical reports. In this paper, we propose a Co-occurrence Relationship Driven Hierarchical Attention Network (CRHAN) to improve Brain CT report generation by mining common and rare pathologies in Brain CT imaging. Specifically, the proposed CRHAN follows a general encoder-decoder framework with two novel attention modules. In the encoder, a co-occurrence relationship guided semantic attention (CRSA) module is proposed to extract the critical semantic features by embedding the co-occurrence relationship of common pathologies into semantic attention. In the decoder, a common-rare topic driven visual attention (CRVA) module is proposed to fuse the common and rare semantic features as sentence topic vectors, and then guide the visual attention to capture important lesion features for medical report generation. Experiments on the Brain CT dataset demonstrate the effectiveness of the proposed method.
引用
收藏
页码:3643 / 3653
页数:11
相关论文
共 30 条
  • [1] Weakly guided attention model with hierarchical interaction for brain CT report generation
    Zhang, Xiaodan
    Yang, Sisi
    Shi, Yanzhao
    Ji, Junzhong
    Liu, Ying
    Wang, Zheng
    Xu, Huimin
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 167
  • [2] Discriminative Feature Learning With Co-Occurrence Attention Network for Vehicle ReID
    Sheng, Hao
    Wang, Shuai
    Chen, Haobo
    Yang, Da
    Huang, Yang
    Shen, Jiahao
    Ke, Wei
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (05) : 3510 - 3522
  • [3] Semantic Co-Occurrence and Relationship Modeling for Remote Sensing Image Segmentation
    Zhang, Yinxing
    Song, Haochen
    Wang, Qingwang
    Jin, Pengcheng
    Shen, Tao
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2025, 18 : 6630 - 6640
  • [4] Distributed representations of diseases based on co-occurrence relationship
    Wang, Haoqing
    Mai, Huiyu
    Deng, Zhi-hong
    Yang, Chao
    Zhang, Luxia
    Wang, Huai-yu
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 183
  • [5] Prior tissue knowledge-driven contrastive learning for brain CT report generation
    Yanzhao Shi
    Junzhong Ji
    Xiaodan Zhang
    Ying Liu
    Zheng Wang
    Huimin Xu
    Multimedia Systems, 2024, 30
  • [6] Prior tissue knowledge-driven contrastive learning for brain CT report generation
    Shi, Yanzhao
    Ji, Junzhong
    Zhang, Xiaodan
    Liu, Ying
    Wang, Zheng
    Xu, Huimin
    MULTIMEDIA SYSTEMS, 2024, 30 (02)
  • [7] Computing Text Semantic Similarity with Syntactic Network of Co-occurrence Distance
    Jiao Y.
    Jing M.
    Kang F.
    Data Analysis and Knowledge Discovery, 2019, 3 (12) : 93 - 100
  • [8] Co-occurrence Relationship Encoding via Channel Merging for Vehicle Part Recognition
    Chang, Qinwei
    Sang, Nong
    Gao, Changxin
    MIPPR 2019: PATTERN RECOGNITION AND COMPUTER VISION, 2020, 11430
  • [9] VARIABLE QUEST: Network Visualization of Variable Labels Unifying Co-occurrence Graphs
    Hayashi, Teruaki
    Ohsawa, Yukio
    2017 17TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2017), 2017, : 577 - 583
  • [10] A vision attention driven Language framework for medical report generation
    Arisoy, Merve Varol
    Arisoy, Ayhan
    Uysal, Ilhan
    SCIENTIFIC REPORTS, 2025, 15 (01):