Co-Occurrence Relationship Driven Hierarchical Attention Network for Brain CT Report Generation

Times cited: 0
Authors
Zhang, Xiaodan [1]
Dou, Shixin [1]
Ji, Junzhong [1]
Liu, Ying [2]
Wang, Zheng [2]
Affiliations
[1] Beijing Univ Technol, Coll Comp Sci, Beijing 100021, Peoples R China
[2] Peking Univ Third Hosp, Dept Radiol, Beijing 100191, Peoples R China
Source
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE | 2024, Vol. 8, No. 5
Funding
National Natural Science Foundation of China
Keywords
Biomedical imaging; Pathology; Semantics; Visualization; Feature extraction; Computed tomography; Medical diagnostic imaging; Co-occurrence relationship; hierarchical attention mechanism; medical report generation; Brain CT;
DOI
10.1109/TETCI.2024.3413002
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Automatic generation of medical reports for Brain Computed Tomography (CT) imaging is crucial for helping radiologists make more accurate clinical diagnoses efficiently. Brain CT imaging typically contains rich pathological information, including common pathologies that often co-occur in one report and rare pathologies that appear in medical reports with lower frequency. However, current research ignores the potential co-occurrence between common pathologies and pays insufficient attention to rare pathologies, severely restricting the accuracy and diversity of the generated medical reports. In this paper, we propose a Co-occurrence Relationship Driven Hierarchical Attention Network (CRHAN) to improve Brain CT report generation by mining common and rare pathologies in Brain CT imaging. Specifically, the proposed CRHAN follows a general encoder-decoder framework with two novel attention modules. In the encoder, a co-occurrence relationship guided semantic attention (CRSA) module is proposed to extract the critical semantic features by embedding the co-occurrence relationship of common pathologies into semantic attention. In the decoder, a common-rare topic driven visual attention (CRVA) module is proposed to fuse the common and rare semantic features as sentence topic vectors, and then guide the visual attention to capture important lesion features for medical report generation. Experiments on the Brain CT dataset demonstrate the effectiveness of the proposed method.
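The abstract does not say how the co-occurrence relationship is computed or how it is embedded into semantic attention, so the following is only a minimal sketch of the general idea, not the CRHAN implementation. It assumes that each report has already been reduced to a set of pathology labels, counts how often labels appear together, and uses the resulting conditional frequencies to spread raw relevance scores to frequently co-occurring labels before a softmax. All function names, label names, and toy reports are hypothetical.

```python
import math
from itertools import combinations


def cooccurrence_matrix(reports, labels):
    """Count how often each pair of labels appears in the same report.

    The diagonal holds the number of reports that mention each label.
    """
    index = {label: i for i, label in enumerate(labels)}
    n = len(labels)
    counts = [[0] * n for _ in range(n)]
    for report in reports:
        present = sorted({index[l] for l in report if l in index})
        for i in present:
            counts[i][i] += 1
        for i, j in combinations(present, 2):
            counts[i][j] += 1
            counts[j][i] += 1
    return counts


def conditional_frequencies(counts):
    """Row i holds the fraction of reports with label i that also have label j."""
    cond = []
    for i, row in enumerate(counts):
        total = row[i]
        cond.append([c / total if total else 0.0 for c in row])
    return cond


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    z = sum(exps)
    return [e / z for e in exps]


def cooccurrence_guided_attention(scores, cond):
    """Spread each label's raw score to its co-occurring labels, then normalize."""
    n = len(scores)
    guided = [sum(cond[j][i] * scores[j] for j in range(n)) for i in range(n)]
    return softmax(guided)


# Hypothetical toy data: three reports over three pathology labels.
labels = ["hemorrhage", "edema", "midline shift"]
reports = [
    {"hemorrhage", "edema"},
    {"hemorrhage", "edema", "midline shift"},
    {"hemorrhage"},
]
counts = cooccurrence_matrix(reports, labels)
cond = conditional_frequencies(counts)
# Only "hemorrhage" has a raw score; co-occurrence passes part of it on.
weights = cooccurrence_guided_attention([1.0, 0.0, 0.0], cond)
```

In this toy setting, a label with no raw score of its own still receives attention weight in proportion to how often it accompanies a scored label. A learned model would more likely use embeddings and trainable projections in place of raw counts.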
Pages: 3643-3653
Page count: 11