Co-Occurrence Relationship Driven Hierarchical Attention Network for Brain CT Report Generation

被引：0

作者：

Zhang, Xiaodan ^{[1
]}

Dou, Shixin ^{[1
]}

Ji, Junzhong ^{[1
]}

Liu, Ying ^{[2
]}

Wang, Zheng ^{[2
]}

机构：

[1] Beijing Univ Technol, Coll Comp Sci, Beijing 100021, Peoples R China

[2] Peking Univ Third Hosp, Dept Radiol, Beijing 100191, Peoples R China

来源：

IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE | 2024年 / 8卷 / 05期

基金：

中国国家自然科学基金;

关键词：

Biomedical imaging; Pathology; Semantics; Visualization; Feature extraction; Computed tomography; Medical diagnostic imaging; Co-occurrence relationship; hierarchical attention mechanism; medical report generation; Brain CT;

D O I：

10.1109/TETCI.2024.3413002

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Automatic generation of medical reports for Brain Computed Tomography (CT) imaging is crucial for helping radiologists make more accurate clinical diagnoses efficiently. Brain CT imaging typically contains rich pathological information, including common pathologies that often co-occur in one report and rare pathologies that appear in medical reports with lower frequency. However, current research ignores the potential co-occurrence between common pathologies and pays insufficient attention to rare pathologies, severely restricting the accuracy and diversity of the generated medical reports. In this paper, we propose a Co-occurrence Relationship Driven Hierarchical Attention Network (CRHAN) to improve Brain CT report generation by mining common and rare pathologies in Brain CT imaging. Specifically, the proposed CRHAN follows a general encoder-decoder framework with two novel attention modules. In the encoder, a co-occurrence relationship guided semantic attention (CRSA) module is proposed to extract the critical semantic features by embedding the co-occurrence relationship of common pathologies into semantic attention. In the decoder, a common-rare topic driven visual attention (CRVA) module is proposed to fuse the common and rare semantic features as sentence topic vectors, and then guide the visual attention to capture important lesion features for medical report generation. Experiments on the Brain CT dataset demonstrate the effectiveness of the proposed method.

引用

页码：3643 / 3653

页数：11

共 30 条

[1] Weakly guided attention model with hierarchical interaction for brain CT report generation
Zhang, Xiaodan
Yang, Sisi
Shi, Yanzhao
Ji, Junzhong
Liu, Ying
Wang, Zheng
Xu, Huimin
COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 167
[2] Discriminative Feature Learning With Co-Occurrence Attention Network for Vehicle ReID
Sheng, Hao
Wang, Shuai
Chen, Haobo
Yang, Da
Huang, Yang
Shen, Jiahao
Ke, Wei
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (05) : 3510 - 3522
[3] Semantic Co-Occurrence and Relationship Modeling for Remote Sensing Image Segmentation
Zhang, Yinxing
Song, Haochen
Wang, Qingwang
Jin, Pengcheng
Shen, Tao
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2025, 18 : 6630 - 6640
[4] Distributed representations of diseases based on co-occurrence relationship
Wang, Haoqing
Mai, Huiyu
Deng, Zhi-hong
Yang, Chao
Zhang, Luxia
Wang, Huai-yu
EXPERT SYSTEMS WITH APPLICATIONS, 2021, 183
[5] Prior tissue knowledge-driven contrastive learning for brain CT report generation
Yanzhao Shi
Junzhong Ji
Xiaodan Zhang
Ying Liu
Zheng Wang
Huimin Xu
Multimedia Systems, 2024, 30
[6] Prior tissue knowledge-driven contrastive learning for brain CT report generation
Shi, Yanzhao
Ji, Junzhong
Zhang, Xiaodan
Liu, Ying
Wang, Zheng
Xu, Huimin
MULTIMEDIA SYSTEMS, 2024, 30 (02)
[7] Computing Text Semantic Similarity with Syntactic Network of Co-occurrence Distance
Jiao Y.
Jing M.
Kang F.
Data Analysis and Knowledge Discovery, 2019, 3 (12) : 93 - 100
[8] Co-occurrence Relationship Encoding via Channel Merging for Vehicle Part Recognition
Chang, Qinwei
Sang, Nong
Gao, Changxin
MIPPR 2019: PATTERN RECOGNITION AND COMPUTER VISION, 2020, 11430
[9] VARIABLE QUEST: Network Visualization of Variable Labels Unifying Co-occurrence Graphs
Hayashi, Teruaki
Ohsawa, Yukio
2017 17TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2017), 2017, : 577 - 583
[10] A vision attention driven Language framework for medical report generation
Arisoy, Merve Varol
Arisoy, Ayhan
Uysal, Ilhan
SCIENTIFIC REPORTS, 2025, 15 (01):

← 1 2 3 →