Attribute Prototype-Guided Iterative Scene Graph for Explainable Radiology Report Generation

被引:1
|
作者
Zhang, Ke [1 ]
Yang, Yan [1 ]
Yu, Jun [2 ,3 ]
Fan, Jianping [4 ]
Jiang, Hanliang [5 ]
Huang, Qingming [6 ]
Han, Weidong [5 ,6 ]
机构
[1] Hangzhou Dianzi Univ, Sch Comp Sci & Technol, Key Lab Complex Syst Modeling & Simulat, Hangzhou 310018, Peoples R China
[2] Harbin Inst Technol, Dept Comp Sci & Technol, Shenzhen 518055, Peoples R China
[3] Hangzhou Dianzi Univ, Sch Comp Sci & Technol, Hangzhou 310018, Peoples R China
[4] Lenovo Res, AI Lab, Beijing 100094, Peoples R China
[5] Zhejiang Univ, Sir Run Run Shaw Hosp, Natl Inst Resp Dis, Coll Med,Reg Med Ctr, Hangzhou 310016, Peoples R China
[6] Zhejiang Normal Univ, Coll Math Med, Jinhua 321017, Peoples R China
基金
中国国家自然科学基金;
关键词
Semantics; Prototypes; Biomedical imaging; Lung; Cognition; Iterative methods; Visualization; Radiology report generation; scene graph generation; prototype learning; interpretability;
D O I
10.1109/TMI.2024.3424505
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The potential of automated radiology report generation in alleviating the time-consuming tasks of radiologists is increasingly being recognized in medical practice. Existing report generation methods have evolved from using image-level features to the latest approach of utilizing anatomical regions, significantly enhancing interpretability. However, directly and simplistically using region features for report generation compromises the capability of relation reasoning and overlooks the common attributes potentially shared across regions. To address these limitations, we propose a novel region-based Attribute Prototype-guided Iterative Scene Graph generation framework (AP-ISG) for report generation, utilizing scene graph generation as an auxiliary task to further enhance interpretability and relational reasoning capability. The core components of AP-ISG are the Iterative Scene Graph Generation (ISGG) module and the Attribute Prototype-guided Learning (APL) module. Specifically, ISSG employs an autoregressive scheme for structural edge reasoning and a contextualization mechanism for relational reasoning. APL enhances intra-prototype matching and reduces inter-prototype semantic overlap in the visual space to fully model the potential attribute commonalities among regions. Extensive experiments on the MIMIC-CXR with Chest ImaGenome datasets demonstrate the superiority of AP-ISG across multiple metrics.
引用
收藏
页码:4470 / 4482
页数:13
相关论文
共 16 条
  • [1] Classwise Prototype-Guided Alignment Network for Cross-Scene Hyperspectral Image Classification
    Xie, Zhuojun
    Duan, Puhong
    Kang, Xudong
    Liu, Wang
    Li, Shutao
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [2] Cross-Modal Prototype Driven Network for Radiology Report Generation
    Wang, Jun
    Bhalerao, Abhir
    He, Yulan
    COMPUTER VISION - ECCV 2022, PT XXXV, 2022, 13695 : 563 - 579
  • [3] Decomposed Prototype Learning for Few-Shot Scene Graph Generation
    Li, Xingchen
    Xiao, Jun
    Chen, Guikun
    Feng, Yinfu
    Yang, Yi
    Liu, An-an
    Chen, Long
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2025, 21 (01)
  • [4] A causality guided loss for imbalanced learning in scene graph generation
    Peng, Ru
    Zhao, Chao
    Chen, Xingyu
    Wang, Ziru
    Liu, Yaxin
    Liu, Yulong
    Lan, Xuguang
    NEUROCOMPUTING, 2024, 599
  • [5] Hypercomplex context guided interaction modeling for scene graph generation
    Wang, Zheng
    Xu, Xing
    Luo, Yadan
    Wang, Guoqing
    Yang, Yang
    PATTERN RECOGNITION, 2023, 141
  • [6] SGG-MVAR: Cross-Modal Retrieval With Scene Graph Generation and Multiview Attribute Relationship Guidance
    Wang, Suping
    Zhou, Fei
    Yang, Ming
    Shi, Lei
    Tan, Chaohong
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2025,
  • [7] OG-SGG: Ontology-Guided Scene Graph Generation-A Case Study in Transfer Learning for Telepresence Robotics
    Amodeo, Fernando
    Caballero, Fernando
    Diaz-Rodriguez, Natalia
    Merino, Luis
    IEEE ACCESS, 2022, 10 : 132564 - 132583
  • [8] Trust it or not: Confidence-guided automatic radiology report generation
    Wang, Yixin
    Lin, Zihao
    Xu, Zhe
    Dong, Haoyu
    Luo, Jie
    Tian, Jiang
    Shi, Zhongchao
    Huang, Lifu
    Zhang, Yang
    Fan, Jianping
    He, Zhiqiang
    NEUROCOMPUTING, 2024, 578
  • [9] Intensive vision-guided network for radiology report generation
    Zheng, Fudan
    Li, Mengfei
    Wang, Ying
    Yu, Weijiang
    Wang, Ruixuan
    Chen, Zhiguang
    Xiao, Nong
    Lu, Yutong
    PHYSICS IN MEDICINE AND BIOLOGY, 2024, 69 (04)
  • [10] Eye Gaze Guided Cross-Modal Alignment Network for Radiology Report Generation
    Peng, Peixi
    Fan, Wanshu
    Shen, Yue
    Liu, Wenfei
    Yang, Xin
    Zhang, Qiang
    Wei, Xiaopeng
    Zhou, Dongsheng
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2024, 28 (12) : 7406 - 7419