TranSQ: Transformer-Based Semantic Query for Medical Report Generation

被引:12
|
作者
Kong, Ming [1 ]
Huang, Zhengxing [1 ]
Kuang, Kun [1 ,2 ]
Zhu, Qiang [1 ]
Wu, Fei [1 ,2 ,3 ,4 ]
机构
[1] Zhejiang Univ, Inst Artificial Intelligence, Hangzhou, Peoples R China
[2] Key Lab Corneal Dis Res Zhejiang Prov, Hangzhou, Peoples R China
[3] Zhejiang Univ, Shanghai Inst Adv Study, Shanghai, Peoples R China
[4] Shanghai AI Lab, Shanghai, Peoples R China
关键词
Medical report generation; Transformer; Auxiliary diagnosis; Deep learning;
D O I
10.1007/978-3-031-16452-1_58
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Medical report generation, which aims at automatically generating coherent reports with multiple sentences for the given medical images, has received growing research interest due to its tremendous potential in facilitating clinical workflow and improving health services. Due to the highly patterned nature of medical reports, each sentence can be viewed as the description of an image observation with a specific purpose. To this end, this study proposes a novel Transformer-based Semantic Query (TranSQ) model that treats the medical report generation as a direct set prediction problem. Specifically, our model generates a set of semantic features to match plausible clinical concerns and compose the report with sentence retrieval and selection. Experimental results on two prevailing radiology report datasets, i.e., IU X-Ray and MIMIC-CXR, demonstrate that our model outperforms state-of-the-art models on the generation task in terms of both language generation effectiveness and clinical efficacy, which highlights the utility of our approach in generating medical reports with topics of clinical concern as well as sentence-level visual-semantic attention mappings. The source code is available at https://github.com/zjukongming/TranSQ.
引用
收藏
页码:610 / 620
页数:11
相关论文
共 50 条
  • [21] Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation
    Cui, Yiming
    Yang, Linjie
    Yu, Haichao
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202
  • [22] A Transformer-based Semantic Segmentation Model for Street Fashion Images
    Peng, Dingjie
    Kameyama, Wataru
    INTERNATIONAL WORKSHOP ON ADVANCED IMAGING TECHNOLOGY, IWAIT 2023, 2023, 12592
  • [23] TRANSFORMER-BASED METHOD FOR SEMANTIC SEGMENTATION AND RECONSTRUCTION OF THE MARTIAN SURFACE
    Li, Z.
    Wu, B.
    Chen, Z.
    Ma, Y.
    GEOSPATIAL WEEK 2023, VOL. 48-1, 2023, : 1643 - 1649
  • [24] Transformer-Based Approach for Automatic Semantic Financial Document Verification
    Toprak, Ahmet
    Turan, Metin
    IEEE ACCESS, 2024, 12 : 184327 - 184349
  • [25] Recent progress in transformer-based medical image analysis
    Liu, Zhaoshan
    Lv, Qiujie
    Yang, Ziduo
    Li, Yifan
    Lee, Chau Hung
    Shen, Lei
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 164
  • [26] Transformer-Based Classification of User Queries for Medical Consultancy
    Lyutkin, D. A.
    Pozdnyakov, D. V.
    Soloviev, A. A.
    Zhukov, D. V.
    Malik, M. S. I.
    Ignatov, D. I.
    AUTOMATION AND REMOTE CONTROL, 2024, 85 (03) : 297 - 308
  • [27] Transformer-based Question Text Generation in the Learning System
    Li, Jiajun
    Song, Huazhu
    Li, Jun
    6TH INTERNATIONAL CONFERENCE ON INNOVATION IN ARTIFICIAL INTELLIGENCE, ICIAI2022, 2022, : 50 - 56
  • [28] Applying Transformer-Based Text Summarization for Keyphrase Generation
    Glazkova A.V.
    Morozov D.A.
    Lobachevskii Journal of Mathematics, 2023, 44 (1) : 123 - 136
  • [29] Transformer-Based Graph Neural Networks for Outfit Generation
    Becattini, Federico
    Teotini, Federico Maria
    Bimbo, Alberto Del
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2024, 12 (01) : 213 - 223
  • [30] A Transformer-based Medical Visual Question Answering Model
    Liu, Lei
    Su, Xiangdong
    Guo, Hui
    Zhu, Daobin
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 1712 - 1718