Segmentation prompts classification: A nnUNet-based 3D transfer learning framework with ROI tokenization and cross-task attention for esophageal cancer T-stage diagnosis

被引:2
作者
Li, Chen [1 ,2 ]
Wang, Runyuan [1 ]
He, Ping [4 ]
Chen, Wei [2 ]
Wu, Wei [3 ]
Wu, Yi [1 ]
机构
[1] Army Med Univ, Mil Med Univ 3, Coll Biomed Engn & Med Imaging, Dept Digital Med, Chongqing, Peoples R China
[2] Natl Univ Def Technol, Sch Comp, Changsha, Peoples R China
[3] Army Med Univ, Mil Med Univ 3, Southwest Hosp, Dept Thorac Surg, Chongqing, Peoples R China
[4] Army Med Univ, Mil Med Univ 3, Southwest Hosp, Dept Cardiac Surg, Chongqing, Peoples R China
基金
中国国家自然科学基金;
关键词
Transfer learning; Cross-task attention; ROI tokenization; nnU-Net; 3D segmentation; 3D classification; Esophageal cancer T-stage diagnosis; CT images; CONVOLUTIONAL NEURAL-NETWORK; SYSTEM; CT;
D O I
10.1016/j.eswa.2024.125067
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The computer-aided diagnosis system for esophageal cancer (EC) holds vital significance in EC diagnosis and treatment making, with a primary focus on accurate segmentation of EC-related organs and classification of EC's T-stage. Above two tasks are closely related and crucial in assisting surgeon segment and diagnose cancer early. Note that this paradigm is still at its infancy and limited by closely related open issues: (1) how to link the complementary relationship between these two tasks and improve the originally poor performance? and (2) how to determine whether the tumor has invaded the surrounding muscle layers from CT images? Aiming at these issues, this study develops nn-TransEC, a 3D transfer learning framework that builds upon nnU-Net and synergizes segmentation and classification. nn-TransEC focuses on prompting fine-grained classification of EC's T-stage with the aid of prior segmentation, which is implemented in two parts: (1) A nnUNet-configured multi-task learning network (nn-MTNet) is designed for complementary segmentation of EC-related organs and classification of EC's T-stages with cross-task attention gates and transfer learning. (2) A knowledge-embedded ROI tokenization method (KRT) is defined to mimic the diagnostic workflow of doctors for classifying EC's T-stage. KRT is implemented by cropping the most concerned regions from entire CT volume based on prior segmentation. Experiments have been conducted on a private dataset collected from 169 patients with confirmed EC through pathological diagnosis. Our proposed nn-TransEC is compared against the state-of-the-art counterparts (e.g., nnU-Net and nnFormer), and results demonstrate that: nn-TransEC excels in all compared methods in multi-organ segmentation and classification of EC's T-stages, with 3D Dice of EC and average AUC of T-stages reaching 0.844 and 0.941, respectively. In contrast, the state-of-the-art method nnFormer achieves 0.814 and 0.927, respectively. Meanwhile, nn-TransEC also outperforms state-of-the-art multi-task learning models in joint segmentation and classification, with Hausdorff Distance of EC and average precision of Tstages reaching 8.497 and 0.845, respectively. In contrast, the state-of-the-art method TransMT-Net achieves 12.206 and 0.730, respectively.
引用
收藏
页数:23
相关论文
共 89 条
[1]  
Atlanta, 2023, Cancer facts & figures.
[2]   COMPUTER-AIDED SYSTEM FOR THE DIAGNOSIS OF HEPATO-BILIARY DISEASES - COMPARISON WITH THE PERFORMANCE OF PHYSICIANS [J].
BEGON, F ;
LOCKHART, AM ;
METREAU, JM ;
DHUMEAUX, D .
MEDICAL INFORMATICS, 1979, 4 (01) :35-42
[3]   STATISTICAL METHODS FOR ASSESSING AGREEMENT BETWEEN TWO METHODS OF CLINICAL MEASUREMENT [J].
BLAND, JM ;
ALTMAN, DG .
LANCET, 1986, 1 (8476) :307-310
[4]   Multitask learning [J].
Caruana, R .
MACHINE LEARNING, 1997, 28 (01) :41-75
[5]   A shape-constraint adversarial framework with instance-normalized spatio-temporal features for inter-fetal membrane segmentation [J].
Casella, Alessandro ;
Moccia, Sara ;
Paladini, Dario ;
Frontoni, Emanuele ;
De Momi, Elena ;
Mattos, Leonard S. .
MEDICAL IMAGE ANALYSIS, 2021, 70
[6]   DRDA-Net: Dense residual dual-shuffle attention network for breast cancer classification using histopathological images [J].
Chattopadhyay, Soham ;
Dey, Arijit ;
Singh, Pawan Kumar ;
Sarkar, Ram .
COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 145
[7]  
Chen J., 2021, arXiv
[8]  
Chen S, 2018, I S BIOMED IMAGING, P1126, DOI 10.1109/ISBI.2018.8363769
[9]  
Chen T, 2020, PR MACH LEARN RES, V119
[10]  
Cicek Ozgun, 2016, Medical Image Computing and Computer-Assisted Intervention - MICCAI 2016. 19th International Conference. Proceedings: LNCS 9901, P424, DOI 10.1007/978-3-319-46723-8_49