FDTrans: Frequency Domain Transformer Model for predicting subtypes of lung cancer using multimodal data

被引：7

作者：

Cai, Meiling ^{[1
]}

Zhao, Lin ^{[1
]}

Hou, Guojie ^{[1
]}

Zhang, Yanan ^{[1
]}

Wu, Wei ^{[2
]}

Jia, Liye ^{[1
]}

Zhao, JuanJuan ^{[1
,3
]}

Wang, Long ^{[3
]}

Qiang, Yan ^{[1
]}

机构：

[1] Taiyuan Univ Technol, Coll Informat & Comp, Taiyuan 030002, Peoples R China

[2] Shanxi Prov Peoples Hosp, Dept Clin Lab, Taiyuan 030002, Peoples R China

[3] Coll Informat, Jinzhong Coll Informat, Jinzhong 030002, Peoples R China

来源：

COMPUTERS IN BIOLOGY AND MEDICINE | 2023年 / 158卷

基金：

中国国家自然科学基金;

关键词：

Deep learning; Histopathological; Lung cancer subtypes; Frequency domain; Multimodal learning; FUSION;

D O I：

10.1016/j.compbiomed.2023.106812

中图分类号：

Q [生物科学];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

Background and purpose: Accurate identification of lung cancer subtypes in medical images is of great significance for the diagnosis and treatment of lung cancer. Despite substantial progress in existing methods, they remain challenging due to limited annotated datasets, large intra-class differences, and high inter-class similarities. Methods: To address these challenges, we propose a Frequency Domain Transformer Model (FDTrans) to identify patients' lung cancer subtypes using the TCGA lung cancer dataset. We add a pre-processing process to transfer histopathological images to the frequency domain using a block-based discrete cosine transform and design a coordinate Coordinate-Spatial Attention Module (CSAM) to obtain critical detail information by reassigning weights to the location information and channel information of different frequency vectors. Then, a Cross-Domain Transformer Block (CDTB) is designed for Y, Cb, and Cr channel features, capturing the long-term dependencies and global contextual connections between different component features. At the same time, feature extraction is performed on the genomic data to obtain specific features. Finally, the image branch and the gene branch are fused, and the classification result is output through the fully connected layer. Results: In 10-fold cross-validation, the method achieves an AUC of 93.16% and overall accuracy of 92.33%, which is better than similar current lung cancer subtypes classification detection methods. Conclusion: This method can help physicians diagnose the subtypes classification of lung cancer in patients and can benefit from both spatial and frequency domain information.

引用

页数：8

共 41 条

[1] Deep Orthogonal Fusion: Multimodal Prognostic Biomarker Discovery Integrating Radiology, Pathology, Genomic, and Clinical Data [J].

Braman, Nathaniel ;

Gordon, Jacob W. H. ;

Goossens, Emery T. ;

Willis, Caleb ;

Stumpe, Martin C. ;

Venkataraman, Jagadish .

MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT V, 2021, 12905 :667-677

[2]

Chen J, 2021, arXiv

[3]

Chen RJ, 2022, IEEE T MED IMAGING, V41, P757, DOI [10.1109/TITS.2020.3030218, 10.1109/TMI.2020.3021387]

[4] Classification and mutation prediction from non-small cell lung cancer histopathology images using deep learning [J].

Coudray, Nicolas ;

Ocampo, Paolo Santiago ;

Sakellaropoulos, Theodore ;

Narula, Navneet ;

Snuderl, Matija ;

Fenyo, David ;

Moreira, Andre L. ;

Razavian, Narges ;

Tsirigos, Aristotelis .

NATURE MEDICINE, 2018, 24 (10) :1559-+

[5] Multi-channel multi-task deep learning for predicting EGFR and KRAS mutations of non-small cell lung cancer on CT images [J].

Dong, Yunyun ;

Hou, Lina ;

Yang, Wenkai ;

Han, Jiahao ;

Wang, Jiawen ;

Qiang, Yan ;

Zhao, Juanjuan ;

Hou, Jiaxin ;

Song, Kai ;

Ma, Yulan ;

Kazihise, Ntikurako Guy Fernand ;

Cui, Yanfen ;

Yang, Xiaotang .

QUANTITATIVE IMAGING IN MEDICINE AND SURGERY, 2021, 11 (06) :2354-2375

[6] Deep Residual Learning in the JPEG Transform Domain [J].

Ehrlich, Max ;

Davis, Larry .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :3483-3492

[7]

Gueguen L, 2018, ADV NEUR IN, V31

[8] Multi-scale Domain-adversarial Multiple-instance CNN for Cancer Subtype Classification with Unannotated Histopathological Images [J].

Hashimoto, Noriaki ;

Fukushima, Daisuke ;

Koga, Ryoichi ;

Takagi, Yusuke ;

Ko, Kaho ;

Kohno, Kei ;

Nakaguro, Masato ;

Nakamura, Shigeo ;

Hontani, Hidekata ;

Takeuchi, Ichiro .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :3851-3860

[9] Coordinate Attention for Efficient Mobile Network Design [J].

Hou, Qibin ;

Zhou, Daquan ;

Feng, Jiashi .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :13708-13717

[10]

Hu J, 2018, PROC CVPR IEEE, P7132, DOI [10.1109/TPAMI.2019.2913372, 10.1109/CVPR.2018.00745]

← 1 2 3 4 5 →