FDTrans: Frequency Domain Transformer Model for predicting subtypes of lung cancer using multimodal data

被引:3
作者
Cai, Meiling [1 ]
Zhao, Lin [1 ]
Hou, Guojie [1 ]
Zhang, Yanan [1 ]
Wu, Wei [2 ]
Jia, Liye [1 ]
Zhao, JuanJuan [1 ,3 ]
Wang, Long [3 ]
Qiang, Yan [1 ]
机构
[1] Taiyuan Univ Technol, Coll Informat & Comp, Taiyuan 030002, Peoples R China
[2] Shanxi Prov Peoples Hosp, Dept Clin Lab, Taiyuan 030002, Peoples R China
[3] Coll Informat, Jinzhong Coll Informat, Jinzhong 030002, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep learning; Histopathological; Lung cancer subtypes; Frequency domain; Multimodal learning; FUSION;
D O I
10.1016/j.compbiomed.2023.106812
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background and purpose: Accurate identification of lung cancer subtypes in medical images is of great significance for the diagnosis and treatment of lung cancer. Despite substantial progress in existing methods, they remain challenging due to limited annotated datasets, large intra-class differences, and high inter-class similarities. Methods: To address these challenges, we propose a Frequency Domain Transformer Model (FDTrans) to identify patients' lung cancer subtypes using the TCGA lung cancer dataset. We add a pre-processing process to transfer histopathological images to the frequency domain using a block-based discrete cosine transform and design a coordinate Coordinate-Spatial Attention Module (CSAM) to obtain critical detail information by reassigning weights to the location information and channel information of different frequency vectors. Then, a Cross-Domain Transformer Block (CDTB) is designed for Y, Cb, and Cr channel features, capturing the long-term dependencies and global contextual connections between different component features. At the same time, feature extraction is performed on the genomic data to obtain specific features. Finally, the image branch and the gene branch are fused, and the classification result is output through the fully connected layer. Results: In 10-fold cross-validation, the method achieves an AUC of 93.16% and overall accuracy of 92.33%, which is better than similar current lung cancer subtypes classification detection methods. Conclusion: This method can help physicians diagnose the subtypes classification of lung cancer in patients and can benefit from both spatial and frequency domain information.
引用
收藏
页数:8
相关论文
共 41 条
  • [1] Deep Orthogonal Fusion: Multimodal Prognostic Biomarker Discovery Integrating Radiology, Pathology, Genomic, and Clinical Data
    Braman, Nathaniel
    Gordon, Jacob W. H.
    Goossens, Emery T.
    Willis, Caleb
    Stumpe, Martin C.
    Venkataraman, Jagadish
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT V, 2021, 12905 : 667 - 677
  • [2] Chen J., 2021, arXiv
  • [3] Chen RJ, 2022, IEEE T MED IMAGING, V41, P757, DOI [10.1109/TMI.2020.3021387, 10.1109/TITS.2020.3030218]
  • [4] Classification and mutation prediction from non-small cell lung cancer histopathology images using deep learning
    Coudray, Nicolas
    Ocampo, Paolo Santiago
    Sakellaropoulos, Theodore
    Narula, Navneet
    Snuderl, Matija
    Fenyo, David
    Moreira, Andre L.
    Razavian, Narges
    Tsirigos, Aristotelis
    [J]. NATURE MEDICINE, 2018, 24 (10) : 1559 - +
  • [5] Multi-channel multi-task deep learning for predicting EGFR and KRAS mutations of non-small cell lung cancer on CT images
    Dong, Yunyun
    Hou, Lina
    Yang, Wenkai
    Han, Jiahao
    Wang, Jiawen
    Qiang, Yan
    Zhao, Juanjuan
    Hou, Jiaxin
    Song, Kai
    Ma, Yulan
    Kazihise, Ntikurako Guy Fernand
    Cui, Yanfen
    Yang, Xiaotang
    [J]. QUANTITATIVE IMAGING IN MEDICINE AND SURGERY, 2021, 11 (06) : 2354 - 2375
  • [6] Deep Residual Learning in the JPEG Transform Domain
    Ehrlich, Max
    Davis, Larry
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3483 - 3492
  • [7] Gueguen L, 2018, ADV NEUR IN, V31
  • [8] Multi-scale Domain-adversarial Multiple-instance CNN for Cancer Subtype Classification with Unannotated Histopathological Images
    Hashimoto, Noriaki
    Fukushima, Daisuke
    Koga, Ryoichi
    Takagi, Yusuke
    Ko, Kaho
    Kohno, Kei
    Nakaguro, Masato
    Nakamura, Shigeo
    Hontani, Hidekata
    Takeuchi, Ichiro
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 3851 - 3860
  • [9] Coordinate Attention for Efficient Mobile Network Design
    Hou, Qibin
    Zhou, Daquan
    Feng, Jiashi
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 13708 - 13717
  • [10] Hu J, 2018, PROC CVPR IEEE, P7132, DOI [10.1109/TPAMI.2019.2913372, 10.1109/CVPR.2018.00745]