A Hybrid Deep Learning Model for Predicting Molecular Subtypes of Human Breast Cancer Using Multimodal Data

被引:62
作者
Liu, T. [1 ]
Huang, J. [1 ]
Liao, T. [2 ]
Pu, R. [2 ]
Liu, S. [2 ]
Peng, Y. [1 ]
机构
[1] Sichuan Agr Univ, Coll Informat Engn, Yaan 0086625014, Peoples R China
[2] Sichuan Agr Univ, Coll Sci, Yaan 0086625014, Peoples R China
关键词
Breast cancer subtypes; Deep learning; Prediction; Multimodal fusion; Image filtering; NEURAL-NETWORKS; CLASSIFICATION;
D O I
10.1016/j.irbm.2020.12.002
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Background: The prediction of breast cancer subtypes plays a key role in the diagnosis and prognosis of breast cancer. In recent years, deep learning (DL) has shown good performance in the intelligent prediction of breast cancer subtypes. However, most of the traditional DL models use single modality data, which can just extract a few features, so it cannot establish a stable relationship between patient characteristics and breast cancer subtypes. Dataset: We used the TCGA-BRCA dataset as a sample set for molecular subtype prediction of breast cancer. It is a public dataset that can be obtained through the following link: https://portal .gdc .cancer. gov /projects /TCGA-BRCA Methods: In this paper, a Hybrid DL model based on the multimodal data is proposed. We combine the patient's gene modality data with image modality data to construct a multimodal fusion framework. According to the different forms and states, we set up feature extraction networks respectively, and then we fuse the output of the two feature networks based on the idea of weighted linear aggregation. Finally, the fused features are used to predict breast cancer subtypes. In particular, we use the principal component analysis to reduce the dimensionality of high-dimensional data of gene modality and filter the data of image modality. Besides, we also improve the traditional feature extraction network to make it show better performance. Results: The results show that compared with the traditional DL model, the Hybrid DL model proposed in this paper is more accurate and efficient in predicting breast cancer subtypes. Our model achieved a prediction accuracy of 88.07% in 10 times of 10-fold cross-validation. We did a separate AUC test for each subtype, and the average AUC value obtained was 0.9427. In terms of subtype prediction accuracy, our model is about 7.45% higher than the previous average. (c) 2021 AGBM. Published by Elsevier Masson SAS. All rights reserved.
引用
收藏
页码:62 / 74
页数:13
相关论文
共 75 条
[1]  
Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
[2]   Deep convolutional network for breast cancer classification: enhanced loss function (ELF) [J].
Acharya, Smarika ;
Alsadoon, Abeer ;
Prasad, P. W. C. ;
Abdullah, Salma ;
Deva, Anand .
JOURNAL OF SUPERCOMPUTING, 2020, 76 (11) :8548-8565
[3]  
Ajantha Devi V, 2020, DEEP LEARNING CANC D, P1
[4]   Evaluation of deep learning detection and classification towards computer-aided diagnosis of breast lesions in digital X-ray mammograms [J].
Al-antari, Mugahed A. ;
Han, Seung-Moo ;
Kim, Tae-Seong .
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2020, 196
[5]   Machine Learning from Theory to Algorithms: An Overview [J].
Alzubi, Jafar ;
Nayyar, Anand ;
Kumar, Akshi .
SECOND NATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE (NCCI 2018), 2018, 1142
[6]  
[Anonymous], 1989, PRINCIPAL COMPONENTS
[7]  
[Anonymous], 1997, P 13 C UNCERTAINTY A
[8]  
[Anonymous], 2017, ARXIV171204711
[9]  
[Anonymous], 2015, 3 INT C LEARN REPR I
[10]   Representation Learning: A Review and New Perspectives [J].
Bengio, Yoshua ;
Courville, Aaron ;
Vincent, Pascal .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (08) :1798-1828