Identifying Cancer Subtypes Using a Residual Graph Convolution Model on a Sample Similarity Network

被引:11
作者
Dai, Wei [1 ]
Yue, Wenhao [1 ]
Peng, Wei [1 ,2 ]
Fu, Xiaodong [1 ,2 ]
Liu, Li [1 ,2 ]
Liu, Lijun [1 ,2 ]
机构
[1] Kunming Univ Sci & Technol, Fac Informat Engn & Automat, Kunming 650050, Yunnan, Peoples R China
[2] Kunming Univ Sci & Technol, Comp Technol Applicat Key Lab Yunnan Prov, Kunming 650050, Yunnan, Peoples R China
基金
中国国家自然科学基金;
关键词
residual graph convolutional network; cancer subtype classification; deep learning; sample interaction; BREAST; CLASSIFICATION;
D O I
10.3390/genes13010065
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Cancer subtype classification helps us to understand the pathogenesis of cancer and develop new cancer drugs, treatment from which patients would benefit most. Most previous studies detect cancer subtypes by extracting features from individual samples, ignoring their associations with others. We believe that the interactions of cancer samples can help identify cancer subtypes. This work proposes a cancer subtype classification method based on a residual graph convolutional network and a sample similarity network. First, we constructed a sample similarity network regarding cancer gene co-expression patterns. Then, the gene expression profiles of cancer samples as initial features and the sample similarity network were passed into a two-layer graph convolutional network (GCN) model. We introduced the initial features to the GCN model to avoid over-smoothing during the training process. Finally, the classification of cancer subtypes was obtained through a softmax activation function. Our model was applied to breast invasive carcinoma (BRCA), glioblastoma multiforme (GBM) and lung cancer (LUNG) datasets. The accuracy values of our model reached 82.58%, 85.13% and 79.18% for BRCA, GBM and LUNG, respectively, which outperformed the existing methods. The survival analysis of our results proves the significant clinical features of the cancer subtypes identified by our model. Moreover, we can leverage our model to detect the essential genes enriched in gene ontology (GO) terms and the biological pathways related to a cancer subtype.
引用
收藏
页数:16
相关论文
共 45 条
[1]   A Machine Learning Approach for the Classification of Kidney Cancer Subtypes Using miRNA Genome Data [J].
Ali, Ali Muhamed ;
Zhuang, Hanqi ;
Ibrahim, Ali ;
Rehman, Oneeb ;
Huang, Michelle ;
Wu, Andrew .
APPLIED SCIENCES-BASEL, 2018, 8 (12)
[2]   Tumour heterogeneity in the clinic [J].
Bedard, Philippe L. ;
Hansen, Aaron R. ;
Ratain, Mark J. ;
Siu, Lillian L. .
NATURE, 2013, 501 (7467) :355-364
[3]   The emerging clinical relevance of genomics in cancer medicine [J].
Berger, Michael F. ;
Mardis, Elaine R. .
NATURE REVIEWS CLINICAL ONCOLOGY, 2018, 15 (06) :353-365
[4]   Deep-learning approach to identifying cancer subtypes using high-dimensional genomic data [J].
Chen, Runpu ;
Yang, Le ;
Goodison, Steve ;
Sun, Yijun .
BIOINFORMATICS, 2020, 36 (05) :1476-1483
[5]   TCGAbiolinks: an R/Bioconductor package for integrative analysis of TCGA data [J].
Colaprico, Antonio ;
Silva, Tiago C. ;
Olsen, Catharina ;
Garofano, Luciano ;
Cava, Claudia ;
Garolini, Davide ;
Sabedot, Thais S. ;
Malta, Tathiane M. ;
Pagnotta, Stefano M. ;
Castiglioni, Isabella ;
Ceccarelli, Michele ;
Bontempi, Gianluca ;
Noushmehr, Houtan .
NUCLEIC ACIDS RESEARCH, 2016, 44 (08) :e71
[6]   The genomic and transcriptomic architecture of 2,000 breast tumours reveals novel subgroups [J].
Curtis, Christina ;
Shah, Sohrab P. ;
Chin, Suet-Feung ;
Turashvili, Gulisa ;
Rueda, Oscar M. ;
Dunning, Mark J. ;
Speed, Doug ;
Lynch, Andy G. ;
Samarajiwa, Shamith ;
Yuan, Yinyin ;
Graef, Stefan ;
Ha, Gavin ;
Haffari, Gholamreza ;
Bashashati, Ali ;
Russell, Roslin ;
McKinney, Steven ;
Langerod, Anita ;
Green, Andrew ;
Provenzano, Elena ;
Wishart, Gordon ;
Pinder, Sarah ;
Watson, Peter ;
Markowetz, Florian ;
Murphy, Leigh ;
Ellis, Ian ;
Purushotham, Arnie ;
Borresen-Dale, Anne-Lise ;
Brenton, James D. ;
Tavare, Simon ;
Caldas, Carlos ;
Aparicio, Samuel .
NATURE, 2012, 486 (7403) :346-352
[7]   Network Embedding the Protein-Protein Interaction Network for Human Essential Genes Identification [J].
Dai, Wei ;
Chang, Qi ;
Peng, Wei ;
Zhong, Jiancheng ;
Li, Yongjiang .
GENES, 2020, 11 (02)
[8]  
Dai XF, 2015, AM J CANCER RES, V5, P2929
[9]  
Fakoor R., 2013, P INT C MACH LEARN
[10]   Performance Comparison of Deep Learning Autoencoders for Cancer Subtype Detection Using Multi-Omics Data [J].
Franco, Edian F. ;
Rana, Pratip ;
Cruz, Aline ;
Calderon, Victor V. ;
Azevedo, Vasco ;
Ramos, Rommel T. J. ;
Ghosh, Preetam .
CANCERS, 2021, 13 (09)