Identification of cancer subtypes by integrating multiple types of transcriptomics data with deep learning in breast cancer

Cited by: 58
Authors
Guo, Yang [1]
Shang, Xuequn [1]
Li, Zhanhuai [1]
Affiliation
[1] Northwestern Polytech Univ, Sch Comp Sci & Engn, Xian 710072, Shaanxi, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Deep learning; Autoencoder; Gene expression; Alternative splicing; Cancer subtype; CLASSIFICATION; DISCOVERY; HETEROGENEITY; NETWORK; EVENTS; GENOME;
DOI
10.1016/j.neucom.2018.03.072
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The identification of cancer subtypes is vital for advancing the precision of cancer diagnosis and therapy. Several studies have integrated multiple types of genomics data to investigate cancer subtypes. However, (1) few of them specifically considered the intrinsic correlations within each type of data; and (2) to the best of our knowledge, none of them considered transcriptome alternative splicing regulation in data integration. In recent years, many cancers have been shown to be related to abnormal alternative splicing regulation. In this paper, we propose a hierarchical deep learning framework, HI-SAE, to integrate gene expression and transcriptome alternative splicing profile data to identify cancer subtypes. We adopt the stacked autoencoder (SAE) neural network to learn high-level representations of each type of data separately, and then integrate all the learned high-level representations through another learning layer to learn more complex data representations. Based on the final learned data representations, we cluster patients into different cancer subtype groups. Comprehensive experiments on TCGA breast cancer data demonstrate that our model provides an effective and useful approach for integrating multiple types of transcriptomics data to identify cancer subtypes, and that the transcriptome alternative splicing data offer distinguishable clues to cancer subtypes. (C) 2018 Elsevier B.V. All rights reserved.
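The pipeline outlined in the abstract (one stacked autoencoder per data type, a further learning layer over the combined codes, then clustering) can be sketched roughly as below. This is a minimal illustration under stated assumptions, not the authors' HI-SAE implementation: the toy data are synthetic rather than TCGA profiles, the layer sizes and training settings are arbitrary, concatenation is assumed as the way the codes are combined, and k-means stands in for the clustering step, which the abstract does not specify. All function names are hypothetical.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def train_autoencoder(X, n_hidden, epochs=200, lr=0.1, seed=0):
    """Fit a one-hidden-layer autoencoder (sigmoid encoder, linear decoder)
    by full-batch gradient descent and return the encoder weights."""
    rng = np.random.default_rng(seed)
    n, d = X.shape
    W1 = rng.normal(0.0, 0.1, (d, n_hidden))
    b1 = np.zeros(n_hidden)
    W2 = rng.normal(0.0, 0.1, (n_hidden, d))
    b2 = np.zeros(d)
    for _ in range(epochs):
        H = sigmoid(X @ W1 + b1)           # hidden code
        R = H @ W2 + b2                    # reconstruction
        err = (R - X) / n                  # gradient of the squared error w.r.t. R
        dH = (err @ W2.T) * H * (1.0 - H)  # back-propagate through the sigmoid
        W2 -= lr * (H.T @ err)
        b2 -= lr * err.sum(axis=0)
        W1 -= lr * (X.T @ dH)
        b1 -= lr * dH.sum(axis=0)
    return W1, b1

def stacked_encode(X, layer_sizes, seed=0):
    """Greedy layer-wise stacking: each autoencoder is trained on the
    codes produced by the previous one."""
    H = X
    for i, size in enumerate(layer_sizes):
        W, b = train_autoencoder(H, size, seed=seed + i)
        H = sigmoid(H @ W + b)
    return H

def kmeans(X, k, iters=50, seed=0):
    """Plain Lloyd's k-means (an assumed choice of clustering method)."""
    rng = np.random.default_rng(seed)
    centers = X[rng.choice(len(X), size=k, replace=False)]
    labels = np.zeros(len(X), dtype=int)
    for _ in range(iters):
        dist = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=-1)
        labels = dist.argmin(axis=1)
        for j in range(k):
            if np.any(labels == j):
                centers[j] = X[labels == j].mean(axis=0)
    return labels

# Synthetic stand-ins for the two data types (NOT real data):
# 60 "patients", 20 expression features and 15 splicing features,
# with the second half of the patients shifted to mimic a second subtype.
rng = np.random.default_rng(42)
shift = np.repeat([0.0, 2.0], 30)[:, None]
expression = rng.normal(0.0, 1.0, (60, 20)) + shift
splicing = rng.normal(0.0, 1.0, (60, 15)) - shift

# Step 1: learn a high-level representation of each data type separately.
expr_code = stacked_encode(expression, [8, 4], seed=1)
splice_code = stacked_encode(splicing, [8, 4], seed=2)

# Step 2: integrate the two codes through one more learned layer.
fused = stacked_encode(np.hstack([expr_code, splice_code]), [4], seed=3)

# Step 3: cluster patients on the fused representation.
labels = kmeans(fused, k=2)
print(fused.shape, np.bincount(labels, minlength=2))
```

Training each data type separately before fusing is what lets the per-type encoders capture correlations within a single data type, while the final layer only has to model relationships across the already-compressed codes.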
Pages: 20-30
Page count: 11