Identification of cancer subtypes by integrating multiple types of transcriptomics data with deep learning in breast cancer

被引:56
作者
Guo, Yang [1 ]
Shang, Xuequn [1 ]
Li, Zhanhuai [1 ]
机构
[1] Northwestern Polytech Univ, Sch Comp Sci & Engn, Xian 710072, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep learning; Autoencoder; Gene expression; Alternative splicing; Cancer subtype; CLASSIFICATION; DISCOVERY; HETEROGENEITY; NETWORK; EVENTS; GENOME;
D O I
10.1016/j.neucom.2018.03.072
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The identification of cancer subtypes is vital to advance the precision of cancer disease diagnosis and therapy. Several works had been done to integrate multiple types of genomics data to investigate cancer subtypes. However, (1) few of them particularly considered the intrinsic correlations in each type of data; (2) to the best of our knowledge, none of them considered transcriptome alternative splicing regulation in data integration. It has been demonstrated that many cancers are related to abnormal alternative splicing regulations in recent years. In this paper, we propose a hierarchical deep learning framework, HI-SAE, to integrate gene expression and transcriptome alternative splicing profiles data to identify cancer subtypes. We adopt the stacked autoencoder (SAE) neural network to learn high-level representations in each type of data, respectively, and then integrate all the learned high-level representations by another learning layer to learn more complex data representations. Based on the final learned data representations, we cluster patients into different cancer subtype groups. Comprehensive experiments based on TCGA breast cancer data demonstrate that our model provides an effective and useful approach to integrate multiple types of transcriptomics data to identify cancer subtypes and the transcriptome alternative splicing data offers distinguishable clues of cancer subtypes. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:20 / 30
页数:11
相关论文
共 44 条
  • [1] [Anonymous], 2017, Complexity
  • [2] [Anonymous], 2017, COMPLEXITY
  • [3] Pupylation sites prediction with ensemble classification model
    Bao, Wenzheng
    Huang, Zhenhua
    Yuan, Chang-An
    Huang, De-Shuang
    [J]. INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2017, 18 (02) : 91 - 104
  • [4] DNA methylation epigenotypes in breast cancer molecular subtypes
    Bediaga, Naiara G.
    Acha-Sagredo, Amelia
    Guerra, Isabel
    Viguri, Amparo
    Albaina, Carmen
    Ruiz Diaz, Irune
    Rezola, Ricardo
    Jesus Alberdi, Maria
    Dopazo, Joaquin
    Montaner, David
    de Renobales, Mertxe
    Fernandez, Agustin F.
    Field, John K.
    Fraga, Mario F.
    Liloglou, Triantafillos
    de Pancorbo, Marian M.
    [J]. BREAST CANCER RESEARCH, 2010, 12 (05):
  • [5] MicroRNA signatures highlight new breast cancer subtypes
    Bhattacharyya, Malay
    Nath, Joyshree
    Bandyopadhyay, Sanghamitra
    [J]. GENE, 2015, 556 (02) : 192 - 198
  • [6] Colorectal Cancer Classification and Cell Heterogeneity: A Systems Oncology Approach
    Blanco-Calvo, Moises
    Concha, Angel
    Figueroa, Angelica
    Garrido, Federico
    Valladares-Ayerbes, Manuel
    [J]. INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2015, 16 (06): : 13610 - 13632
  • [7] Metagenes and molecular pattern discovery using matrix factorization
    Brunet, JP
    Tamayo, P
    Golub, TR
    Mesirov, JP
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 (12) : 4164 - 4169
  • [8] MicroRNA-mRNA interactions underlying colorectal cancer molecular subtypes
    Cantini, Laura
    Isella, Claudio
    Petti, Consalvo
    Picco, Gabriele
    Chiola, Simone
    Ficarra, Elisa
    Caselle, Michele
    Medico, Enzo
    [J]. NATURE COMMUNICATIONS, 2015, 6
  • [9] Mitosis Detection in Breast Cancer Histology Images with Deep Neural Networks
    Ciresan, Dan C.
    Giusti, Alessandro
    Gambardella, Luca M.
    Schmidhuber, Juergen
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION - MICCAI 2013, PT II, 2013, 8150 : 411 - 418
  • [10] Dai XF, 2015, AM J CANCER RES, V5, P2929