Deep-learning approach to identifying cancer subtypes using high-dimensional genomic data

Cited by: 77
Authors
Chen, Runpu [1]
Yang, Le [1]
Goodison, Steve [2]
Sun, Yijun [1, 3, 4]
Affiliations
[1] SUNY Buffalo, Dept Comp Sci & Engn, Buffalo, NY 14214 USA
[2] Mayo Clin, Dept Hlth Sci Res, Jacksonville, FL 32224 USA
[3] SUNY Buffalo, Dept Microbiol & Immunol, Buffalo, NY 14214 USA
[4] SUNY Buffalo, Dept Biostat, Buffalo, NY 14214 USA
Keywords
BREAST-CANCER; MODEL; DISCOVERY; CLUSTERS
DOI
10.1093/bioinformatics/btz769
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Discipline classification code
071010; 081704
Abstract
Motivation: Cancer subtype classification has the potential to significantly improve disease prognosis and develop individualized patient management. Existing methods are limited by their ability to handle extremely high-dimensional data and by the influence of misleading, irrelevant factors, resulting in ambiguous and overlapping subtypes.

Results: To address the above issues, we proposed a novel approach to disentangling and eliminating irrelevant factors by leveraging the power of deep learning. Specifically, we designed a deep-learning framework, referred to as DeepType, that performs joint supervised classification, unsupervised clustering and dimensionality reduction to learn cancer-relevant data representation with cluster structure. We applied DeepType to the METABRIC breast cancer dataset and compared its performance to state-of-the-art methods. DeepType significantly outperformed the existing methods, identifying more robust subtypes while using fewer genes. The new approach provides a framework for the derivation of more accurate and robust molecular cancer subtypes by using increasingly complex, multi-source data.
Pages: 1476-1483
Page count: 8