Deep-learning approach to identifying cancer subtypes using high-dimensional genomic data

被引:77
|
作者
Chen, Runpu [1 ]
Yang, Le [1 ]
Goodison, Steve [2 ]
Sun, Yijun [1 ,3 ,4 ]
机构
[1] SUNY Buffalo, Dept Comp Sci & Engn, Buffalo, NY 14214 USA
[2] Mayo Clin, Dept Hlth Sci Res, Jacksonville, FL 32224 USA
[3] SUNY Buffalo, Dept Microbiol & Immunol, Buffalo, NY 14214 USA
[4] SUNY Buffalo, Dept Biostat, Buffalo, NY 14214 USA
关键词
BREAST-CANCER; MODEL; DISCOVERY; CLUSTERS;
D O I
10.1093/bioinformatics/btz769
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Cancer subtype classification has the potential to significantly improve disease prognosis and develop individualized patient management. Existing methods are limited by their ability to handle extremely high-dimensional data and by the influence of misleading, irrelevant factors, resulting in ambiguous and overlapping subtypes. Results: To address the above issues, we proposed a novel approach to disentangling and eliminating irrelevant factors by leveraging the power of deep learning. Specifically, we designed a deep-learning framework, referred to as DeepType, that performs joint supervised classification, unsupervised clustering and dimensionality reduction to learn cancer-relevant data representation with cluster structure. We applied DeepType to the METABRIC breast cancer dataset and compared its performance to state-of-the-art methods. DeepType significantly outperformed the existing methods, identifying more robust subtypes while using fewer genes. The new approach provides a framework for the derivation of more accurate and robust molecular cancer subtypes by using increasingly complex, multi-source data.
引用
收藏
页码:1476 / 1483
页数:8
相关论文
共 50 条
  • [41] Learning to visualise high-dimensional data
    Ahmad, K
    Vrusias, B
    EIGHTH INTERNATIONAL CONFERENCE ON INFORMATION VISUALISATION, PROCEEDINGS, 2004, : 507 - 512
  • [42] Learning high-dimensional multimedia data
    Zhu, Xiaofeng
    Jin, Zhi
    Ji, Rongrong
    MULTIMEDIA SYSTEMS, 2017, 23 (03) : 281 - 283
  • [43] Clustering High-Dimensional Stock Data using Data Mining Approach
    Indriyanti, Dhea
    Dhini, Arian
    2019 16TH INTERNATIONAL CONFERENCE ON SERVICE SYSTEMS AND SERVICE MANAGEMENT (ICSSSM2019), 2019,
  • [44] Deep learning for high-dimensional reliability analysis
    Li, Mingyang
    Wang, Zequn
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2020, 139
  • [45] Novel machine learning approach for classification of high-dimensional microarray data
    Rabia Aziz Musheer
    C. K. Verma
    Namita Srivastava
    Soft Computing, 2019, 23 : 13409 - 13421
  • [46] Novel machine learning approach for classification of high-dimensional microarray data
    Musheer, Rabia Aziz
    Verma, C. K.
    Srivastava, Namita
    SOFT COMPUTING, 2019, 23 (24) : 13409 - 13421
  • [47] Identifying a Minimal Class of Models for High-dimensional Data
    Nevo, Daniel
    Ritov, Ya'acov
    JOURNAL OF MACHINE LEARNING RESEARCH, 2017, 18
  • [48] High-dimensional imaging using combinatorial channel multiplexing and deep learning
    Ben-Uri, Raz
    Ben Shabat, Lior
    Shainshein, Dana
    Bar-Tal, Omer
    Bussi, Yuval
    Maimon, Noa
    Haran, Tal Keidar
    Milo, Idan
    Goliand, Inna
    Addadi, Yoseph
    Salame, Tomer Meir
    Rochwarger, Alexander
    Schuerch, Christian M.
    Bagon, Shai
    Elhanani, Ofer
    Keren, Leeat
    NATURE BIOTECHNOLOGY, 2025,
  • [49] Solving high-dimensional optimal stopping problems using deep learning
    Becker, Sebastian
    Cheridito, Patrick
    Jentzen, Arnulf
    Welti, Timo
    EUROPEAN JOURNAL OF APPLIED MATHEMATICS, 2021, 32 (03) : 470 - 514
  • [50] Solving high-dimensional partial differential equations using deep learning
    Han, Jiequn
    Jentzen, Arnulf
    Weinan, E.
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2018, 115 (34) : 8505 - 8510