A laminar augmented cascading flexible neural forest model for classification of cancer subtypes based on gene expression data

被引:4
作者
Zhong, Lianxin [1 ,2 ]
Meng, Qingfang [1 ,2 ]
Chen, Yuehui [1 ,2 ]
Du, Lei [1 ,2 ]
Wu, Peng [1 ,2 ]
机构
[1] Univ Jinan, Sch Informat Sci & Engn, Jinan, Peoples R China
[2] Shandong Prov Key Lab Network Based Intelligent C, Jinan 250022, Peoples R China
基金
中国国家自然科学基金;
关键词
Cancer subtype; Cascade forest; Classification; Deep learning; Ensemble methods; ENSEMBLE; TREES;
D O I
10.1186/s12859-021-04391-2
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background Correctly classifying the subtypes of cancer is of great significance for the in-depth study of cancer pathogenesis and the realization of personalized treatment for cancer patients. In recent years, classification of cancer subtypes using deep neural networks and gene expression data has gradually become a research hotspot. However, most classifiers may face overfitting and low classification accuracy when dealing with small sample size and high-dimensional biology data. Results In this paper, a laminar augmented cascading flexible neural forest (LACFNForest) model was proposed to complete the classification of cancer subtypes. This model is a cascading flexible neural forest using deep flexible neural forest (DFNForest) as the base classifier. A hierarchical broadening ensemble method was proposed, which ensures the robustness of classification results and avoids the waste of model structure and function as much as possible. We also introduced an output judgment mechanism to each layer of the forest to reduce the computational complexity of the model. The deep neural forest was extended to the densely connected deep neural forest to improve the prediction results. The experiments on RNA-seq gene expression data showed that LACFNForest has better performance in the classification of cancer subtypes compared to the conventional methods. Conclusion The LACFNForest model effectively improves the accuracy of cancer subtype classification with good robustness. It provides a new approach for the ensemble learning of classifiers in terms of structural design.
引用
收藏
页数:17
相关论文
共 39 条
  • [1] [Anonymous], 2017, DEEP FOREST ALTERNAT
  • [2] MicroRNA-mRNA interactions underlying colorectal cancer molecular subtypes
    Cantini, Laura
    Isella, Claudio
    Petti, Consalvo
    Picco, Gabriele
    Chiola, Simone
    Ficarra, Elisa
    Caselle, Michele
    Medico, Enzo
    [J]. NATURE COMMUNICATIONS, 2015, 6
  • [3] Chang C C C, 2011, LIBSVM: a library for support vector machines
  • [4] Gene selection for cancer identification: a decision tree model empowered by particle swarm optimization algorithm
    Chen, Kun-Huang
    Wang, Kung-Jeng
    Tsai, Min-Lung
    Wang, Kung-Min
    Adrian, Angelia Melani
    Cheng, Wei-Chung
    Yang, Tzu-Sen
    Teng, Nai-Chia
    Tan, Kuo-Pin
    Chang, Ku-Shang
    [J]. BMC BIOINFORMATICS, 2014, 15
  • [5] Time-series forecasting using flexible neural tree model
    Chen, YH
    Yang, B
    Dong, JW
    Abraham, A
    [J]. INFORMATION SCIENCES, 2005, 174 (3-4) : 219 - 235
  • [6] Flexible neural trees ensemble for stock index modeling
    Chen, Yuehui
    Yang, Bo
    Abraham, Ajith
    [J]. NEUROCOMPUTING, 2007, 70 (4-6) : 697 - 703
  • [7] Feature selection and classification using flexible neural tree
    Chen, Yuehui
    Abraham, Ajith
    Yang, Bo
    [J]. NEUROCOMPUTING, 2006, 70 (1-3) : 305 - 313
  • [8] CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411
  • [9] Dai XF, 2015, AM J CANCER RES, V5, P2929