Bayesian weighted random forest for classification of high-dimensional genomics data

被引:7
|
作者
Olaniran, Oyebayo Ridwan [1 ]
Abdullah, Mohd Asrul A. [2 ]
机构
[1] Univ Ilorin, Dept Stat, Ilorin, Nigeria
[2] UTHM, Dept Math & Stat, FAST, Parit Raja, Johor, Malaysia
关键词
Bayesian; High-dimensional; Genomic data; Classifcation; Random forest; VARIABLE SELECTION; BREAST-CANCER; GENE; PREDICTION; TUMOR; PATTERNS; LEUKEMIA;
D O I
10.1016/j.kjs.2023.06.008
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
In this paper, a full Bayesian weighted probabilistic model is developed for random classification trees. The new model Bayesian Weighted Random Classification Forest (BWRCF) arises from the modification of the existing random classification forest in two ways. Firstly, the tree terminal node estimation procedure is replaced with a Bayesian estimation approach. Secondly, a new variable ranking procedure is developed and then hybridized with BWRCF to tackle the high-dimensionality issues. The performance of the proposed method is analyzed using simulated and real-life high-dimensional microarray datasets based on holdout accuracy and misclassification error rates. The results of the analyses showed that the proposed BWRCF is robust in terms of its ability to withstand moderate to large high-dimensionality scenarios. In addition, BWRCF also has improved predictive and efficiency abilities over selected competing methods.
引用
收藏
页码:477 / 484
页数:8
相关论文
共 50 条
  • [41] Sparse Bayesian multinomial probit regression model with correlation prior for high-dimensional data classification
    Yang Aijun
    Jiang Xuejun
    Liu Pengfei
    Lin Jinguan
    STATISTICS & PROBABILITY LETTERS, 2016, 119 : 241 - 247
  • [42] Nonmonotonic Front Propagation on Weighted Graphs With Applications in Image Processing and High-Dimensional Data Classification
    Desquesnes, Xavier
    Elmoataz, Abderrahim
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2017, 11 (06) : 897 - 907
  • [43] Random forest Granger causality for detection of effective brain connectivity using high-dimensional data
    Furqan, Mohammad Shaheryar
    Siyal, Mohammad Yakoob
    JOURNAL OF INTEGRATIVE NEUROSCIENCE, 2016, 15 (01) : 55 - 66
  • [44] Interaction Detection with Random Forests in High-Dimensional Data
    Winham, Stacey
    Wang, Xin
    de Andrade, Mariza
    Freimuth, Robert
    Colby, Colin
    Huebner, Marianne
    Biernacka, Joanna
    GENETIC EPIDEMIOLOGY, 2012, 36 (02) : 142 - 142
  • [45] Iterative random projections for high-dimensional data clustering
    Cardoso, Angelo
    Wichert, Andreas
    PATTERN RECOGNITION LETTERS, 2012, 33 (13) : 1749 - 1755
  • [46] Random projection ensemble classification with high-dimensional time series
    Zhang, Fuli
    Chan, Kung-Sik
    BIOMETRICS, 2023, 79 (02) : 964 - 974
  • [47] Random projection ensemble conformal prediction for high-dimensional classification
    Qian, Xiaoyu
    Wu, Jinru
    Wei, Ligong
    Lin, Youwu
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2024, 253
  • [48] Classification in High-Dimensional Feature Spaces: Random Subsample Ensemble
    Serpen, Gursel
    Pathical, Santhosh
    EIGHTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2009, : 740 - 745
  • [49] High-Dimensional Bayesian Optimization via Random Projection of Manifold Subspaces
    Nguyen, Quoc-Anh Hoang
    The Hung Tran
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES-RESEARCH TRACK AND DEMO TRACK, PT VIII, ECML PKDD 2024, 2024, 14948 : 288 - 305
  • [50] High-Dimensional Bayesian Optimization Using Both Random and Supervised Embeddings
    Priem, Remy
    Diouane, Youssef
    Bartoli, Nathalie
    Dubreuil, Sylvain
    Saves, Paul
    AIAA JOURNAL, 2024,