Cancer Classification with a Cost-Sensitive Naive Bayes Stacking Ensemble

被引:22
|
作者
Xiong, Yueling [1 ]
Ye, Mingquan [1 ]
Wu, Changrong [2 ]
机构
[1] Wannan Med Coll, Sch Med Informat, Wuhu 241002, Peoples R China
[2] Anhui Normal Univ, Sch Comp & Informat, Wuhu 241002, Peoples R China
基金
中国国家自然科学基金;
关键词
PARTICLE SWARM OPTIMIZATION; FEATURE-SELECTION; NEURAL-NETWORK; MACHINE;
D O I
10.1155/2021/5556992
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Ensemble learning combines multiple learners to perform combinatorial learning, which has advantages of good flexibility and higher generalization performance. To achieve higher quality cancer classification, in this study, the fast correlation-based feature selection (FCBF) method was used to preprocess the data to eliminate irrelevant and redundant features. Then, the classification was carried out in the stacking ensemble learner. A library for support vector machine (LIBSVM), K-nearest neighbor (KNN), decision tree C4.5 (C4.5), and random forest (RF) were used as the primary learners of the stacking ensemble. Given the imbalanced characteristics of cancer gene expression data, the embedding cost-sensitive naive Bayes was used as the metalearner of the stacking ensemble, which was represented as CSNB stacking. The proposed CSNB stacking method was applied to nine cancer datasets to further verify the classification performance of the model. Compared with other classification methods, such as single classifier algorithms and ensemble algorithms, the experimental results showed the effectiveness and robustness of the proposed method in processing different types of cancer data. This method may therefore help guide cancer diagnosis and research.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] Heterogeneous fault prediction with cost-sensitive domain adaptation
    Li, Zhiqiang
    Jing, Xiao-Yuan
    Zhu, Xiaoke
    SOFTWARE TESTING VERIFICATION & RELIABILITY, 2018, 28 (02):
  • [42] Cost-sensitive Dictionary Learning for Software Defect Prediction
    Niu, Liang
    Wan, Jianwu
    Wang, Hongyuan
    Zhou, Kaiwei
    NEURAL PROCESSING LETTERS, 2020, 52 (03) : 2415 - 2449
  • [43] Weighted Learning Vector Quantization to Cost-Sensitive Learning
    Chen, Ning
    Ribeiro, Bernardete
    Vieira, Armando
    Duarte, Joao
    Neves, Joao
    ARTIFICIAL NEURAL NETWORKS (ICANN 2010), PT III, 2010, 6354 : 277 - +
  • [44] Cost-Sensitive Feature Selection for Class Imbalance Problem
    Bach, Malgorzata
    Werner, Aleksandra
    INFORMATION SYSTEMS ARCHITECTURE AND TECHNOLOGY, PT I, 2018, 655 : 182 - 194
  • [45] Optimal cost-sensitive credit scoring using a new hybrid performance metric
    Khalili, Nasser
    Rastegar, Mohamad Ali
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 213
  • [46] A stacking ensemble deep learning approach to cancer type classification based on TCGA data
    Mohammed, Mohanad
    Mwambi, Henry
    Mboya, Innocent B.
    Elbashir, Murtada K.
    Omolo, Bernard
    SCIENTIFIC REPORTS, 2021, 11 (01)
  • [47] Cost-sensitive Feature Selection for Support Vector Machines
    Benitez-Pena, S.
    Blanquero, R.
    Carrizosa, E.
    Ramirez-Cobo, P.
    COMPUTERS & OPERATIONS RESEARCH, 2019, 106 : 169 - 178
  • [48] Text Classification Based on Naive Bayes Algorithm with Feature Selection
    Chen, Zhenguo
    Shi, Guang
    Wang, Xiaoju
    INFORMATION-AN INTERNATIONAL INTERDISCIPLINARY JOURNAL, 2012, 15 (10): : 4255 - 4260
  • [49] Improved Naive Bayes with optimal correlation factor for text classification
    Chen, Jiangning
    Dai, Zhibo
    Duan, Juntao
    Matzinger, Heinrich
    Popescu, Ionel
    SN APPLIED SCIENCES, 2019, 1 (09):
  • [50] Linear Cost-sensitive Max-margin Embedded Feature Selection for SVM
    Aram, Khalid Y.
    Lam, Sarah S.
    Khasawneh, Mohammad T.
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 197