Cancer Classification with a Cost-Sensitive Naive Bayes Stacking Ensemble

Cited by: 22
Authors
Xiong, Yueling [1]
Ye, Mingquan [1]
Wu, Changrong [2]
Affiliations
[1] Wannan Med Coll, Sch Med Informat, Wuhu 241002, Peoples R China
[2] Anhui Normal Univ, Sch Comp & Informat, Wuhu 241002, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
PARTICLE SWARM OPTIMIZATION; FEATURE-SELECTION; NEURAL-NETWORK; MACHINE;
DOI
10.1155/2021/5556992
Chinese Library Classification number
Q [Biological Sciences];
Discipline classification code
07 ; 0710 ; 09 ;
Abstract
Ensemble learning combines multiple learners, which gives it good flexibility and higher generalization performance. To achieve higher-quality cancer classification, this study used the fast correlation-based feature selection (FCBF) method to preprocess the data and eliminate irrelevant and redundant features. Classification was then carried out with a stacking ensemble learner. A library for support vector machines (LIBSVM), K-nearest neighbor (KNN), the C4.5 decision tree (C4.5), and random forest (RF) were used as the primary learners of the stacking ensemble. Because cancer gene expression data are imbalanced, a cost-sensitive naive Bayes classifier was embedded as the metalearner of the stacking ensemble; the resulting method is denoted CSNB stacking. The proposed CSNB stacking method was applied to nine cancer datasets to verify the classification performance of the model. Compared with other classification methods, including single-classifier algorithms and ensemble algorithms, the experimental results showed the effectiveness and robustness of the proposed method on different types of cancer data. This method may therefore help guide cancer diagnosis and research.
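The abstract does not say how cost sensitivity enters the naive Bayes metalearner. One common way to make a probabilistic classifier cost-sensitive is the minimum-expected-cost decision rule, sketched below under that assumption. The function name `min_expected_cost_class`, the cost matrices, and the posterior values are all hypothetical and chosen only for illustration; in a stacking setup the posteriors would come from the metalearner trained on the primary learners' outputs.

```python
from typing import Sequence


def min_expected_cost_class(
    posteriors: Sequence[float],
    cost: Sequence[Sequence[float]],
) -> int:
    """Return the class index with the lowest expected misclassification cost.

    posteriors[i] is P(true class = i | x).
    cost[i][j] is the cost of predicting class j when the true class is i.
    """
    n_classes = len(posteriors)
    # Expected cost of predicting j, averaged over the possible true classes.
    expected = [
        sum(posteriors[i] * cost[i][j] for i in range(n_classes))
        for j in range(n_classes)
    ]
    return min(range(n_classes), key=expected.__getitem__)


# Hypothetical two-class case: class 0 = majority, class 1 = minority.
uniform_cost = [[0, 1], [1, 0]]  # every error costs the same
skewed_cost = [[0, 1], [5, 0]]   # assumed: missing the minority class costs 5x

posteriors = [0.7, 0.3]          # assumed metalearner output for one sample

plain_prediction = min_expected_cost_class(posteriors, uniform_cost)
cost_sensitive_prediction = min_expected_cost_class(posteriors, skewed_cost)
```

With uniform costs the rule reduces to picking the most probable class. With the skewed matrix, the same posteriors are assigned to the minority class, because the expected cost of predicting class 0 (0.3 × 5 = 1.5) exceeds that of predicting class 1 (0.7 × 1 = 0.7). This is the kind of shift a cost-sensitive learner relies on for imbalanced data.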
Pages: 12