Cancer Classification with a Cost-Sensitive Naive Bayes Stacking Ensemble

Cited by: 22
Authors
Xiong, Yueling [1]
Ye, Mingquan [1]
Wu, Changrong [2]
Affiliations
[1] Wannan Med Coll, Sch Med Informat, Wuhu 241002, Peoples R China
[2] Anhui Normal Univ, Sch Comp & Informat, Wuhu 241002, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
PARTICLE SWARM OPTIMIZATION; FEATURE-SELECTION; NEURAL-NETWORK; MACHINE;
DOI
10.1155/2021/5556992
Chinese Library Classification
Q [Biological Sciences]
Discipline classification codes
07; 0710; 09
Abstract
Ensemble learning combines multiple learners and offers good flexibility and strong generalization performance. To improve cancer classification, this study first used the fast correlation-based feature selection (FCBF) method to preprocess the data and eliminate irrelevant and redundant features. Classification was then carried out by a stacking ensemble learner. LIBSVM (a library for support vector machines), K-nearest neighbor (KNN), the C4.5 decision tree, and random forest (RF) served as the primary learners of the stacking ensemble. Because cancer gene expression data are imbalanced, an embedded cost-sensitive naive Bayes (CSNB) classifier was used as the metalearner; the resulting model is referred to as CSNB stacking. The proposed CSNB stacking method was applied to nine cancer datasets to verify its classification performance. Compared with other classification methods, including single-classifier algorithms and ensemble algorithms, the experimental results showed that the proposed method is effective and robust across different types of cancer data. This method may therefore help guide cancer diagnosis and research.
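The metalearner idea in the abstract can be illustrated with a small sketch. The abstract does not say how cost sensitivity is embedded in the naive Bayes metalearner, so the code below assumes one common approach: fit a categorical naive Bayes on the labels predicted by the primary learners, then choose the class with the minimum expected misclassification cost. The function names, the Laplace smoothing constant, the toy data, and the cost values are all hypothetical and are not taken from the paper.

```python
from collections import Counter, defaultdict


def fit_categorical_nb(meta_features, labels, alpha=1.0):
    """Fit a categorical naive Bayes on primary-learner predictions.

    meta_features: list of tuples, one predicted label per primary learner.
    labels: list of true class labels.
    alpha: Laplace smoothing constant (an assumption, not from the paper).
    """
    n_learners = len(meta_features[0])
    # counts[j][c][v] = how often learner j predicted v when the true class was c
    counts = [defaultdict(Counter) for _ in range(n_learners)]
    for row, y in zip(meta_features, labels):
        for j, v in enumerate(row):
            counts[j][y][v] += 1
    return {
        "classes": sorted(set(labels)),
        "class_counts": Counter(labels),
        "counts": counts,
        "alpha": alpha,
        "n": len(labels),
    }


def posterior(model, row):
    """Return P(class | primary-learner predictions) as a dict."""
    classes = model["classes"]
    k = len(classes)  # each learner outputs one of the k class labels
    scores = {}
    for c in classes:
        p = model["class_counts"][c] / model["n"]  # class prior
        for j, v in enumerate(row):
            seen = model["counts"][j][c][v]
            p *= (seen + model["alpha"]) / (model["class_counts"][c] + model["alpha"] * k)
        scores[c] = p
    total = sum(scores.values())
    return {c: s / total for c, s in scores.items()}


def min_cost_class(post, cost):
    """Pick the prediction with minimum expected cost.

    cost[(true_class, predicted_class)] is the penalty for that outcome.
    """
    def expected_cost(pred):
        return sum(post[t] * cost[(t, pred)] for t in post)
    return min(post, key=expected_cost)


# Toy, imbalanced meta-level training set: two primary learners, 8 normal vs 2 tumor.
rows = ([("normal", "normal")] * 6 + [("normal", "tumor")] * 2
        + [("tumor", "tumor"), ("normal", "tumor")])
truth = ["normal"] * 8 + ["tumor"] * 2
model = fit_categorical_nb(rows, truth)

post = posterior(model, ("normal", "tumor"))

# Equal costs reproduce the ordinary maximum-posterior decision.
equal_cost = {("normal", "normal"): 0, ("normal", "tumor"): 1,
              ("tumor", "normal"): 1, ("tumor", "tumor"): 0}
# Hypothetical skewed costs: missing a tumor is 5 times worse than a false alarm.
skewed_cost = dict(equal_cost)
skewed_cost[("tumor", "normal")] = 5

print(post)
print(min_cost_class(post, equal_cost), min_cost_class(post, skewed_cost))
```

With equal costs the rule reduces to picking the most probable class; raising the cost of a missed minority-class sample shifts borderline cases toward the minority class, which is the motivation for cost-sensitive learning on imbalanced data.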
Pages: 12