Predicting the Performance of Ensemble Classification Using Conditional Joint Probability

Cited by: 1
Authors
Murtza, Iqbal [1, 2]
Kim, Jin-Young [3]
Adnan, Muhammad [4]
Affiliations
[1] Chonnam Natl Univ, Educ & Res Ctr IoT Convergence Intelligent City Sa, Gwangju 61186, South Korea
[2] Air Univ, Fac Comp & AI, Dept Creat Technol, Islamabad 44230, Pakistan
[3] Chonnam Natl Univ, Dept Intelligent Elect & Comp Engn, Gwangju 61186, South Korea
[4] UiT Arctic Univ Norway, Dept Technol & Safety, N-9019 Tromso, Norway
Funding
National Research Foundation, Singapore
Keywords
machine learning; probability theory; ensemble classification; cost-sensitive learning; binary classification;
DOI
10.3390/math12162586
Chinese Library Classification number
O1 [Mathematics]
Discipline classification code
0701; 070101
Abstract
In many machine learning applications, a single classifier does not deliver satisfactory performance. In such cases, an ensemble classifier is constructed from several weak base learners to reach the required performance. Unfortunately, ensemble construction is empirical: an ensemble is built and tried, and if its performance is unsatisfactory it is discarded. This paper considers the challenging analytical problem of estimating the performance of an ensemble classifier from the prediction performance of its base learners. The proposed formulation aims to estimate ensemble performance without actually building the ensemble, and it is derived from probability theory by manipulating the decision probabilities of the base learners. For this purpose, the output of a base learner (a true positive, true negative, false positive, or false negative) is treated as a random variable. The effects of logical disjunction-based and majority voting-based decision combination strategies are then analyzed from the perspective of conditional joint probability. The performance forecast by the proposed methodology is evaluated on publicly available standard datasets. The results show that the derived formulations are effective at estimating the performance of ensemble classification. In addition, the theoretical and experimental results show that the logical disjunction-based decision outperforms majority voting on imbalanced datasets and in cost-sensitive scenarios.
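The idea of forecasting ensemble rates from base-learner rates can be illustrated with a minimal sketch. This is not the paper's formulation: it assumes the base learners' decisions are conditionally independent given the true class, which the conditional joint probability treatment described in the abstract does not require. The function names and the example rates are hypothetical.

```python
from itertools import product
from math import prod


def or_rule_rate(rates):
    """P(at least one learner votes positive), assuming independent votes."""
    return 1.0 - prod(1.0 - r for r in rates)


def majority_rate(rates):
    """P(strictly more than half of the learners vote positive),
    assuming independent votes. Enumerates all vote patterns, so it is
    only practical for a small number of learners."""
    n = len(rates)
    total = 0.0
    for votes in product((0, 1), repeat=n):
        if 2 * sum(votes) > n:
            total += prod(r if v else 1.0 - r for r, v in zip(rates, votes))
    return total


# Hypothetical per-learner rates (illustrative values, not from the paper).
tpr = [0.6, 0.7, 0.8]  # P(vote positive | sample is positive)
fpr = [0.1, 0.2, 0.1]  # P(vote positive | sample is negative)

for name, rule in (("disjunction", or_rule_rate), ("majority", majority_rate)):
    print(f"{name}: TPR={rule(tpr):.3f}  FPR={rule(fpr):.3f}")
```

Feeding the true positive rates into a rule gives the forecast ensemble true positive rate, and feeding the false positive rates gives the forecast false positive rate. Under this independence assumption the disjunction rule can only raise both rates relative to majority voting, which is one way to see why it may be preferable when missed positives are costly.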
Pages: 16
Related papers
  • [1] Dynamic ensemble classification for credit scoring using soft probability
    Feng, Xiaodong
    Xiao, Zhi
    Zhong, Bo
    Qiu, Jing
    Dong, Yuanxiang
    APPLIED SOFT COMPUTING, 2018, 65 : 139 - 151
  • [2] Predicting Hospital Readmission: A Joint Ensemble-Learning Model
    Yu, Kaiye
    Xie, Xiaolei
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2020, 24 (02) : 447 - 456
  • [3] Enhancing Student's Performance Classification Using Ensemble Modeling
    Nafea A.A.
    Mishlish M.
    Shaban A.M.S.
    AL-Ani M.M.
    Ali Alheeti K.M.
    Mohammed H.J.
    Iraqi Journal for Computer Science and Mathematics, 2023, 4 (04) : 204 - 214
  • [4] Twin SVM for conditional probability estimation in binary and multiclass classification
    Shao, Yuan-Hai
    Lv, Xiao-Jing
    Huang, Ling-Wei
    Bai, Lan
    PATTERN RECOGNITION, 2023, 136
  • [5] Probability-Weighted Voting Ensemble Learning for Classification Model
    Rojarath, Artitayapron
    Songpan, Wararat
    JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2020, 11 (04) : 217 - 227
  • [6] Predicting and Interpreting Student Performance Using Ensemble Models and Shapley Additive Explanations
    Sahlaoui, Hayat
    Alaoui, El Arbi Abdellaoui
    Nayyar, Anand
    Agoujil, Said
    Jaber, Mustafa Musa
    IEEE ACCESS, 2021, 9 : 152688 - 152703
  • [7] An Ensemble Classification Approach Using Improvised Attribute Selection
    Memon, Muhammad Qasim
    Qu, Shengquan
    Lu, Yu
    Memon, Aasma
    Memon, Abdul Rehman
    2021 22ND INTERNATIONAL ARAB CONFERENCE ON INFORMATION TECHNOLOGY (ACIT), 2021, : 606 - 610
  • [8] Conditional probability estimation based classification with class label missing at random
    Sheng, Ying
    Wang, Qihua
    JOURNAL OF MULTIVARIATE ANALYSIS, 2020, 176
  • [9] Dengue symptoms classification analysis with improved conditional probability decision analysis
    Babu, D. Suresh
    Raju, B.
    Swapna, S.
    Kolluri, Johnson
    Ramesh, D.
    Bonagiri, Rajitha
    APPLIED NANOSCIENCE, 2022, 13 (4) : 3085 - 3093
  • [10] Dengue symptoms classification analysis with improved conditional probability decision analysis
    D. Suresh Babu
    B. Raju
    S. Swapna
    Johnson Kolluri
    D. Ramesh
    Rajitha Bonagiri
    Applied Nanoscience, 2023, 13 : 3085 - 3093