Prediction of rhinitis with class imbalance based on heterogeneous ensemble learning

被引:0
作者
Yang, Jingdong [1 ]
Jiang, Biao [1 ]
Qiu, Zehao [1 ]
Meng, Yifei [2 ]
Zhang, Xiaolin [3 ]
Yu, Shaoqing [3 ]
Dai, Fu [4 ]
Qian, Yue [4 ]
机构
[1] Univ Shanghai Sci & Technol, Sch Opt Elect & Comp Engn, Shanghai, Peoples R China
[2] Tongji Univ, Sch Elect & Informat Engn, Shanghai, Peoples R China
[3] Tongji Univ, Tongji Hosp, Sch Med, Dept Otorhinolaryngol Head & Neck Surg, Shanghai, Peoples R China
[4] Antin Hosp, Dept Otorhinolaryngol, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
Allergic rhinitis; ensemble learning; base learner; multiple-label classification; heterogeneous integrated structure; MODEL;
D O I
10.1080/10255842.2024.2339461
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Common clinical rhinitis is characterized by different types of cases and class imbalance. Its prediction belongs to multiple output classification. Low recognition rate and poor generalization performance often occur for minority class. Therefore, we propose a novel integrated classification model, ARF-OOBEE, which transforms the multi-output classification to multi-label classification and multi-class classification. The multi-label classifier automatically adjusts the number and depth of integrated forest learners according to the imbalance ratio of single class label in a subset. It can effectively reduce the impact of class imbalance on classification and improve prediction performance of both majority or minority class concurrently. Also, we build a multi-class classification based on out-of-bag Extra-Tree to accomplish finer classification for the predicted labels. In addition, we calculate the feature importance for rhinitis on the grounds of the purity of nodes in decision-making tree inside Random Forest and study the correlation between rhinitis features. We conduct 12 folds cross-validation experiments on 461 cases of clinical rhinitis. The outcomes show that the evaluation indicators of ARF-OOBEE, such as Sensitivity, Specificity, Accuracy, F1-Score, AUC, and G-Mean are 74.9%,86.5%,92.0%,78.3%,95.3%, and 79.9%, respectively. In comparison to the other methods, ARF-OOBEE has better evaluation indicator and is more effective for the early clinical diagnosis of rhinitis.
引用
收藏
页数:16
相关论文
共 30 条
[1]   Machine learning and artificial intelligence in the service of medicine: Necessity or potentiality? [J].
Alsuliman, Tamim ;
Humaidan, Dania ;
Sliman, Layth .
CURRENT RESEARCH IN TRANSLATIONAL MEDICINE, 2020, 68 (04) :245-251
[2]  
Belal TA., 2023, 2023 INT C ELECT COM, P1, DOI [10.1109/ECCE57851.2023.10101588, DOI 10.1109/ECCE57851.2023.10101588]
[3]  
Cao L., 2016, An over-sampling method based on probability density estimation for imbalanced datasets classification. ICIIP 2016. 2016 International Conference on Intelligent Information Processing. Dec 2325, P1, DOI [10.1145/3028842.3028886, DOI 10.1145/3028842.3028886]
[4]   An evolutionary framework for machine learning applied to medical data [J].
Castellanos-Garzon, Jose A. ;
Costa, Ernesto ;
Luis Jaimes, Jose S. ;
Corchado, Juan M. .
KNOWLEDGE-BASED SYSTEMS, 2019, 185
[5]  
Chandrashekhar S, 2022, Arxiv, DOI arXiv:2201.05469
[6]   SMOTE: Synthetic minority over-sampling technique [J].
Chawla, Nitesh V. ;
Bowyer, Kevin W. ;
Hall, Lawrence O. ;
Kegelmeyer, W. Philip .
2002, American Association for Artificial Intelligence (16)
[7]   Serum IgE and eosinophil count in allergic rhinitis-Analysis using a modified Bayes' theorem [J].
Demirjian, M. ;
Rumbyrt, J. S. ;
Gowda, V. C. ;
Klaustermeyer, W. B. .
ALLERGOLOGIA ET IMMUNOPATHOLOGIA, 2012, 40 (05) :281-287
[8]  
Fan W, 1999, MACHINE LEARNING, PROCEEDINGS, P97
[9]   Integrating TANBN with cost sensitive classification algorithm for imbalanced data in medical diagnosis [J].
Gan, Dan ;
Shen, Jiang ;
An, Bang ;
Xu, Man ;
Liu, Na .
COMPUTERS & INDUSTRIAL ENGINEERING, 2020, 140
[10]   ADASYN: Adaptive Synthetic Sampling Approach for Imbalanced Learning [J].
He, Haibo ;
Bai, Yang ;
Garcia, Edwardo A. ;
Li, Shutao .
2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, 2008, :1322-1328