Prediction of rhinitis with class imbalance based on heterogeneous ensemble learning

被引:0
作者
Yang, Jingdong [1 ]
Jiang, Biao [1 ]
Qiu, Zehao [1 ]
Meng, Yifei [2 ]
Zhang, Xiaolin [3 ]
Yu, Shaoqing [3 ]
Dai, Fu [4 ]
Qian, Yue [4 ]
机构
[1] Univ Shanghai Sci & Technol, Sch Opt Elect & Comp Engn, Shanghai, Peoples R China
[2] Tongji Univ, Sch Elect & Informat Engn, Shanghai, Peoples R China
[3] Tongji Univ, Tongji Hosp, Sch Med, Dept Otorhinolaryngol Head & Neck Surg, Shanghai, Peoples R China
[4] Antin Hosp, Dept Otorhinolaryngol, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
Allergic rhinitis; ensemble learning; base learner; multiple-label classification; heterogeneous integrated structure; MODEL;
D O I
10.1080/10255842.2024.2339461
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Common clinical rhinitis is characterized by different types of cases and class imbalance. Its prediction belongs to multiple output classification. Low recognition rate and poor generalization performance often occur for minority class. Therefore, we propose a novel integrated classification model, ARF-OOBEE, which transforms the multi-output classification to multi-label classification and multi-class classification. The multi-label classifier automatically adjusts the number and depth of integrated forest learners according to the imbalance ratio of single class label in a subset. It can effectively reduce the impact of class imbalance on classification and improve prediction performance of both majority or minority class concurrently. Also, we build a multi-class classification based on out-of-bag Extra-Tree to accomplish finer classification for the predicted labels. In addition, we calculate the feature importance for rhinitis on the grounds of the purity of nodes in decision-making tree inside Random Forest and study the correlation between rhinitis features. We conduct 12 folds cross-validation experiments on 461 cases of clinical rhinitis. The outcomes show that the evaluation indicators of ARF-OOBEE, such as Sensitivity, Specificity, Accuracy, F1-Score, AUC, and G-Mean are 74.9%,86.5%,92.0%,78.3%,95.3%, and 79.9%, respectively. In comparison to the other methods, ARF-OOBEE has better evaluation indicator and is more effective for the early clinical diagnosis of rhinitis.
引用
收藏
页数:16
相关论文
共 30 条
[21]  
Ma ZC., 2019, Journal of Hangzhou Dianzi University (Natural Science), V39, P1, DOI [10.13954/j.cnki.hdu.2019.03.001, DOI 10.13954/J.CNKI.HDU.2019.03.001]
[22]  
Perry T, 2015, IEEE C EVOL COMPUTAT, P680, DOI 10.1109/CEC.2015.7256956
[23]   Hybrid prediction model with missing value imputation for medical data [J].
Purwar, Archana ;
Singh, Sandeep Kumar .
EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (13) :5621-5631
[24]   A top-down supervised learning approach to hierarchical multi-label classification in networks [J].
Romero, Miguel ;
Finke, Jorge ;
Rocha, Camilo .
APPLIED NETWORK SCIENCE, 2022, 7 (01)
[25]  
Sen S, 2016, 2016 3RD INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND INTEGRATED NETWORKS (SPIN), P210, DOI 10.1109/SPIN.2016.7566690
[26]  
Tsai MT, 2010, LECT NOTES COMPUT SC, V6196, P500, DOI 10.1007/978-3-642-14031-0_53
[27]  
Wang L, 2023, Arxiv, DOI arXiv:2303.01064
[28]   Diversity Analysis on Imbalanced Data Sets by Using Ensemble Models [J].
Wang, Shuo ;
Yao, Xin .
2009 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DATA MINING, 2009, :324-331
[29]  
Weng C.G., 2008, P 7 AUSTR DAT MIN C, V87, P27
[30]   Performing Sparse Regularization and Dimension Reduction Simultaneously in Multimodal Data Fusion [J].
Yang, Zhengshi ;
Zhuang, Xiaowei ;
Bird, Christopher ;
Sreenivasan, Karthik ;
Mishra, Virendra ;
Banks, Sarah ;
Cordes, Dietmar .
FRONTIERS IN NEUROSCIENCE, 2019, 13