Scalable kernel-based SVM classification algorithm on imbalance air quality data for proficient healthcare

Cited by: 0
Authors
Shwet Ketu
Pramod Kumar Mishra
Affiliation
[1] Banaras Hindu University, Department of Computer Science, Institute of Science
Source
Complex & Intelligent Systems | 2021 / Vol. 7
Keywords
Air quality; Classification; Proficient healthcare; Scalable kernel-based SVM; Imbalance data
DOI
Not available
Abstract
Over the last decade, air pollution levels have changed drastically, making air pollution a critical environmental issue that must be handled carefully when designing solutions for proficient healthcare. Reducing the impact of air pollution on human health is possible only if air quality data are classified correctly. Many classification problems suffer from class imbalance, and learning from imbalanced data remains a challenging task for which researchers have proposed various solutions over time. In this paper, we focus on handling imbalanced class distributions in a way that does not compromise the performance of the classification algorithm. The proposed algorithm is based on the adjusting kernel scaling (AKS) method and targets multi-class imbalanced datasets. The choice of kernel function is evaluated using weighting criteria and the chi-square test. All experiments are performed on the sensor-based dataset of the Indian Central Pollution Control Board (CPCB). The proposed algorithm achieves the highest accuracy, 99.66%, compared with AdaBoost (59.72%), Multi-Layer Perceptron (95.71%), GaussianNB (80.87%), and SVM (96.92%). Its results are also better than those of methods reported in the existing literature. These results indicate that the proposed algorithm handles class imbalance effectively while also improving classification performance. Accurate air quality classification with the proposed algorithm can therefore help improve existing preventive policies and strengthen emergency response capabilities in the worst pollution situations.
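The abstract names kernel scaling but does not give its formula. As a rough illustration only, the sketch below shows the general idea of rescaling a kernel with a positive, class-dependent factor, K'(x, y) = D(x) D(y) K(x, y). The factor D(x) = exp(-k f(x)^2), the per-class rates derived from class shares, and all function names are assumptions made for this example; they are not taken from the paper, whose actual weighting criteria and chi-square procedure are not described here.

```python
import math


def rbf_kernel(x, y, gamma=0.5):
    """Standard RBF kernel exp(-gamma * ||x - y||^2)."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * sq_dist)


def class_rates(class_counts):
    """Hypothetical per-class rate: the share of each class in the data.

    This is an illustrative choice, not the paper's weighting criterion.
    """
    total = sum(class_counts.values())
    return {label: count / total for label, count in class_counts.items()}


def scaling_factor(decision_value, rate):
    """Assumed positive factor D(x) = exp(-rate * f(x)^2).

    f(x) stands for a decision value from a first-pass classifier;
    D(x) equals 1 when f(x) = 0 and shrinks as |f(x)| grows.
    """
    return math.exp(-rate * decision_value ** 2)


def scaled_kernel(x, y, fx, fy, rate_x, rate_y, gamma=0.5):
    """Rescaled kernel D(x) * D(y) * K(x, y); symmetric because D > 0."""
    return (scaling_factor(fx, rate_x)
            * scaling_factor(fy, rate_y)
            * rbf_kernel(x, y, gamma))


if __name__ == "__main__":
    # Made-up class counts for two air quality categories.
    rates = class_rates({"good": 90, "severe": 10})
    value = scaled_kernel((0.0, 1.0), (1.0, 0.0), 0.5, -1.0,
                          rates["good"], rates["severe"])
    print(rates, value)
```

With these assumed rates, a smaller class gets a smaller rate, so its kernel values shrink more slowly as the decision value grows. Whether that matches the paper's scaling direction cannot be confirmed from the abstract.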
Pages: 2597-2615
Page count: 18