An Empirical Comparison of Supervised Machine Learning Algorithms For Internet of Things Data

Cited by: 0
Authors
Khadse, Vijay [1]
Mahalle, Parikshit N. [1]
Biraris, Swapnil V. [2]
Affiliations
[1] SKN Coll Engn Pune, Dept Comp Engn, Pune, Maharashtra, India
[2] Coll Engn Pune, Dept Comp Engn & IT, Pune, Maharashtra, India
Source
2018 FOURTH INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION CONTROL AND AUTOMATION (ICCUBEA) | 2018
Keywords
Internet of Things; Machine Learning; Kappa; Confusion Matrix; Cross-Validation; Precision; Recall; F1-score; Class Imbalance; Accuracy
DOI
Not available
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
The Internet of Things (IoT) is a rapidly growing field with a wide range of applications, such as smart cities, smart homes, connected wearables, connected health care, and connected automobiles. These IoT applications generate tremendous amounts of data, which need to be analyzed to draw the inferences required to optimize application performance. Artificial intelligence (AI) and machine learning (ML) play a significant role in building smart IoT systems. The main objective of the paper is a comprehensive analysis of five well-known supervised machine learning algorithms on IoT datasets. The five classifiers are K-Nearest Neighbors (KNN), Naive Bayes (NB), Decision Tree (DT), Random Forest (RF), and Logistic Regression (LR). Feature reduction is performed using principal component analysis (PCA). The five classifiers are compared with respect to six characteristics of the IoT datasets: size, number of features, number of classes, class imbalance, missing values, and execution time. The classifiers are also compared on several performance metrics: precision, recall, F1-score, kappa, and accuracy. According to the results, the DT classifier gives the best accuracy, 99%, among all the algorithms for all datasets. The results also show that RF and KNN perform almost the same, and that NB and LR perform the worst among all the classifiers.
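The metrics named in the abstract (precision, recall, F1-score, kappa, and accuracy) can all be derived from a confusion matrix. The following is a minimal sketch in plain Python, not the authors' code: the function name `classification_metrics`, the labels `"normal"` and `"fault"`, and the toy data are hypothetical and chosen only to show why kappa is useful alongside accuracy when classes are imbalanced.

```python
from collections import Counter


def classification_metrics(y_true, y_pred, positive):
    """Compute precision, recall, F1, accuracy and Cohen's kappa.

    Precision, recall and F1 are reported for the `positive` label only;
    accuracy and kappa are computed over all labels.
    """
    n = len(y_true)
    labels = sorted(set(y_true) | set(y_pred))
    # Confusion matrix stored as counts of (true label, predicted label) pairs.
    counts = Counter(zip(y_true, y_pred))

    tp = counts[(positive, positive)]
    fp = sum(counts[(t, positive)] for t in labels if t != positive)
    fn = sum(counts[(positive, p)] for p in labels if p != positive)

    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    accuracy = sum(counts[(label, label)] for label in labels) / n

    # Cohen's kappa: observed agreement corrected for the agreement
    # expected by chance, given the label frequencies.
    true_totals = Counter(y_true)
    pred_totals = Counter(y_pred)
    expected = sum(true_totals[l] * pred_totals[l] for l in labels) / (n * n)
    kappa = (accuracy - expected) / (1 - expected) if expected != 1 else 0.0

    return {"precision": precision, "recall": recall, "f1": f1,
            "accuracy": accuracy, "kappa": kappa}


# Hypothetical imbalanced data: 90 "normal" readings and 10 "fault" readings,
# scored against a classifier that always predicts "normal".
y_true = ["normal"] * 90 + ["fault"] * 10
y_pred = ["normal"] * 100
print(classification_metrics(y_true, y_pred, positive="fault"))
```

In this toy case accuracy is 0.9 even though no fault is ever detected, while kappa, precision, and recall are all 0. This is one reason to report several metrics rather than accuracy alone on imbalanced data.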
Pages: 6