Efficient Classification of Distribution-Based Data for Internet of Things

被引:2
作者
Huang, Jinchao [1 ]
Zhu, Lin [2 ]
Liang, Qilian [3 ]
Fan, Bo [4 ]
Li, Shenghong [1 ]
机构
[1] Shanghai Jiao Tong Univ, Sch Cyber Secur, Sch Elect Informat & Elect Engn, MoE Key Lab Artificial Intelligence,AI Inst, Shanghai 200240, Peoples R China
[2] Ctrip Travel Network Technol Shanghai Co Ltd, Shanghai 200050, Peoples R China
[3] Univ Texas Arlington, Dept Elect Engn, Arlington, TX 76019 USA
[4] Shanghai Jiao Tong Univ, Sch Int & Publ Affairs, Shanghai 200240, Peoples R China
关键词
Internet of Things; online travel agents; distribution-based data; decision making; Bayesian-based estimation;
D O I
10.1109/ACCESS.2018.2879652
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As an important tool of data mining, classification is also one of the major components of the research of Internet of Things (IoT), which has been widely used in many cases, such as smart cities, information abstraction, wireless sensor networks, and so on. IoT could have broader characterization, where diverse data or information could come from ubiquitous and persistent sources. Influenced by various factors, there are a lot of scenes that the data collected from the IoT devices are in the distribution-based form. Therefore, the study of classification for the distribution-based data is very valuable in the field of IoT. To speed up the training process, this paper proposes a new general approach when the types and parameters of distributions are known. It transforms the original problem into a traditional point-valued classification problem with a sampling-based method. Then for the applications that the distribution parameters are not given in advance, this paper also gives an improved approach, which uses a new Bayesian-based method to estimate the distribution parameters. Empirical comparisons conducted on a series of standard benchmark datasets and a real-world dataset from a major Chinese online travel agent site demonstrate that both of our proposed approaches perform better than the existing methods.
引用
收藏
页码:69279 / 69287
页数:9
相关论文
共 30 条
[1]   Internet of Things: A Survey on Enabling Technologies, Protocols, and Applications [J].
Al-Fuqaha, Ala ;
Guizani, Mohsen ;
Mohammadi, Mehdi ;
Aledhari, Mohammed ;
Ayyash, Moussa .
IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2015, 17 (04) :2347-2376
[2]   Analysis of Eight Data Mining Algorithms for Smarter Internet of Things (IoT) [J].
Alam, Funian ;
Mehmood, Rashid ;
Katib, Iyad ;
Albeshri, Aiiad .
7TH INTERNATIONAL CONFERENCE ON EMERGING UBIQUITOUS SYSTEMS AND PERVASIVE NETWORKS (EUSPN 2016)/THE 6TH INTERNATIONAL CONFERENCE ON CURRENT AND FUTURE TRENDS OF INFORMATION AND COMMUNICATION TECHNOLOGIES IN HEALTHCARE (ICTH-2016), 2016, 98 :437-442
[3]  
[Anonymous], P 7 IEEE INT C DAT M
[4]  
Anthopoulos L., 2013, ELECT J E GOVT, V11, P1
[5]   Towards Modelling and Reasoning about Uncertain Data of Sensor Measurements for Decision Support in Smart Spaces [J].
Bamgboye, Oluwaseun ;
Liu, Xiaodong ;
Cruickshank, Peter .
2018 IEEE 42ND ANNUAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE (COMPSAC 2018), VOL 2, 2018, :744-749
[6]  
Chaker Jebari., 2007, Proceedings of the 7th IEEE International Conference on Data Mining Workshops (ICDMW'07), P455
[7]   XGBoost: A Scalable Tree Boosting System [J].
Chen, Tianqi ;
Guestrin, Carlos .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :785-794
[8]  
Chipman H., 2001, Lecture Notes-Monograph Series, P65, DOI DOI 10.1214/LNMS/1215540964
[9]  
Fraley C, 2007, J CLASSIF, V24, P155, DOI 10.1007/s00357-007-0004-z
[10]   A Practical Evaluation of Information Processing and Abstraction Techniques for the Internet of Things [J].
Ganz, Frieder ;
Puschmann, Daniel ;
Barnaghi, Payam ;
Carrez, Francois .
IEEE INTERNET OF THINGS JOURNAL, 2015, 2 (04) :340-354