Support high-order tensor data description for outlier detection in high-dimensional big sensor data

被引:15
作者
Deng, Xiaowu [1 ,2 ,3 ]
Jiang, Peng [1 ]
Peng, Xiaoning [2 ,3 ]
Mi, Chunqiao [2 ,3 ]
机构
[1] Hangzhou Dianzi Univ, Coll Automat, Hangzhou 310018, Zhejiang, Peoples R China
[2] Huaihua Univ, Sch Comp Sci & Engn, Huaihua 418000, Peoples R China
[3] Hunan Prov Key Lab Ecol Agr Intelligent Control T, Huaihua 418000, Peoples R China
来源
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE | 2018年 / 81卷
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Big sensor data; High-dimensional data; Outlier detection; CP factorization; KSTDD; MODELS;
D O I
10.1016/j.future.2017.10.013
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The various high-dimensional sensor data can be collected by wireless sensor networks, video monitoring systems and multimedia sensor networks, while High-dimensional sensor data is inherently large-scale because each sensor node has spatial attributes and may also be associated with large amounts of measurement data evolving over time. Detecting outlier in high-dimensional big sensor data is a challenging task. Most of existing outlier detection methods is based on vector representation. However, high-dimensional sensor data is naturally described by tensor representations. The vector-based methods can lead to destroy original structural information and correlation for high-dimensional sensors data, result in the problem of curse of dimensionality, and some outliers cannot be detected. To solve this problem, support high-order tensor data description (STDD) and kernel support high-order tensor data description (KSTDD) are proposed to detect outliers for tensor data. STDD and KSTDD extend support vector data description from vector space to tensor space. KSTDD maintains the structural information of data, avoids the problem caused by the vectorization of tensor data, and improves the performance of outlier detection. Experiments on four sensor datasets show that the proposed method is superior to the traditional vectorized data analysis method. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:177 / 187
页数:11
相关论文
共 44 条
[1]  
[Anonymous], 1980, IDENTIFICATION OUTLI, DOI DOI 10.1007/978-94-015-3994-4
[2]  
[Anonymous], 2014, PAC AS C KNOWL DISC
[3]  
[Anonymous], 2009 3 IEEE INT
[4]  
[Anonymous], 2006, KDD
[5]  
[Anonymous], 2016, TENSORLAB 30
[6]  
Arning Andreas., 1996, A linear method for deviation detection in large databases, P164
[7]  
ASA Section on Statistical Computing, 2014, DAT EXP 2009
[8]  
Barnett V., 1978, Outliers in statistical data
[9]   LOF: Identifying density-based local outliers [J].
Breunig, MM ;
Kriegel, HP ;
Ng, RT ;
Sander, J .
SIGMOD RECORD, 2000, 29 (02) :93-104
[10]   Density weighted support vector data description [J].
Cha, Myungraee ;
Kim, Jun Seok ;
Baek, Jun-Geol .
EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (07) :3343-3350