Support high-order tensor data description for outlier detection in high-dimensional big sensor data

被引:15
作者
Deng, Xiaowu [1 ,2 ,3 ]
Jiang, Peng [1 ]
Peng, Xiaoning [2 ,3 ]
Mi, Chunqiao [2 ,3 ]
机构
[1] Hangzhou Dianzi Univ, Coll Automat, Hangzhou 310018, Zhejiang, Peoples R China
[2] Huaihua Univ, Sch Comp Sci & Engn, Huaihua 418000, Peoples R China
[3] Hunan Prov Key Lab Ecol Agr Intelligent Control T, Huaihua 418000, Peoples R China
来源
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE | 2018年 / 81卷
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Big sensor data; High-dimensional data; Outlier detection; CP factorization; KSTDD; MODELS;
D O I
10.1016/j.future.2017.10.013
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The various high-dimensional sensor data can be collected by wireless sensor networks, video monitoring systems and multimedia sensor networks, while High-dimensional sensor data is inherently large-scale because each sensor node has spatial attributes and may also be associated with large amounts of measurement data evolving over time. Detecting outlier in high-dimensional big sensor data is a challenging task. Most of existing outlier detection methods is based on vector representation. However, high-dimensional sensor data is naturally described by tensor representations. The vector-based methods can lead to destroy original structural information and correlation for high-dimensional sensors data, result in the problem of curse of dimensionality, and some outliers cannot be detected. To solve this problem, support high-order tensor data description (STDD) and kernel support high-order tensor data description (KSTDD) are proposed to detect outliers for tensor data. STDD and KSTDD extend support vector data description from vector space to tensor space. KSTDD maintains the structural information of data, avoids the problem caused by the vectorization of tensor data, and improves the performance of outlier detection. Experiments on four sensor datasets show that the proposed method is superior to the traditional vectorized data analysis method. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:177 / 187
页数:11
相关论文
共 50 条
  • [21] Projected outlier detection in high-dimensional mixed-attributes data set
    Ye, Mao
    Li, Xue
    Orlowska, Maria E.
    EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (03) : 7104 - 7113
  • [22] VOA*: Fast Angle-Based Outlier Detection over High-Dimensional Data Streams
    Khalique, Vijdan
    Kitagawa, Hiroyuki
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2021, PT I, 2021, 12712 : 40 - 52
  • [23] Outlier detection toward high-dimensional industrial data using extreme tensor-train learning machine with compression
    Deng, Xiaowu
    Shi, Yuanquan
    Yao, Dunhong
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2023, 35 (06)
  • [24] Outlier detection based on variance of angle in high dimensional data
    Liu, Wenting
    SIXTH INTERNATIONAL CONFERENCE ON ELECTRONICS AND INFORMATION ENGINEERING, 2015, 9794
  • [25] PCA leverage: outlier detection for high-dimensional functional magnetic resonance imaging data
    Mejia, Amanda F.
    Nebel, Mary Beth
    Eloyan, Ani
    Caffo, Brian
    Lindquist, Martin A.
    BIOSTATISTICS, 2017, 18 (03) : 521 - 536
  • [26] An Ensemble Outlier Detection Method Based on Information Entropy-Weighted Subspaces for High-Dimensional Data
    Li, Zihao
    Zhang, Liumei
    ENTROPY, 2023, 25 (08)
  • [27] High-dimensional outlier detection using random projections
    P. Navarro-Esteban
    J. A. Cuesta-Albertos
    TEST, 2021, 30 : 908 - 934
  • [28] Parallel coordinate order for high-dimensional data
    Tilouche, Shaima
    Partovi Nia, Vahid
    Bassetto, Samuel
    STATISTICAL ANALYSIS AND DATA MINING, 2021, 14 (05) : 501 - 515
  • [29] A fast outlier detection strategy for distributed high-dimensional data sets with mixed attributes
    Koufakou, Anna
    Georgiopoulos, Michael
    DATA MINING AND KNOWLEDGE DISCOVERY, 2010, 20 (02) : 259 - 289
  • [30] A fast outlier detection strategy for distributed high-dimensional data sets with mixed attributes
    Anna Koufakou
    Michael Georgiopoulos
    Data Mining and Knowledge Discovery, 2010, 20 : 259 - 289