Support high-order tensor data description for outlier detection in high-dimensional big sensor data

被引:15
作者
Deng, Xiaowu [1 ,2 ,3 ]
Jiang, Peng [1 ]
Peng, Xiaoning [2 ,3 ]
Mi, Chunqiao [2 ,3 ]
机构
[1] Hangzhou Dianzi Univ, Coll Automat, Hangzhou 310018, Zhejiang, Peoples R China
[2] Huaihua Univ, Sch Comp Sci & Engn, Huaihua 418000, Peoples R China
[3] Hunan Prov Key Lab Ecol Agr Intelligent Control T, Huaihua 418000, Peoples R China
来源
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE | 2018年 / 81卷
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Big sensor data; High-dimensional data; Outlier detection; CP factorization; KSTDD; MODELS;
D O I
10.1016/j.future.2017.10.013
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The various high-dimensional sensor data can be collected by wireless sensor networks, video monitoring systems and multimedia sensor networks, while High-dimensional sensor data is inherently large-scale because each sensor node has spatial attributes and may also be associated with large amounts of measurement data evolving over time. Detecting outlier in high-dimensional big sensor data is a challenging task. Most of existing outlier detection methods is based on vector representation. However, high-dimensional sensor data is naturally described by tensor representations. The vector-based methods can lead to destroy original structural information and correlation for high-dimensional sensors data, result in the problem of curse of dimensionality, and some outliers cannot be detected. To solve this problem, support high-order tensor data description (STDD) and kernel support high-order tensor data description (KSTDD) are proposed to detect outliers for tensor data. STDD and KSTDD extend support vector data description from vector space to tensor space. KSTDD maintains the structural information of data, avoids the problem caused by the vectorization of tensor data, and improves the performance of outlier detection. Experiments on four sensor datasets show that the proposed method is superior to the traditional vectorized data analysis method. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:177 / 187
页数:11
相关论文
共 50 条
  • [41] Manifold-based denoising, outlier detection, and dimension reduction algorithm for high-dimensional data
    Guanghua Zhao
    Tao Yang
    Dongmei Fu
    International Journal of Machine Learning and Cybernetics, 2023, 14 : 3923 - 3942
  • [42] High-dimensional data monitoring using support machines
    Maboudou-Tchao, Edgard M.
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2021, 50 (07) : 1927 - 1942
  • [43] Anomaly detection for high-dimensional data using a novel autoencoder-support vector machine
    Jiang, Zhuo
    Huang, Xiao
    Wang, Rongbin
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 45 (06) : 9457 - 9469
  • [44] Outlier Detection Using Improved Support Vector Data Description in Wireless Sensor Networks
    Shi, Pei
    Li, Guanghui
    Yuan, Yongming
    Kuang, Liang
    SENSORS, 2019, 19 (21)
  • [45] Outlier detection for high dimensional data using the Comedian approach
    Sajesh, T. A.
    Srinivasan, M. R.
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2012, 82 (05) : 745 - 757
  • [46] Vibration-Based Outlier Detection on High Dimensional Data
    Xia, Shuyin
    Wang, Guoyin
    Yu, Hong
    Liu, Qun
    Wang, Jin
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2016, 25 (03)
  • [47] An effective and efficient algorithm for high-dimensional outlier detection
    Charu C. Aggarwal
    Philip S. Yu
    The VLDB Journal, 2005, 14 : 211 - 221
  • [48] An effective and efficient algorithm for high-dimensional outlier detection
    Aggarwal, CC
    Yu, PS
    VLDB JOURNAL, 2005, 14 (02) : 211 - 221
  • [49] Feature Extraction for Outlier Detection in High-Dimensional Spaces
    Hoang Vu Nguyen
    Gopalkrishnan, Vivekanand
    PROCEEDINGS OF THE FOURTH INTERNATIONAL WORKSHOP ON FEATURE SELECTION IN DATA MINING, 2010, 10 : 66 - 75
  • [50] Binary Gravitational Subspace Search for Outlier Detection in High Dimensional Data Streams
    Souiden, Imen
    Brahmi, Zaki
    Omri, Mohamed Nazih
    ADVANCED DATA MINING AND APPLICATIONS, ADMA 2022, PT II, 2022, 13726 : 157 - 169