Support high-order tensor data description for outlier detection in high-dimensional big sensor data

Cited by: 15
Authors
Deng, Xiaowu [1, 2, 3]
Jiang, Peng [1]
Peng, Xiaoning [2, 3]
Mi, Chunqiao [2, 3]
Affiliations
[1] Hangzhou Dianzi Univ, Coll Automat, Hangzhou 310018, Zhejiang, Peoples R China
[2] Huaihua Univ, Sch Comp Sci & Engn, Huaihua 418000, Peoples R China
[3] Hunan Prov Key Lab Ecol Agr Intelligent Control T, Huaihua 418000, Peoples R China
Source
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE | 2018 / Vol. 81
Funding
National Natural Science Foundation of China; National Key Research and Development Program of China;
Keywords
Big sensor data; High-dimensional data; Outlier detection; CP factorization; KSTDD; MODELS;
DOI
10.1016/j.future.2017.10.013
Chinese Library Classification number
TP301 [Theory, Methods];
Discipline classification code
081202 ;
Abstract
High-dimensional sensor data can be collected by wireless sensor networks, video monitoring systems, and multimedia sensor networks. Such data is inherently large-scale because each sensor node has spatial attributes and may also be associated with large amounts of measurement data that evolve over time. Detecting outliers in high-dimensional big sensor data is a challenging task. Most existing outlier detection methods are based on vector representations. However, high-dimensional sensor data is naturally described by tensor representations. Vector-based methods can destroy the original structural information and correlations in high-dimensional sensor data, suffer from the curse of dimensionality, and fail to detect some outliers. To solve this problem, support high-order tensor data description (STDD) and kernel support high-order tensor data description (KSTDD) are proposed to detect outliers in tensor data. STDD and KSTDD extend support vector data description from vector space to tensor space. KSTDD maintains the structural information of the data, avoids the problems caused by vectorizing tensor data, and improves outlier detection performance. Experiments on four sensor datasets show that the proposed method is superior to the traditional vectorized data analysis method. (C) 2017 Elsevier B.V. All rights reserved.
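The contrast the abstract draws between vectorized and tensor representations can be illustrated with a small NumPy sketch of CP (CANDECOMP/PARAFAC) structure, which appears among the keywords. This is not the authors' STDD or KSTDD algorithm; it only shows a general property of CP-form tensors: their inner product can be computed from the factor matrices alone, without flattening the tensor. The tensor shape, the rank, and the helper names `cp_reconstruct` and `cp_inner` are assumptions chosen for illustration.

```python
import numpy as np

def cp_reconstruct(A, B, C):
    """Rebuild a 3rd-order tensor from CP factor matrices (R columns each)."""
    return np.einsum("ir,jr,kr->ijk", A, B, C)

def cp_inner(f1, f2):
    """Inner product of two CP-form tensors using only their factors."""
    A1, B1, C1 = f1
    A2, B2, C2 = f2
    # Elementwise product of the mode-wise Gram matrices, summed over
    # all pairs of rank-1 components.
    return np.sum((A1.T @ A2) * (B1.T @ B2) * (C1.T @ C2))

rng = np.random.default_rng(0)
shape, rank = (6, 5, 4), 2  # hypothetical sensor tensor size and CP rank

f1 = tuple(rng.standard_normal((n, rank)) for n in shape)
f2 = tuple(rng.standard_normal((n, rank)) for n in shape)
X, Y = cp_reconstruct(*f1), cp_reconstruct(*f2)

# Vectorized route: flatten both tensors, then take a dot product.
full_inner = float(np.dot(X.ravel(), Y.ravel()))
# Tensor route: work mode by mode on the factors.
factored_inner = float(cp_inner(f1, f2))

dense_params = X.size          # entries in the flattened vector
cp_params = rank * sum(shape)  # entries in the CP factors
```

Both routes give the same inner product, but the factored form keeps each mode separate and stores far fewer numbers than the flattened vector, which is one reason tensor-aware methods can sidestep part of the dimensionality problem that vectorization creates.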
Pages: 177-187
Number of pages: 11