Data anomaly detection and Data fusion based on Incremental Principal Component Analysis in Fog Computing

被引:3
|
作者
Yu, Xue-Yong [1 ,2 ]
Guo, Xin-Hui [1 ,2 ]
机构
[1] Nanjing Univ Posts & Telecommun, Jiangsu Key Lab Wireless Commun, Nanjing 210003, Peoples R China
[2] Nanjing Univ Posts & Telecommun, Minist Educ, Engn Res Ctr Hlth Serv Syst Based Ubiquitous Wire, Nanjing 210003, Peoples R China
来源
KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS | 2020年 / 14卷 / 10期
关键词
Incremental Principal Component Analysis; Offline and real-time learning; Fog Computing; Data anomaly detection;
D O I
10.3837/tiis.2020.10.004
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The intelligent agriculture monitoring is based on the perception and analysis of environmental data, which enables the monitoring of the production environment and the control of environmental regulation equipment. As the scale of the application continues to expand, a large amount of data will be generated from the perception layer and uploaded to the cloud service, which will bring challenges of insufficient bandwidth and processing capacity. A fog-based offline and real-time hybrid data analysis architecture was proposed in this paper, which combines offline and real-time analysis to enable real-time data processing on resource-constrained IoT devices. Furthermore, we propose a data process-ing algorithm based on the incremental principal component analysis, which can achieve data dimensionality reduction and update of principal components. We also introduce the concept of Squared Prediction Error (SPE) value and realize the abnormal detection of data through the combination of SPE value and data fusion algorithm. To ensure the accuracy and effectiveness of the algorithm, we design a regular-SPE hybrid model update strategy, which enables the principal component to be updated on demand when data anomalies are found. In addition, this strategy can significantly reduce resource consumption growth due to the data analysis architectures. Practical datasets-based simulations have confirmed that the proposed algorithm can perform data fusion and exception processing in real-time on resource-constrained devices; Our model update strategy can reduce the overall system resource consumption while ensuring the accuracy of the algorithm.
引用
收藏
页码:3989 / 4006
页数:18
相关论文
共 50 条
  • [1] INCREMENTAL PRINCIPAL COMPONENT ANALYSIS BASED OUTLIER DETECTION METHODS FOR SPATIOTEMPORAL DATA STREAMS
    Bhushan, Alka
    Sharker, Monir H.
    Karimi, Hassan A.
    ISPRS INTERNATIONAL WORKSHOP ON SPATIOTEMPORAL COMPUTING, 2015, : 67 - 71
  • [2] An incremental principal component analysis for chunk data
    Ozawa, Seiichi
    Pang, Shaoning
    Kasabov, Nikola
    2006 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-5, 2006, : 2278 - +
  • [3] Probabilistic principal component analysis-based anomaly detection for structures with missing data
    Ma, Zhi
    Yun, Chung-Bang
    Wan, Hua-Ping
    Shen, Yanbin
    Yu, Feng
    Luo, Yaozhi
    STRUCTURAL CONTROL & HEALTH MONITORING, 2021, 28 (05):
  • [4] Anomaly Detection Based on Kernel Principal Component and Principal Component Analysis
    Wang, Wei
    Zhang, Min
    Wang, Dan
    Jiang, Yu
    Li, Yuliang
    Wu, Hongda
    COMMUNICATIONS, SIGNAL PROCESSING, AND SYSTEMS, 2019, 463 : 2222 - 2228
  • [5] Anomaly detection based on kernel principal component and principal component analysis
    Wang, Wei
    Zhang, Min
    Wang, Dan
    Jiang, Yu
    Li, Yuliang
    Wu, Hongda
    Lecture Notes in Electrical Engineering, 2019, 463 : 2222 - 2228
  • [6] A Fast Incremental Kernel Principal Component Analysis for Data Streams
    Joseph, Annie Anak
    Ozawa, Seiichi
    PROCEEDINGS OF THE 2014 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2014, : 3135 - 3142
  • [7] Assessing mineral profiles for rice flour fraud detection by principal component analysis based data fusion
    Perez-Rodriguez, Michael
    Maia Dirchwolf, Pamela
    Rodriguez-Negrin, Zenaida
    Gerardo Pellerano, Roberto
    FOOD CHEMISTRY, 2021, 339
  • [8] Soil Clustering and Anomaly Detection Based on EPBM Data Using Principal Component Analysis and Local Outlier Factor
    Apoji, Dayu
    Soga, Kenichi
    GEO-RISK 2023: DEVELOPMENTS IN RELIABILITY, RISK, AND RESILIENCE, 2023, 346 : 1 - 11
  • [9] Online classification framework for data stream based on incremental kernel principal component analysis
    Wu F.
    Zhong Y.
    Wu Q.-Y.
    Zidonghua Xuebao/ Acta Automatica Sinica, 2010, 36 (04): : 534 - 542
  • [10] Efficient incremental authentication for the updated data in fog computing
    Wang, Fenghe
    Wang, Junquan
    Yang, Wenfeng
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2021, 114 : 130 - 137