Expect the Unexpected: Unsupervised Feature Selection for Automated Sensor Anomaly Detection

被引:28
作者
Teh, Hui Yie [1 ]
Wang, Kevin I-Kai [1 ]
Kempa-Liehr, Andreas W. [2 ,3 ]
机构
[1] Univ Auckland, Dept Elect Comp & Software Engn, Auckland 1010, New Zealand
[2] Univ Auckland, Dept Engn Sci, Auckland 1010, New Zealand
[3] Univ Freiburg, Freiburg Mat Res Ctr, D-79085 Freiburg, Germany
关键词
Sensors; Anomaly detection; Feature extraction; Principal component analysis; Calibration; Time series analysis; Internet of Things; feature selection; principal component analysis; sensor data quality; time series analysis; PRINCIPAL COMPONENT ANALYSIS; FAULT-DIAGNOSIS; NETWORKS; QUALITY; SVM;
D O I
10.1109/JSEN.2021.3084970
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
With the growth of IoT applications, sensor data quality has become increasingly important to ensure the success of these data-driven applications. Sensor data riddled with errors are redundant and affects the accuracy of decision-making results. Moreover, several constraints are inherent in anomaly detection for IoT applications such as limited manpower, time, bandwidth, computational resources and the lack of labelled datasets. Most machine learning algorithms also require a large training set to fit a satisfactory model. In this paper, we propose a fully automated anomaly detection framework, which combines systematic time series feature engineering with unsupervised feature selection. The unsupervised feature selection approach automatically selects time series features based on their predictive power with respect to the statistics of near future measurements. The proposed framework also only needs a short calibration phase with respect to the deployment phase to detect anomalies despite not having seen any in training, which is critical for real-world applications. Its feature selection reduces the amount of data required in the deployment phase on an IoT application. The selected features are suitable for building a reliable anomaly detection model while achieving a similar or better anomaly detection performance than established methods, which are operating on the raw data. Results of the evaluation on two publicly available environmental monitoring datasets show that our proposed unsupervised feature selection approach is a crucial step for having a more accurate anomaly detection while providing complex application-specific time series features, which are safeguarding the sensor system against unseen sensor anomalies.
引用
收藏
页码:18033 / 18046
页数:14
相关论文
共 56 条
[1]  
Abuaitah G. R., 2012, P 2012 IEEE 9 INT C, P1
[2]  
Aggarwal CC, 2014, CH CRC DATA MIN KNOW, P1
[3]   Unsupervised real-time anomaly detection for streaming data [J].
Ahmad, Subutai ;
Lavin, Alexander ;
Purdy, Scott ;
Agha, Zuha .
NEUROCOMPUTING, 2017, 262 :134-147
[4]  
Alawi A, 2007, FAULT DETECTION FAUL, P908, DOI [10.1016/B978-008044485-7/50153-6, DOI 10.1016/B978-008044485-7/50153-6]
[5]  
Barde A., 2018, SSRN Electron. J, DOI [10.2139/ssrn.3167286, DOI 10.2139/SSRN.3167286]
[6]  
Barrenetxea G., 2019, Sensorscope data
[7]  
Benjamini Y, 2001, ANN STAT, V29, P1165
[8]   Spatial anomaly detection in sensor networks using neighborhood information [J].
Bosman, Hedde H. W. J. ;
Iacca, Giovanni ;
Tejada, Arturo ;
Wortche, Heinrich J. ;
Liotta, Antonio .
INFORMATION FUSION, 2017, 33 :41-56
[9]   Ensembles of incremental learners to detect anomalies in ad hoc sensor networks [J].
Bosman, Hedde H. W. J. ;
Iacca, Giovanni ;
Tejada, Arturo ;
Wortche, Heinrich J. ;
Liotta, Antonio .
AD HOC NETWORKS, 2015, 35 :14-36
[10]   Fall-curve: A novel primitive for IoT Fault Detection and Isolation [J].
Chakraborty, Tusher ;
Nambi, Akshay Uttama ;
Chandra, Ranveer ;
Sharma, Rahul ;
Swaminathan, Manohar ;
Kapetanovic, Zerina ;
Appavoo, Jonathan .
SENSYS'18: PROCEEDINGS OF THE 16TH CONFERENCE ON EMBEDDED NETWORKED SENSOR SYSTEMS, 2018, :95-107