Fault detection and explanation through big data analysis on sensor streams

Cited by: 50
Authors
Manco, Giuseppe [1]
Ritacco, Ettore [1]
Rullo, Pasquale [5]
Gallucci, Lorenzo [4]
Astill, Will [3]
Kimber, Dianne [3]
Antonelli, Marco [2]
Affiliations
[1] CNR, ICAR, Via Bucci 41C, I-87036 Arcavacata Di Rende, CS, Italy
[2] Bombardier Transportat SpA, Vado Ligure, Italy
[3] Bombardier Transportat Ltd, London, England
[4] Exeura Srl, Via PA Cabral, I-87036 Arcavacata Di Rende, CS, Italy
[5] Univ Calabria, Dipartmento Matemat & Informat, Via Bucci 30B, I-87036 Arcavacata Di Rende, CS, Italy
Keywords
Fault detection; Anomaly detection; Outlier explanation; Big data; Sensor data;
DOI
10.1016/j.eswa.2017.05.079
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Fault prediction is an important topic for industry because, by providing effective methods for predictive maintenance, it allows companies to achieve significant time and cost savings. In this paper we describe an application developed to predict and explain door failures on metro trains. The aim was twofold: first, to devise prediction techniques capable of detecting door failures early from diagnostic data; second, to describe failures in terms of the properties that distinguish them from normal behavior. Data pre-processing was a complex task aimed at overcoming a number of issues with the dataset, such as size, sparsity, bias, burst effect and trust. Since the premonitory signals of failure did not share common patterns, but were only characterized as non-normal device signals, fault prediction was performed using outlier detection. Fault explanation was then achieved by exhibiting the device features that show abnormal values. An experimental evaluation was performed to assess the quality of the proposed approach. Results show that high-degree outliers are effective indicators of incipient failures. In addition, explanation in terms of the abnormal feature values responsible for outlierness appears to be quite expressive. Some aspects of the proposed approach deserve particular attention. We introduce a general framework for the failure detection problem based on an abstract model of diagnostic data, along with a formal problem statement. Together they provide the basis for defining an effective data pre-processing technique in which the behavior of a device, in a given time frame, is summarized through a number of suitable statistics. This approach strongly mitigates the issues related to data errors and noise, thus enabling effective outlier detection. All this, in our view, provides the grounds for a general methodology for advanced prognostic systems. (C) 2017 Elsevier Ltd. All rights reserved.
Pages: 141-156
Number of pages: 16