Extracting failure time data from industrial maintenance records using text mining

被引:30
作者
Arif-Uz-Zaman, Iazi [1 ]
Cholette, Michael E. [1 ]
Ma, Lin [1 ]
Karim, Azharul [1 ]
机构
[1] Queensland Univ Technol, Fac Sci & Engn, 2 George St, Brisbane, Qld 4000, Australia
基金
澳大利亚研究理事会;
关键词
Text mining; Work orders analysis; Naive Bayes; Support vector machine; CLASSIFICATION; CATEGORIZATION; KNOWLEDGE; SELECTION; MODELS;
D O I
10.1016/j.aei.2016.11.004
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reliability modelling requires accurate failure time of an asset. In real industrial cases, such data are often buried in different historical databases which were set up for purposes other than reliability modelling. In particular, two data sets are commonly available: work orders (WOs), which detail maintenance activities on the asset, and downtime data (DD), which details when the asset was taken offline. Each is incomplete from a failure perspective, where one wishes to know whether each downtime event was due to failure or scheduled activities. In this paper, a text mining approach is proposed to extract accurate failure time data from WOs and DD. A keyword dictionary is constructed using WO text descriptions and classifiers are constructed and applied to attribute each of the DD events to one of two classes: failure or nonfailure. The proposed method thus identifies downtime events whose descriptions are consistent with urgent unplanned WOs. The applicability of the methodology is demonstrated on maintenance data sets from an Australian electricity and sugar processing companies. Analysis of the text of the identified failure events seems to confirm the accurate identification of failures in DD. The results are expected to be immediately useful in improving the estimation of failure times (and thus the reliability models) for real-world assets. (C) 2016 Elsevier Ltd. All rights reserved.
引用
收藏
页码:388 / 396
页数:9
相关论文
共 29 条
[1]  
Abu-Samah A., 2015, IFAC - Papers Online, V48, P844, DOI 10.1016/j.ifacol.2015.09.632
[2]  
[Anonymous], 59 M SOC MACH FAIL P
[3]  
Basu A, 2003, P 36 HAW INT C SYST
[4]   TCBR-HMM: An HMM-based text classifier with a CBR system [J].
Borrajo, L. ;
Seara Vieira, A. ;
Iglesias, E. L. .
APPLIED SOFT COMPUTING, 2015, 26 :463-473
[5]   Using ontology-based text classification to assist Job Hazard Analysis [J].
Chi, Nai-Wen ;
Lin, Ken-Yu ;
Hsieh, Shang-Hsien .
ADVANCED ENGINEERING INFORMATICS, 2014, 28 (04) :381-394
[6]   Development of a data mining-based analysis framework for multi-attribute construction project information [J].
Chi, Seokho ;
Suk, Sung-Joon ;
Kang, Youngcheol ;
Mulva, Stephen P. .
ADVANCED ENGINEERING INFORMATICS, 2012, 26 (03) :574-581
[7]  
CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411
[8]   Support vector machines for spam categorization [J].
Drucker, H ;
Wu, DH ;
Vapnik, VN .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1999, 10 (05) :1048-1054
[9]  
Edwards B., 2008, AUSTR DAT MIN C
[10]  
Joachims Thorsten, 1998, P 10 EUR C MACH LEAR