Approaches for Using Machine Learning Algorithms with Large Label Sets for Rotorcraft Maintenance

被引:0
作者
Seale, Maria [1 ]
Hines, Amanda [1 ]
Nabholz, Grace [1 ]
Ruvinsky, Alicia [1 ]
Eslinger, Owen [1 ]
Rigoni, Nathan [2 ]
Vega-Maisonet, Luis [3 ]
机构
[1] Us Army Engineer Res & Dev Ctr, 3909 Halls Ferry Rd, Vicksburg, MS 39180 USA
[2] Aviat & Missile Res Dev & Engn Ctr, Huntsville, AL USA
[3] Univ Puerto Rico, PR-108, Mayaguez, PR 00682 USA
来源
2019 IEEE AEROSPACE CONFERENCE | 2019年
关键词
CLASSIFICATION;
D O I
暂无
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
The US Army Aviation and Missile Research, Development, and Engineering Center (AMRDEC), in collaboration with the US Army Engineer Research and Development Center (ERDC), is using machine learning (ML) to transform the way rotorcraft maintenance logbook event data is scored for reporting purposes. Traditionally, human analysts manually inspect data fields completed by maintenance personnel and, using published guidelines in conjunction with their personal expertise, provide sets of labels or scores for each event. These labels are stored with the data and provide valuable insight into maintenance event histories. However, the inequity between the enormous volume of maintenance data generated daily and the ability of analysts to score the data results in only 10% of all data receiving scores; therefore, 90% of the recorded data does not contain this important value added feature. Classification algorithms for automating this scoring process trained on existing labeled data sets have been implemented with promising results. A particularly challenging element of this problem, however, involves the classification of the specific component on which maintenance was performed. Greater than 1200 unique labels exist that can be used to describe a rotorcraft component that is the subject of a maintenance action. Furthermore, the component labels are hierarchically structured, resulting in the occurrence of multiple levels of precision in identifying a component in the expert-labeled data used for training. Although computational efficiency of common classification algorithms has improved considerably, it is still quite challenging to harness these methods for problems that include large numbers of unique class labels. This paper describes several novel strategies for solving this problem, based on hierarchical ensemble models and strategic label set segmentation. Through these approaches, a best overall total component classification accuracy of 96% was achieved, in conjunction with a total per record accuracy for three different label categories of 93%. The approaches implemented to handle the large label set for classification of rotorcraft components, along with classification performance measures, are discussed.
引用
收藏
页数:8
相关论文
共 15 条
[1]  
[Anonymous], 2010, Adv. Neural Informat. Process. Syst.
[2]   A review of instance selection methods [J].
Arturo Olvera-Lopez, J. ;
Ariel Carrasco-Ochoa, J. ;
Francisco Martinez-Trinidad, J. ;
Kittler, Josef .
ARTIFICIAL INTELLIGENCE REVIEW, 2010, 34 (02) :133-143
[3]  
Baker YS, 2013, 2013 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENCE AND SECURITY INFORMATICS: BIG DATA, EMERGENT THREATS, AND DECISION-MAKING IN SECURITY INFORMATICS, P10, DOI 10.1109/ISI.2013.6578776
[4]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[5]   Ensemble methods in machine learning [J].
Dietterich, TG .
MULTIPLE CLASSIFIER SYSTEMS, 2000, 1857 :1-15
[6]  
Godbole Shantanu., 2002, Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining, P513
[7]   Hierarchical document classification using automatically generated hierarchy [J].
Li, Tao ;
Zhu, Shenghuo ;
Ogihara, Mitsunori .
JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2007, 29 (02) :211-230
[8]  
MacQueen J., 1967, PROC 5 BERKELEY S MA, V1, P281
[9]  
Mikolov T., 2013, ADV NEURAL INFORM PR, V26, P3111
[10]  
QinetiQ North America, 2012, UNIT LEV LOG SYST AV