Natural language processing for aviation safety reports: From classification to interactive analysis

Cited by: 105
Authors
Tanguy, Ludovic [1, 2]
Tulechki, Nikola [1, 3, 4]
Urieli, Assaf [1, 3, 4]
Hermann, Eric [4]
Raynal, Celine [4]
Affiliations
[1] CLLE ERSS CNRS, Paris, France
[2] Univ Toulouse, Computat Linguist, Toulouse, France
[3] Univ Toulouse, Toulouse, France
[4] CFH Safety Data, Toulouse, France
Keywords
Safety reports; Aviation; NLP; Document classification; Text mining
DOI
10.1016/j.compind.2015.09.005
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
In this paper we describe the different NLP techniques designed and used in collaboration between the CLLE-ERSS research laboratory and the CFH/Safety Data company to manage and analyse aviation incident reports. These reports are written every time anything abnormal occurs during a civil air flight. Although most of them relate routine problems, they are a valuable source of information about possible sources of greater danger. These texts are written in plain language, show a wide range of linguistic variation (telegraphic style overcrowded by acronyms or standard prose) and exist in different languages, even for a single company/country (although our main focus is on English and French). In addition to their variety, their sheer quantity (e.g. 600/month for a large airline company) clearly requires the use of advanced NLP and text mining techniques in order to extract useful information from them. Although this context and objectives seem to indicate that standard NLP techniques can be applied in a straightforward manner, innovative techniques are required to handle the specifics of aviation report text and the complex classification systems. We present several tools that aim at a better access to this data (classification and information retrieval), and help aviation safety experts in their analyses (data and text mining and interactive analysis). Some of these tools are currently in test or in use both at the national and international levels, by airline companies as well as by regulation authorities (DGAC, EASA, ICAO). © 2015 Elsevier B.V. All rights reserved.
Pages: 80-95
Number of pages: 16