AUTOMATIC SPEECH SEMANTIC RECOGNITION AND VERIFICATION IN AIR TRAFFIC CONTROL

Cited by: 0
Authors
Johnson, Daniel R. [1]
Nenov, Val I. [2]
Espinoza, Gustavo [3]
Affiliations
[1] FAA, Atlantic City, NJ 08405 USA
[2] BrainVentions Corp Inc, Los Angeles, CA USA
[3] Hitec Syst Inc, Egg Harbor Township, NJ USA
Source
2013 IEEE/AIAA 32ND DIGITAL AVIONICS SYSTEMS CONFERENCE (DASC) | 2013
Keywords
DOI
Not available
Chinese Library Classification
V [Aviation, Aerospace];
Subject classification code
08 ; 0825 ;
Abstract
The Federal Aviation Administration (FAA) and BrainVentions Corporation, Inc. worked under a one-year cooperative research agreement to investigate current state-of-the-art Automatic Speech Semantic Recognition (ASSR) technology, measure its accuracy, and explore its potential applications in Air Traffic Control (ATC). The ASSR system used for this study was ValsVox (BrainVentions), a web-based Software as a Service (SaaS) application for real-time speech recognition and semantic parsing of ATC voice communications. Case studies were conducted using the enhanced ValsVox system with recorded audio from previously conducted ATC Human-in-the-Loop (HITL) simulations. The standard speech recognition metric of word error rate was calculated, along with semantic metrics that used the commands issued by the simulation pilots as the "gold standard" interpretation of the voice clearances. ValsVox is designed so that speech recognition is constrained in real time by the semantic parsing, unlike other systems in which a transcript of the utterance is first generated and then semantically parsed. For the phrases that were successfully recognized, the accuracy of the semantic recognition as measured against the "gold standard" was over 80%. These findings suggest that the technology may be ready for follow-on research to incorporate ASSR into ATC decision support tools. For example, such tools might detect mismatches between verbal clearances and keyed commands, or alert users to incorrect or missing read-backs. ASSR could also be used in simulation and training environments, for example by incorporating it into simulation pilot workstations to assist overloaded pilots. In addition, human transcriptionists could use ValsVox-generated transcripts to save time and cost when complete accuracy is needed.
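The abstract names word error rate (WER) as its standard speech recognition metric but does not say how it was scored. As a rough illustration only, the sketch below computes WER in the conventional way: word-level edit distance (substitutions, deletions, insertions) divided by the number of reference words. The function name, the lowercase whitespace tokenization, and the sample clearance are assumptions made for this example and are not taken from the study.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Conventional WER: word-level edit distance / number of reference words.

    Tokenization (lowercase, whitespace split) is an assumption for this sketch.
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")

    # prev[j] = edit distance between the reference words seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1        # reference word missing from hypothesis
            insertion = curr[j - 1] + 1   # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


# Hypothetical clearance, invented for illustration (12 reference words).
reference = "united four five one climb and maintain flight level three five zero"
hypothesis = "united four five one climb maintain flight level three nine zero"
wer = word_error_rate(reference, hypothesis)  # one deletion + one substitution
```

In this made-up example a single substituted digit changes the cleared flight level even though the word-level score stays low, which is one reason a semantic metric scored against the simulation pilots' commands is a useful complement to WER.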
Pages: 14