Automatic Speech Recognition for Air Traffic Control Communications

被引:23
作者
Badrinath, Sandeep [1 ]
Balakrishnan, Hamsa [1 ]
机构
[1] MIT, Dept Aeronaut & Astronaut, Cambridge, MA 02139 USA
关键词
D O I
10.1177/03611981211036359
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
A significant fraction of communications between air traffic controllers and pilots is through speech, via radio channels. Automatic transcription of air traffic control (ATC) communications has the potential to improve system safety, operational performance, and conformance monitoring, and to enhance air traffic controller training. We present an automatic speech recognition model tailored to the ATC domain that can transcribe ATC voice to text. The transcribed text is used to extract operational information such as call-sign and runway number. The models are based on recent improvements in machine learning techniques for speech recognition and natural language processing. We evaluate the performance of the model on diverse datasets.
引用
收藏
页码:798 / 810
页数:13
相关论文
共 35 条
[1]  
Amodei D, 2016, PR MACH LEARN RES, V48
[2]  
Auli, ARXIV PREPRINT ARXIV
[3]  
Collobert R., FASTEST OPEN SOURCE
[4]  
Delpech E, 2018, PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), P2866
[5]   The Application of Hidden Markov Models in Speech Recognition [J].
Gales, Mark ;
Young, Steve .
FOUNDATIONS AND TRENDS IN SIGNAL PROCESSING, 2007, 1 (03) :195-304
[6]   A 1 GHz Frequency-Controlled Class E2 DC/DC Converter for Efficiently Handling Wideband Signal Envelopes [J].
Garcia, J. A. ;
Marante, R. ;
Ruiz, M. N. ;
Hernandez, G. .
2013 IEEE MTT-S INTERNATIONAL MICROWAVE SYMPOSIUM DIGEST (IMS), 2013,
[7]  
Godfrey, 2020, AIR TRAFFIC CONTROL, V1994
[8]  
Hannun A., ARXIV PREPRINT ARXIV
[9]  
Heafield Kenneth., 2011, Proceedings of the Sixth Workshop on Statistical Machine Translation, P187
[10]  
Helmke H., 2017, SEMISUPERVISED LEARN