Enhancing Air Traffic Control Communication Systems with Integrated Automatic Speech Recognition: Models, Applications and Performance Evaluation

Cited by: 0
Authors
Wang, Zhuang [1]
Jiang, Peiyuan [1]
Wang, Zixuan [1]
Han, Boyuan [1]
Liang, Haijun [1]
Ai, Yi [1]
Pan, Weijun [1]
Affiliations
[1] Civil Aviation Flight University of China, College of Air Traffic Management, Guanghan 618307, People's Republic of China
Keywords
air traffic control; speech communication; automatic speech recognition
DOI
10.3390/s24144715
Chinese Library Classification (CLC) number
O65 [Analytical Chemistry]
Discipline classification codes
070302; 081704
Abstract
In air traffic control (ATC), radio speech communication is the primary means of exchanging information between controllers and pilots. Integrating automatic speech recognition (ASR) into ATC systems therefore has considerable potential to reduce controllers' workload and can support a range of ATC scenarios, making it an important topic in ATC research. This article reviews the applications of ASR technology in ATC communication systems. First, it surveys current research, covering ATC corpora, ASR models, evaluation measures and application scenarios. Next, it proposes a more comprehensive and accurate evaluation methodology tailored to ATC, taking into account advances in communication sensing systems and deep learning techniques. This methodology is intended to help researchers improve ASR systems and the overall performance of ATC systems. Finally, it identifies recommendations for future research based on the primary challenges and open issues. The authors hope this work will serve as a clear technical roadmap for ASR efforts within the ATC domain and make a valuable contribution to the research community.
Pages: 35