Joint vs Sequential Speaker-Role Detection and Automatic Speech Recognition for Air-traffic Control

Authors
Blatt, Alexander [1]
Krishnan, Aravind [1]
Klakow, Dietrich [1]
Affiliations
[1] Saarland University, Saarland Informatics Campus, Saarbrücken, Germany
Source
INTERSPEECH 2024 | 2024
Keywords
diarization; speech recognition; speaker role detection; air-traffic control
DOI
10.21437/Interspeech.2024-1987
Abstract
Using air-traffic control (ATC) data for downstream natural-language processing tasks requires preprocessing. Key steps are transcribing the data via automatic speech recognition (ASR) and applying speaker diarization or, more specifically, speaker role detection (SRD) to divide the transcripts into pilot and air-traffic controller (ATCO) transcripts. While traditional approaches handle these tasks separately, we propose a transformer-based joint ASR-SRD system that solves both tasks at once while relying on a standard ASR architecture. We compare this joint system against two cascaded approaches for ASR and SRD on multiple ATC datasets. Our study shows in which cases our joint system can outperform the two traditional approaches and in which cases the other architectures are preferable. We additionally evaluate how acoustic and lexical differences influence all architectures and show how to overcome them for our joint architecture.
Pages: 3759-3763
Page count: 5