Automatic sub-word unit discovery and pronunciation lexicon induction for ASR with application to under-resourced languages

被引:4
|
作者
Agenbag, Wiehan [1 ]
Niesler, Thomas [1 ]
机构
[1] Stellenbosch Univ, Dept Elect & Elect Engn, Stellenbosch, South Africa
来源
COMPUTER SPEECH AND LANGUAGE | 2019年 / 57卷
关键词
Unsupervised SWU discovery; Automatic lexicon induction; ASR; Under-resourced languages;
D O I
10.1016/j.csl.2019.02.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a method enabling the unsupervised discovery of sub-word units (SWUs) and associated pronunciation lexicons for use in automatic speech recognition (ASR) systems. This includes a novel SWU discovery approach based on self-organising HMM-GMM states that are agglomeratively tied across words as well as a novel pronunciation lexicon induction approach that iteratively reduces pronunciation variation by means of model pruning. Our approach relies only on recorded speech and associated orthographic transcriptions and does not require alphabetic graphemes. We apply our methods to corpora of recorded radio broadcasts in Ugandan English, Luganda and Acholi, of which the latter two are under-resourced. The speech is conversational and contains high levels of background noise, and therefore presents a challenge to automatic lexicon induction. We demonstrate that our proposed method is able to discover lexicons that perform as well as baseline expert systems for Acholi, and close to this level for the other two languages when used to train DNN-HMM ASR systems. This demonstrates the potential of the method to enable and accelerate ASR for under-resourced languages for which a phone inventory and pronunciation lexicon are not available by eliminating the dependence on human expertise this usually requires. (C) 2019 Elsevier Ltd. All rights reserved.
引用
收藏
页码:20 / 40
页数:21
相关论文
共 4 条
  • [1] Sub-word Based End-to-End Speech Recognition for an Under-Resourced Language: Amharic
    Gebreegziabher, Nirayo Hailu
    Nuernberger, Andreas
    2020 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2020, : 3466 - 3470
  • [2] Improving Under-Resourced Language ASR Through Latent Subword Unit Space Discovery
    Razavi, Marzieh
    Magimai-Doss, Mathew
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3873 - 3877
  • [3] Automatic Speech Recognition for Under-Resourced Languages: Application to Vietnamese Language
    Le, Viet-Bac
    Besacier, Laurent
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (08): : 1471 - 1482
  • [4] SMT-based ASR domain adaptation methods for under-resourced languages: Application to Romanian
    Cucu, Horia
    Buzo, Andi
    Besacier, Laurent
    Burileanu, Corneliu
    SPEECH COMMUNICATION, 2014, 56 : 195 - 212