Multi-pass pronunciation adaptation

被引:0
|
作者
Bodenstab, Nathan [1 ]
Fanty, Mark [2 ]
机构
[1] Oregon Hlth & Sci Univ, OGI, Portland, OR 97201 USA
[2] Nuance Commun, Sunnyvale, CA USA
关键词
pronunciation; speech; learning; adaptation;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A mapping between words and pronunciations (potential phonetic realizations) is a key component of speech recognition systems. Traditionally, this has been encoded in a lexicon where each pronunciation is transcribed by a linguist or generated by a grapheme-to-phoneme algorithm. For large vocabulary recognition systems, this process is highly susceptible to errors. We present an off-line data driven algorithm to correct suboptimal pronunciations using transcribed utterances. Unlike previous data driven algorithms that struggle to balance acoustic representation and multi-speaker generalization, our multi-pass approach maximizes both criteria, instead of compromising between the two. We demonstrate on an automated name dialing task that our multipass algorithm achieves a 70% error rate reduction when compared to a baseline grapheme-to-phoneme generated lexicon.
引用
收藏
页码:865 / +
页数:2
相关论文
共 50 条
  • [21] Event Coreference Resolution with Multi-Pass Sieves
    Lu, Jing
    Ng, Vincent
    LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 3996 - 4003
  • [22] Simulations of transformation kinetics in a multi-pass weld
    Rojko, D
    Gliha, V
    MATERIALS AND MANUFACTURING PROCESSES, 2005, 20 (05) : 833 - 849
  • [23] Results of multi-pass nasolacrimal duct probing
    Wright, KW
    Mocan, MC
    Najera-Covarrubias, M
    Suarez, N
    AT THE CROSSINGS: PEDIATRIC OPHTHALMOLOGY AND STRABISMUS, 2004, : 251 - 255
  • [24] Multi-Pass Stamping Forming a Concave Ring
    Zhang, Song
    Shu, Xuedao
    Shi, Jianan
    Li, Zixuan
    APPLIED SCIENCES-BASEL, 2020, 10 (18):
  • [25] The problem of inhomogeneity in multi-pass drawing process
    Luksza, J
    Majta, J
    Skolyszewski, A
    Bator, A
    METAL FORMING 2000, 2000, : 589 - 596
  • [26] Multi-pass model based artistic rendering
    Mi, Xiao-Feng
    Chen, Xue-Song
    Tang, Min
    Dong, Jin-Xiang
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2003, 37 (06): : 664 - 669
  • [27] Randomized multi-pass streaming skyline algorithms
    Sarma, Atish Das
    Lall, Ashwin
    Nanongkai, Danupon
    Xu, Jun
    Proceedings of the VLDB Endowment, 2009, 2 (01): : 85 - 96
  • [28] A Multi-pass Sieve for Clinical Concept Normalization
    Wang, Yuxia
    Hur, Brian
    Verspoor, Karin
    Baldwin, Timothy
    TRAITEMENT AUTOMATIQUE DES LANGUES, 2020, 61 (02): : 41 - 65
  • [29] An evolutionary approach for multi-pass turning operations
    Singh, G.
    Choudhary, A. K.
    Karunakaran, K. P.
    Tiwari, M. K.
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART B-JOURNAL OF ENGINEERING MANUFACTURE, 2006, 220 (02) : 145 - 162
  • [30] A Multi-Pass Generation of DEM for Urban Planning
    Cui, Zheng
    Zhang, Keqi
    Zhang, Chengcui
    Chen, Shu-Ching
    2013 INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA (CLOUDCOM-ASIA), 2013, : 543 - 548