Multi-pass pronunciation adaptation

被引:0
|
作者
Bodenstab, Nathan [1 ]
Fanty, Mark [2 ]
机构
[1] Oregon Hlth & Sci Univ, OGI, Portland, OR 97201 USA
[2] Nuance Commun, Sunnyvale, CA USA
关键词
pronunciation; speech; learning; adaptation;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A mapping between words and pronunciations (potential phonetic realizations) is a key component of speech recognition systems. Traditionally, this has been encoded in a lexicon where each pronunciation is transcribed by a linguist or generated by a grapheme-to-phoneme algorithm. For large vocabulary recognition systems, this process is highly susceptible to errors. We present an off-line data driven algorithm to correct suboptimal pronunciations using transcribed utterances. Unlike previous data driven algorithms that struggle to balance acoustic representation and multi-speaker generalization, our multi-pass approach maximizes both criteria, instead of compromising between the two. We demonstrate on an automated name dialing task that our multipass algorithm achieves a 70% error rate reduction when compared to a baseline grapheme-to-phoneme generated lexicon.
引用
收藏
页码:865 / +
页数:2
相关论文
共 50 条
  • [31] Experimental study of asymmetric multi-pass spinning
    Yong Xiao
    Zhiren Han
    Shuyang Zhou
    Zhen Jia
    The International Journal of Advanced Manufacturing Technology, 2020, 110 : 667 - 679
  • [32] LAMELLAR TEARING IN MULTI-PASS FILLET JOINTS
    ELLIOTT, DN
    WELDING JOURNAL, 1969, 48 (09) : S409 - &
  • [33] An experimental study on a cylindrical multi-pass cell
    Tonomura, M
    Miyazawa, H
    Nakamura, T
    Endo, M
    Yamaguchi, S
    Nanri, K
    Fujioka, T
    2005 PACIFIC RIM CONFERENCE ON LASERS AND ELECTRO-OPTICS, 2005, : 861 - 862
  • [34] Hyperspectral multi-pass mapping for target detection
    Schaum, A
    Stocker, A
    ALGORITHMS AND TECHNOLOGIES FOR MULTISPECTRAL, HYPERSPECTRAL AND ULTRASPECTRAL IMAGERY IX, 2003, 5093 : 1 - 8
  • [35] Grain refinement of HAZ in multi-pass welding
    Ma, R.
    Fang, K.
    Yang, J. G.
    Liu, X. S.
    Fang, H. Y.
    JOURNAL OF MATERIALS PROCESSING TECHNOLOGY, 2014, 214 (05) : 1131 - 1135
  • [36] MAP equalization for DQPSK in multi-pass demodulation
    Khayrallah, AS
    Fulghum, T
    Hui, D
    IEEE VEHICULAR TECHNOLOGY CONFERENCE, FALL 2000, VOLS 1-6, PROCEEDINGS: BRINGING GLOBAL MOBILITY TO THE NETWORK AGE, 2000, : 2249 - 2256
  • [37] Multi-Pass High-Level Presolving
    Leo, Kevin
    Tack, Guido
    PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), 2015, : 346 - 352
  • [38] Analysis of a SCALPEL™ multi-pass writing strategy
    Zhu, X
    Munro, E
    Rouse, JA
    Liu, H
    Waskiewicz, WK
    MICROELECTRONIC ENGINEERING, 2000, 53 (1-4) : 321 - 324
  • [39] Revisiting Multi-pass Scatter and Gather on GPUs
    Lai, Zhuohang
    Luo, Qiong
    Jia, Xiaoying
    PROCEEDINGS OF THE 47TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, 2018,
  • [40] A remotely controllable optical multi-pass system
    Linnartz, H
    PHYSICA SCRIPTA, 2004, 70 (06) : C24 - C25