Automatic Generation of a Pronunciation Dictionary with Rich Variation Coverage Using SMT Methods

被引:0
作者
Karanasou, Panagiota [1 ]
Lamel, Lori [1 ]
机构
[1] LIMSI CNRS, Spoken Language Proc Grp, F-91403 Orsay, France
来源
COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, PT II | 2011年 / 6609卷
关键词
pronunciation lexicon; G2P conversion; SMT; pivot paraphrasing;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Constructing a pronunciation lexicon with variants in a fully automatic and language-independent way is a challenge, with many uses in human language technologies. Moreover, with the growing use of web data, there is a recurrent need to add words to existing pronunciation lexicons, and an automatic method can greatly simplify the effort required to generate pronunciations for these out-of-vocabulary words. In this paper, a machine translation approach is used to perform grapheme-to-phoneme (g2p) conversion, the task of finding the pronunciation of a word from its written form. Two alternative methods are proposed to derive pronunciation variants. In the first case, an n-best pronunciation list is extracted directly from the g2p converter. The second is a novel method based on a pivot approach, traditionally used for the paraphrase extraction task, and applied as a post-processing step to the g2p converter. The performance of these two methods is compared under different training conditions. The range of applications which require pronunciation lexicons is discussed and the generated pronunciations are further tested in some preliminary automatic speech recognition experiments.
引用
收藏
页码:506 / 517
页数:12
相关论文
共 19 条
  • [1] [Anonymous], P ICSLP 2002
  • [2] [Anonymous], 1986, JHUEECS8601
  • [3] Bannard C., 2005, P ACI
  • [4] Bisani Maximilian., 2002, Proceedings of the 7th International Conference on Spoken Language Processing, P105
  • [5] Deligne S., 1995, EUROSPEECH, P2243
  • [6] Dietterich TG, 1994, J ARTIF INTELL RES, V2, P263
  • [7] The LIMSI Broadcast News transcription system
    Gauvain, JL
    Lamel, L
    Adda, G
    [J]. SPEECH COMMUNICATION, 2002, 37 (1-2) : 89 - 108
  • [8] Gerosa M., 2009, ICASSP
  • [9] Jiampojamarn Sittichai., 2008, The 46th Annual Meeting of the Association for Computational Linguistics, P905
  • [10] Kaisse E., 2005, Handbook of word formation, P25