Admissible Stopping in Viterbi Beam Search for Unit Selection Speech Synthesis

Cited by: 0
Authors
Sakai, Shinsuke [1]
Kawahara, Tatsuya [1]
Affiliation
[1] Kyoto Univ, Acad Ctr Comp & Media Studies, Kyoto 6068501, Japan
Source
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS | 2013 / Vol. E96D / No. 06
Keywords
speech synthesis; unit selection; concatenation cost; Viterbi search;
DOI
10.1587/transinf.E96.D.1359
Chinese Library Classification number
TP [Automation technology, computer technology];
Subject classification code
0812;
Abstract
Corpus-based concatenative speech synthesis has been widely investigated and deployed in recent years because it produces highly natural synthesized speech. The amount of computation required at run time, however, can be quite large. In this paper, we propose early stopping schemes for Viterbi beam search in unit selection, which allow the search to stop early both in the local Viterbi minimization for each unit and in the exploration of candidate units for a given target. The schemes take advantage of the fact that the space of acoustic parameters of the database units is fixed, so that certain lower bounds of the concatenation costs can be precomputed. The proposed early stopping is admissible in that it does not change the result of the Viterbi beam search. Experiments using probability-based as well as distance-based concatenation costs show that the proposed admissible stopping methods effectively reduce the amount of computation required in the Viterbi beam search while keeping its result unchanged. Furthermore, the reduction in computation turned out to be much larger when a tighter lower bound for the concatenation costs is available.
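The abstract does not spell out the stopping rule, so the following is only a minimal sketch of how an admissible stop in the local Viterbi minimization might look, not the authors' implementation. It assumes (hypothetically) that the predecessor units are visited in ascending order of accumulated cost and that a precomputed value `lower_bound` never exceeds the true concatenation cost into the current unit. The names `local_min_with_stopping`, `prev_costs`, and `concat_cost` are illustrative and do not come from the paper.

```python
import math
from typing import Callable, Hashable, Optional, Sequence, Tuple


def local_min_with_stopping(
    prev_costs: Sequence[Tuple[Hashable, float]],
    concat_cost: Callable[[Hashable], float],
    lower_bound: float,
) -> Tuple[Optional[Hashable], float, int]:
    """Minimize accumulated cost + concatenation cost over predecessors.

    Assumptions (not taken from the paper):
      * prev_costs is sorted by accumulated cost, ascending;
      * lower_bound <= concat_cost(p) for every predecessor p.
    Returns (best predecessor, best total cost, number of cost evaluations).
    """
    best_pred: Optional[Hashable] = None
    best_cost = math.inf
    evaluations = 0
    for pred, acc in prev_costs:
        # Every remaining predecessor has accumulated cost >= acc, so its
        # total is at least acc + lower_bound. If that cannot beat the
        # current best, stopping here leaves the minimum unchanged.
        if acc + lower_bound >= best_cost:
            break
        evaluations += 1
        total = acc + concat_cost(pred)
        if total < best_cost:
            best_pred, best_cost = pred, total
    return best_pred, best_cost, evaluations


# Toy data, invented for illustration only.
prev = [("a", 1.0), ("b", 2.0), ("c", 5.0), ("d", 9.0)]
costs = {"a": 3.0, "b": 1.5, "c": 2.0, "d": 1.0}

early = local_min_with_stopping(prev, costs.__getitem__, lower_bound=1.0)
# A bound of -inf never triggers the stop, which gives the exhaustive scan.
full = local_min_with_stopping(prev, costs.__getitem__, lower_bound=-math.inf)
print(early, full)
```

In this toy case both calls select the same predecessor with the same cost, while the early-stopping call evaluates fewer concatenation costs. A larger (tighter) valid `lower_bound` makes the stop condition fire sooner, which is in line with the abstract's remark that tighter bounds give larger savings.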
Pages: 1359-1367
Page count: 9