An introduction to statistical parametric speech synthesis

被引:0
作者
Simon King
机构
[1] University of Edinburgh,The Centre for Speech Technology Research
来源
Sadhana | 2011年 / 36卷
关键词
Speech synthesis; hidden Markov model-based speech synthesis; statistical parametric speech synthesis; vocoding; text-to-speech;
D O I
暂无
中图分类号
学科分类号
摘要
Statistical parametric speech synthesis, based on hidden Markov model-like models, has become competitive with established concatenative techniques over the last few years. This paper offers a non-mathematical introduction to this method of speech synthesis. It is intended to be complementary to the wide range of excellent technical publications already available. Rather than offer a comprehensive literature review, this paper instead gives a small number of carefully chosen references which are good starting points for further reading.
引用
收藏
页码:837 / 852
页数:15
相关论文
共 9 条
  • [1] Kawahara H(1999)Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based f0 extraction: Possible role of a repetitive structure in sounds Speech Commun. 27 187-207
  • [2] Masuda-Katsuse I(2009)Statistical parametric speech synthesis Speech Commun. 51 1039-1064
  • [3] de Cheveigné A(2007)Reformulating the HMM as a trajectory model by imposing explicit relationships between static and dynamic feature vector sequences Comput. Speech Lang. 21 153-173
  • [4] Zen H(undefined)undefined undefined undefined undefined-undefined
  • [5] Tokuda K(undefined)undefined undefined undefined undefined-undefined
  • [6] Black AW(undefined)undefined undefined undefined undefined-undefined
  • [7] Zen H(undefined)undefined undefined undefined undefined-undefined
  • [8] Tokuda K(undefined)undefined undefined undefined undefined-undefined
  • [9] Kitamura T(undefined)undefined undefined undefined undefined-undefined