An introduction to statistical parametric speech synthesis

Cited by: 0

Author:

Simon King

Affiliation:

[1] University of Edinburgh, The Centre for Speech Technology Research

Source:

Sadhana | 2011 / Vol. 36

Keywords:

Speech synthesis; hidden Markov model-based speech synthesis; statistical parametric speech synthesis; vocoding; text-to-speech

DOI:

Not available

Abstract:

Statistical parametric speech synthesis, based on hidden Markov model-like models, has become competitive with established concatenative techniques over the last few years. This paper offers a non-mathematical introduction to this method of speech synthesis. It is intended to be complementary to the wide range of excellent technical publications already available. Rather than offer a comprehensive literature review, this paper instead gives a small number of carefully chosen references which are good starting points for further reading.

Pages: 837-852

Page count: 15
