Continuous Control of the Degree of Articulation in HMM-based Speech Synthesis

被引:0
作者
Picart, Benjamin [1 ]
Drugman, Thomas [1 ]
Dutoit, Thierly [1 ]
机构
[1] Univ Mons UMons, Fac Polytech FPMs, TCTS Lab, Mons, Belgium
来源
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5 | 2011年
关键词
Speech Synthesis; HTS; Expressive Speech; Speaking Style Adaptation; Voice Quality; SPEAKER ADAPTATION; STYLE CONTROL;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper focuses on the implementation of a continuous control of the degree of articulation (hypo/hyperarticulation) in the framework of HMM-based speech synthesis. The adaptation of a neutral speech synthesizer to generate hypo and hyperarticulated speech using a limited amount of speech data is first studied. This is done using inter-speaker voice adaptation techniques, applied here to intra-speaker voice adaptation. The implementation of a continuous control of the degree of articulation is then proposed in a second step. Finally, a subjective evaluation shows that good quality neutral/hypo/hyperarticulated speech, and also any intermediate, interpolated or extrapolated articulation degrees, can be obtained from an HMM-based speech synthesizer.
引用
收藏
页码:1808 / 1811
页数:4
相关论文
共 23 条
[1]  
[Anonymous], 2003, P 8 EUR C SPEECH COM
[2]  
[Anonymous], EC SPEECH GESTURES
[3]  
Beller G., 2009, THESIS U PARIS 6 PIE
[4]  
Beller G., 2007, INFLUENCE EXPRESSIVI
[5]  
Beller G., 2008, 4 INT C SPEECH PROS
[6]   SPEAKER ADAPTATION USING CONSTRAINED ESTIMATION OF GAUSSIAN MIXTURES [J].
DIGALAKIS, VV ;
RTISCHEV, D ;
NEUMEYER, LG .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1995, 3 (05) :357-366
[7]  
Drugman T., 2010, P INTERSPEECH
[8]  
Drugman T., 2009, P INTERSPEECH
[9]  
Ferguson J.D., 1980, S APPL HIDDEN MARKOV, P143
[10]   Maximum likelihood linear transformations for HMM-based speech recognition [J].
Gales, MJF .
COMPUTER SPEECH AND LANGUAGE, 1998, 12 (02) :75-98