Excitation Modeling Based on Waveform Interpolation for HMM-based Speech Synthesis

被引:0
作者
Sung, June Sig [1 ]
Hong, Doo Hwa [1 ]
Oh, Kyung Hwan [1 ]
Kim, Nam Soo [1 ]
机构
[1] Seoul Natl Univ, Sch Elect Engn & Comp Sci, Inst New Media & Commun, Seoul 151, South Korea
来源
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2 | 2010年
关键词
HMM-based speech synthesis; Waveform Interpolation; Principal Component Analysis;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It is generally known that a well-designed excitation produces high quality signals in hidden Markov model (HMM)-based speech synthesis systems. This paper proposes a novel techniques for generating excitation based on the waveform interpolation (WI). For modeling WI parameters, we implemented statistical method like principal component analysis (PCA). The parameters of the proposed excitation modeling techniques can be easily combined with the conventional speech synthesis system under the HMM framework. From a number of experiments, the proposed method has been found to generate more naturally sounding speech.
引用
收藏
页码:813 / 816
页数:4
相关论文
共 11 条
  • [1] [Anonymous], 2001, 4 ISCA TUT RES WORKS
  • [2] Bishop C., 2006, PATTERN RECOGN, V1st, P559
  • [3] Choy E.L.T., 1998, THESIS MCGILL U MONT
  • [4] Drugman T., 2009, INTERSPEECH2009 BRIG
  • [5] Kim SJ, 2007, IEICE T INF SYST, VE90D, P378, DOI [10.1093/ietisy/e90-1.1.378, 10.1093/ietisy/e90-d.1.378]
  • [6] KLEIJN WB, 1991, INT CONF ACOUST SPEE, P201, DOI 10.1109/ICASSP.1991.150312
  • [7] Maia R., 2007, P ISCA SSW6 AUG
  • [8] Ritz CH, 2002, 2002 IEEE SPEECH CODING WORKSHOP PROCEEDINGS, P32, DOI 10.1109/SCW.2002.1215714
  • [9] Tokuda K, 2000, INT CONF ACOUST SPEE, P1315, DOI 10.1109/ICASSP.2000.861820
  • [10] Tokuda Keiichi., 2006, HMM BASED SPEECH SYN