COMPARING GLOTTAL-FLOW-EXCITED STATISTICAL PARAMETRIC SPEECH SYNTHESIS METHODS

被引:0
作者
Raitio, Tuomo [1 ]
Suni, Antti
Vainio, Martti
Alku, Paavo [1 ]
机构
[1] Aalto Univ, Dept Signal Proc & Acoust, Espoo, Finland
来源
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2013年
基金
芬兰科学院;
关键词
Statistical parametric speech synthesis; excitation; glottal flow; principal component analysis; pulse library; WAVE-FORM;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper studies the performance of glottal flow signal based excitation methods in statistical parametric speech synthesis. The current state of the art in excitation modeling is reviewed and three excitation methods are selected for experiments. Two of the methods are based on the principal component analysis (PCA) decomposition of estimated glottal flow pulses. While the first one uses only the mean of the pulses, the second method uses 12 principal components in addition to the mean signal for modeling the glottal flow waveform. The third method utilizes a glottal flow pulse library from which pulses are selected according to target and concatenation costs. Subjective listening tests are carried out to determine the quality and similarity of the synthetic speech of one male and one female speaker. The results show that the PCA-based methods are rated best both in quality and similarity, but adding more components does not yield any improvements.
引用
收藏
页码:7830 / 7834
页数:5
相关论文
共 34 条
[1]   A method for generating natural-sounding speech stimuli for cognitive brain research [J].
Alku, P ;
Tiitinen, H ;
Näätänen, R .
CLINICAL NEUROPHYSIOLOGY, 1999, 110 (08) :1329-1333
[2]   GLOTTAL WAVE ANALYSIS WITH PITCH SYNCHRONOUS ITERATIVE ADAPTIVE INVERSE FILTERING [J].
ALKU, P .
SPEECH COMMUNICATION, 1992, 11 (2-3) :109-118
[3]  
[Anonymous], P INT
[4]  
[Anonymous], P ICASSP
[5]  
[Anonymous], SPEECH COMMUN
[6]  
[Anonymous], 2010, PROC 7 ISCA WORKSHOP
[7]  
[Anonymous], 2001, P EUR
[8]  
[Anonymous], 2001, THESIS
[9]  
[Anonymous], 2 INT WORKSH MOD AN
[10]  
[Anonymous], BLIZZ CHALL 2010 WOR