IMPROVED UNIT SELECTION SPEECH SYNTHESIS METHOD UTILIZING SUBJECTIVE EVALUATION RESULTS ON SYNTHETIC SPEECH

被引:0
作者
Xia, Xian-Jun [1 ]
Ling, Zhen-Hua [1 ]
Yang, Chen-Yu [1 ]
Dai, Li-Rong [1 ]
机构
[1] Univ Sci & Technol China, iFLYTEK Speech Lab, Beijing, Peoples R China
来源
2012 8TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING | 2012年
关键词
Speech synthesis; unit selection; subjective evaluation; SVM;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents an improved unit selection and waveform concatenation speech synthesis method by gathering and utilizing human feedbacks on synthetic speech. Firstly, a set of texts are synthesized by the baseline unit selection synthesis system. Each prosodic word within the synthetic speech is then evaluated as a natural one or an unnatural one by listeners. In our proposed method, these natural synthetic segments are treated as virtual candidate units to extend the original speech corpus for unit selection. A new speech synthesis system is constructed using this extended speech corpus. A synthetic error detector based on SVM classifier is also built using the natural and unnatural synthetic speech. At synthesis time, the input text is synthesized using the baseline system and the extended system simultaneously. The two unit selection results are evaluated by the trained synthetic error detector to determine the optimal one. Experimental results prove the effectiveness of our proposed method in improving the naturalness of synthetic speech on a task of synthesizing place names.
引用
收藏
页码:160 / 164
页数:5
相关论文
共 7 条
[1]  
Hirai T., 2004, 5 ISCA SPEECH SYNTH
[2]  
Hunt AJ, 1996, INT CONF ACOUST SPEE, P373, DOI 10.1109/ICASSP.1996.541110
[3]   ON INFORMATION AND SUFFICIENCY [J].
KULLBACK, S ;
LEIBLER, RA .
ANNALS OF MATHEMATICAL STATISTICS, 1951, 22 (01) :79-86
[4]  
Ling Z.-H, ISCSLP 2010
[5]  
Ling Z.-H, 2007, P ICASSP
[6]  
Lu H., 2011, P ICASSP, P5352
[7]  
Wang R H, 2000, P INT C SPOK LANG PR, P391