Measuring a decade of progress in Text-to-Speech

Cited by: 62
Authors
King, Simon [1]
Affiliations
[1] Univ Edinburgh, Ctr Speech Technol Res, Edinburgh, Midlothian, Scotland
Source
LOQUENS | 2014 / Vol. 1 / Issue 01
Funding
Engineering and Physical Sciences Research Council (EPSRC), UK;
Keywords
text-to-speech synthesis; evaluation; The Blizzard Challenge;
DOI
10.3989/loquens.2014.006
Chinese Library Classification (CLC) number
H0 [Linguistics];
Discipline classification codes
030303 ; 0501 ; 050102 ;
Abstract
The Blizzard Challenge offers a unique insight into progress in text-to-speech synthesis over the last decade. By using a very large listening test to compare the performance of a wide range of systems that have been constructed using a common corpus of speech recordings, it is possible to make some direct comparisons between competing techniques. By reviewing over a hundred papers describing all entries to the Challenge since 2005, we can make a useful summary of the most successful techniques adopted by participating teams, as well as drawing some conclusions about where the Blizzard Challenge has succeeded, and where there are still open problems in cross-system comparisons of text-to-speech synthesisers.
Pages: 12