Prosody evaluation for embedded slovene speech-synthesis systems

被引:0
作者
Mihelic, France [1 ]
Vesnicer, Bostjan [1 ]
Zibert, Janez [1 ]
Noeth, Elmar [2 ]
机构
[1] Univ Ljubljana, Fac Elect Engn, Ljubljana 1000, Slovenia
[2] Univ Erlangen Nurnberg, IMMD5, D-91058 Erlangen, Germany
来源
INFORMACIJE MIDEM-JOURNAL OF MICROELECTRONICS ELECTRONIC COMPONENTS AND MATERIALS | 2007年 / 37卷 / 03期
关键词
embedded systems; speech synthesis; HMM acoustic modeling; prosody modeling; speech synthesis evaluation; prosodic tags recognition; support vector machines; RGB kernel;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper describes an evaluation of the prosody modeling in an HMM-based Slovene speech-synthesis system that is suitable for embedded systems due to its relatively small memory footprint. The objective-evaluation procedure is based on the results of the automatic recognition of syntactic-prosodic boundary positions and accented words in the synthetic speech. We have shown that the recognition results represent a close match with the prosodic notations, labeled by the human expert on the natural-speech counterpart produced by the speaker whose speech was used to train the speech-synthesis system. Therefore, the recognition rate of the prosodic events is proposed as an objective evaluation measure for the quality of the prosodic modeling in the speech-synthesis system. The results of the proposed evaluation method are also in accordance with previous subjective-listening evaluation tests, where high scores for the naturalness for such a type of speech synthesis were observed.
引用
收藏
页码:176 / 181
页数:6
相关论文
共 20 条
[1]  
[Anonymous], P IEEE WORKSH SPEECH
[2]  
[Anonymous], 1998, P ICSLP SYDN AUSTR
[3]  
[Anonymous], LIBSVM LIB SUPPORT V
[4]   M = Syntax plus Prosody: A syntactic-prosodic labelling scheme for large spontaneous speech databases [J].
Batliner, A ;
Kompe, R ;
Kiessling, A ;
Mast, M ;
Niemann, H ;
Noth, E .
SPEECH COMMUNICATION, 1998, 25 (04) :193-222
[5]  
BUCKOW J, 2004, MULTILINGUAL PROSODY
[6]  
CAMPBELL N, 1996, PROGR SPEECH SYNTHES, P279
[7]  
Gros J., 1999, Elektrotehniski Vestnik, V66, P92
[8]  
Gros JZ, 2007, INFORM MIDEM, V37, P158
[9]  
Hsu Chih-Wei, PRACTICAL GUIDE SUPP
[10]  
Mihelic A, 2006, INFORM MIDEM, V36, P19