Contextual Additive Structure for HMM-Based Speech Synthesis

Cited by: 3
Authors
Takaki, Shinji [1]
Nankaku, Yoshihiko [1]
Tokuda, Keiichi [1]
Affiliation
[1] Nagoya Inst Technol, Dept Comp Sci & Engineering, Nagoya, Aichi 4668555, Japan
Funding
Japan Science and Technology Agency
Keywords
HMM-based speech synthesis; spectral modeling; decision trees; context clustering; additive structure; distribution convolution; hidden Markov models
DOI
10.1109/JSTSP.2014.2305919
Chinese Library Classification
TM [Electrical Engineering]; TN [Electronics and Communication Technology]
Subject classification codes
0808; 0809
Abstract
This paper proposes a spectral modeling technique based on an additive structure of context dependencies for HMM-based speech synthesis. Contextual additive structure models can represent complicated dependencies between acoustic features and context labels using multiple decision trees. However, the computational complexity of context clustering is too high for the full-context labels used in speech synthesis. To overcome this problem, this paper proposes two approaches: covariance parameter tying and a likelihood calculation algorithm using the matrix inversion lemma. With these techniques, additive structure models can be applied to HMM-based speech synthesis, and speech quality is expected to improve significantly. Experimental results show that the proposed method outperforms the conventional one in subjective listening tests.
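
For reference, the two ingredients named in the abstract can be written out in general form. This is a sketch under stated assumptions, not the paper's own formulation: the symbols M, \mu_{m}, l_m(c), A, U, C, and V are introduced here for illustration only, and the abstract does not say which matrices the lemma is applied to.

```latex
% Additive structure (assumed generic form): the mean for context c is the
% sum of one leaf-node mean from each of M decision trees, where l_m(c)
% denotes the leaf of tree m reached by context c.
\mu_c = \sum_{m=1}^{M} \mu_{m,\, l_m(c)}

% Matrix inversion lemma (Woodbury identity), valid when A, C, and
% C^{-1} + V A^{-1} U are invertible:
(A + U C V)^{-1} = A^{-1} - A^{-1} U \left( C^{-1} + V A^{-1} U \right)^{-1} V A^{-1}
```

When the update U C V has low rank, the right-hand side replaces the inversion of one large matrix with the inversion of a much smaller one, which is the usual reason the lemma reduces computational cost.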
Pages: 229-238
Number of pages: 10