A Rule-Based Concatenative Approach to Speech Synthesis in Indian Language Text-to-Speech Systems

Cited by: 4
Authors
Panda, Soumya Priyadarsini [1]
Nayak, Ajit Kumar [1]
Affiliations
[1] Siksha O Anusandhan Univ, Bhubaneswar, Orissa, India
Source
INTELLIGENT COMPUTING, COMMUNICATION AND DEVICES | 2015 / Vol. 309
Keywords
Speech synthesis; Text-to-speech system; Natural language processing; Concatenative synthesis; Indian languages;
DOI
10.1007/978-81-322-2009-1_59
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Several text-to-speech (TTS) systems are available today for languages such as English, Japanese, and Chinese, but Indian languages still lag behind in the quality of synthesized speech. Although almost all Indian languages share a common phonetic base, a usable TTS system covering all official Indian languages is not yet available. Existing speech synthesis techniques are also found to be less effective when applied to the scripting format of Indian languages. Considering the intelligibility of the speech produced and the increasing memory requirements of Indian language TTS systems, this paper proposes a rule-based concatenative technique for speech synthesis in Indian languages. The proposed technique is compared with the existing technique, and the experimental results show that it outperforms the existing technique.
Pages: 523-531
Page count: 9