Recent Trends in Text to Speech Synthesis of Indian Languages

被引:0
作者
Joshi, Sarang L. [1 ]
Bairagi, Vinayak K. [1 ]
机构
[1] AISSMS IOIT, Pune, Maharashtra, India
来源
HELIX | 2019年 / 9卷 / 03期
关键词
Concatenative; Prosody; Speech Synthesis; Syllable; TTS; Text to Speech;
D O I
10.29042/2019-4931-4936
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
A Text To Speech (TTS) synthesizer is a computer application capable of converting arbitrary input text into speech. This conversion broadly involves two steps, namely, text processing and speech synthesis. Text processing converts the entered text to a sequence of synthesis units, while speech synthesis is the generation of an acoustic wave form corresponding to each of these units. Naturalness and intelligibility are the most important qualities expected from a TTS system. In this paper we aim to provide an overview of various techniques for text to speech synthesis, discuss their characteristics, summarize and compares advantages and drawbacks. We have listed various Text-to-Speech synthesis frameworks developed and implemented at different Indian institutes.
引用
收藏
页码:4931 / 4936
页数:6
相关论文
共 16 条
[1]  
Black Alan W., 2003, 8 EUR C SPEECH COMM
[2]  
Gaikwad P. B., 2014, INT J ADV RES COMPUT, V193, P194
[3]  
Gopi A, 2013, 2013 INTERNATIONAL CONFERENCE ON CONTROL COMMUNICATION AND COMPUTING (ICCC), P184, DOI 10.1109/ICCC.2013.6731647
[4]  
Kanade Varun, 2004, VANI AN INDIA LANGUA
[5]  
Kiruthiga S., 2012, P INT C COMPUTER COM, P1
[6]  
Mache S, 2016, J COMPUT ENG, V18, P35
[7]  
Mahanta D, 2016, PROCEEDINGS OF THE 2016 IEEE REGION 10 CONFERENCE (TENCON), P2614, DOI 10.1109/TENCON.2016.7848511
[8]  
Mandal S. K. D., 2007, SSW, P351
[9]  
Mohanty S., 2011, INT J ADV ENG TECHNO, V1, P138
[10]   Text-to-speech synthesis with an Indian language perspective [J].
Panda, Soumya Priyadarsini ;
Nayak, Ajit Kumar ;
Patnaik, Srikanta .
INTERNATIONAL JOURNAL OF GRID AND UTILITY COMPUTING, 2015, 6 (3-4) :170-178