A survey on speech synthesis techniques in Indian languages

被引:0
作者
Soumya Priyadarsini Panda
Ajit Kumar Nayak
Satyananda Champati Rai
机构
[1] Silicon Institute of Technology,Department of CSE
[2] Siksha ‘O’ Anusandhan University,Department of CS and IT
[3] Silicon Institute of Technology,Department of IT
来源
Multimedia Systems | 2020年 / 26卷
关键词
Text to speech system; Speech synthesis; Indian languages; Concatenative synthesis; Formant synthesis; Articulatory synthesis; Syllable-based synthesis; HMM-based synthesis; Statistical parametric synthesis; Polyglot synthesis; Multilingual synthesis; Waveform concatenation, Deep learning;
D O I
暂无
中图分类号
学科分类号
摘要
The text to speech technology has achieved significant progress during the past decade and is an active area of research and development in providing different human–computer interactive systems. Even though a number of speech synthesis models are available for different languages focusing on the domain requirements with many motive applications, a source of information on current trends in Indian language speech synthesis is unavailable till date making it difficult for the beginners to initiate research for the development of TTS systems for the low-resourced languages. This paper provides a review of the contributions made by different researchers in the field of Indian language speech synthesis along with a study on the Indian language characteristics and the associated challenges in designing TTS systems. A set of available applications and tools results out of different projects undertaken by different organizations along with a set of possible future developments are also discussed to provide a single reference to an important strand of research in speech synthesis which may benefit anyone interested to initiate research in this area.
引用
收藏
页码:453 / 478
页数:25
相关论文
共 50 条
[41]   A Study of Opinion Mining in Indian Languages [J].
Miranda, Diana Terezinha ;
Mascarenhas, Maruska .
PROGRESS IN INTELLIGENT COMPUTING TECHNIQUES: THEORY, PRACTICE, AND APPLICATIONS, VOL 2, 2018, 719 :71-77
[42]   Neural Machine Translation for Indian Languages [J].
Pathak, Amarnath ;
Pakray, Partha .
JOURNAL OF INTELLIGENT SYSTEMS, 2019, 28 (03) :465-477
[43]   Multilingual Speaker Recognition on Indian Languages [J].
Sarkar, Sourjya ;
Rao, K. Sreenivasa ;
Nandi, Dipanjan ;
Kumar, Sunil S. B. .
2013 ANNUAL IEEE INDIA CONFERENCE (INDICON), 2013,
[44]   Anaphora Resolution System for Indian Languages [J].
Devi, Sobha Lalitha ;
Ram, Vijay Sundar R. ;
Rao, Pattabhi R. K. .
LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014,
[45]   Suitability of syllable-based modeling units for end-to-end speech recognition in Sanskrit and other Indian languages [J].
Anoop, Chandran Savithri ;
Ramakrishnan, Angarai Ganesan .
EXPERT SYSTEMS WITH APPLICATIONS, 2023, 220
[46]   Learning Optimum Number of Bases for Indian Languages in Non-negative Matrix Factorization based Multilingual Speech Separation [J].
Nag, Nandini C. ;
Shah, Milind S. .
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (08) :68-77
[47]   End-to-End Text-To-Speech synthesis for under resourced South African languages [J].
Nthite, Thapelo ;
Tsoeu, Mohohlo .
2020 INTERNATIONAL SAUPEC/ROBMECH/PRASA CONFERENCE, 2020, :684-689
[48]   TRANSFORMATION OF F0 CONTOURS FOR LEXICAL TONES IN CONCATENATIVE SPEECH SYNTHESIS OF TONAL LANGUAGES [J].
Trung-Nghia Phung ;
Luong, Mai Chi ;
Akagi, Masato .
2012 INTERNATIONAL CONFERENCE ON SPEECH DATABASE AND ASSESSMENTS, 2012, :129-134
[49]   BOOTSTRAPPING TEXT-TO-SPEECH FOR SPEECH PROCESSING IN LANGUAGES WITHOUT AN ORTHOGRAPHY [J].
Sitaram, Sunayana ;
Palkar, Sukhada ;
Chen, Yun-Nung ;
Parlikar, Alok ;
Black, Alan W. .
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, :7992-7996
[50]   A Hybrid HMM-Waveglow based Text-to-speech Synthesizer using Histogram Equalization for Low resource Indian Languages [J].
Kumar, Mano Ranjith M. ;
Srivastava, Sudhanshu ;
Prakash, Anusha ;
Murthy, Hema A. .
INTERSPEECH 2020, 2020, :2037-2041