A survey on speech synthesis techniques in Indian languages

Authors
Soumya Priyadarsini Panda
Ajit Kumar Nayak
Satyananda Champati Rai
Affiliations
[1] Silicon Institute of Technology, Department of CSE
[2] Siksha ‘O’ Anusandhan University, Department of CS and IT
[3] Silicon Institute of Technology, Department of IT
Source
Multimedia Systems | 2020 / Vol. 26
Keywords
Text to speech system; Speech synthesis; Indian languages; Concatenative synthesis; Formant synthesis; Articulatory synthesis; Syllable-based synthesis; HMM-based synthesis; Statistical parametric synthesis; Polyglot synthesis; Multilingual synthesis; Waveform concatenation; Deep learning
DOI
Not available
Abstract
Text-to-speech (TTS) technology has made significant progress over the past decade and remains an active area of research and development for human–computer interactive systems. Although a number of speech synthesis models are available for different languages, tailored to domain requirements and motivated by many applications, no single source of information on current trends in Indian language speech synthesis has been available to date, which makes it difficult for beginners to initiate research on TTS systems for low-resourced languages. This paper reviews the contributions made by different researchers in the field of Indian language speech synthesis, along with a study of Indian language characteristics and the associated challenges in designing TTS systems. A set of available applications and tools resulting from projects undertaken by different organizations, along with a set of possible future developments, is also discussed to provide a single reference to an important strand of research in speech synthesis, which may benefit anyone interested in initiating research in this area.
Pages: 453-478
Page count: 25