A survey on speech synthesis techniques in Indian languages

被引:0
|
作者
Soumya Priyadarsini Panda
Ajit Kumar Nayak
Satyananda Champati Rai
机构
[1] Silicon Institute of Technology,Department of CSE
[2] Siksha ‘O’ Anusandhan University,Department of CS and IT
[3] Silicon Institute of Technology,Department of IT
来源
Multimedia Systems | 2020年 / 26卷
关键词
Text to speech system; Speech synthesis; Indian languages; Concatenative synthesis; Formant synthesis; Articulatory synthesis; Syllable-based synthesis; HMM-based synthesis; Statistical parametric synthesis; Polyglot synthesis; Multilingual synthesis; Waveform concatenation, Deep learning;
D O I
暂无
中图分类号
学科分类号
摘要
The text to speech technology has achieved significant progress during the past decade and is an active area of research and development in providing different human–computer interactive systems. Even though a number of speech synthesis models are available for different languages focusing on the domain requirements with many motive applications, a source of information on current trends in Indian language speech synthesis is unavailable till date making it difficult for the beginners to initiate research for the development of TTS systems for the low-resourced languages. This paper provides a review of the contributions made by different researchers in the field of Indian language speech synthesis along with a study on the Indian language characteristics and the associated challenges in designing TTS systems. A set of available applications and tools results out of different projects undertaken by different organizations along with a set of possible future developments are also discussed to provide a single reference to an important strand of research in speech synthesis which may benefit anyone interested to initiate research in this area.
引用
收藏
页码:453 / 478
页数:25
相关论文
共 50 条
  • [21] Phoneme-to-Speech Dictionary for Indian Languages
    Reddy, Mallamma V.
    Mary, Margaret T.
    Hanumanthappa, M.
    PROCEEDINGS OF THE IEEE INTERNATIONAL CONFERENCE ON SOFT-COMPUTING AND NETWORKS SECURITY (ICSNS 2015), 2015,
  • [22] An Approach to Building Language-Independent Text-to-Speech Synthesis for Indian Languages
    Prakash, Anusha
    Reddy, M. Ramasubba
    Nagarajan, T.
    Murthy, Hema A.
    2014 TWENTIETH NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2014,
  • [23] Modified Rule-Based Concatenative Technique for Intelligible Speech Synthesis in Indian Languages
    Panda, Soumya Priyadarsini
    Nayak, Ajit Kumar
    ADVANCED SCIENCE LETTERS, 2016, 22 (02) : 557 - 563
  • [24] A Study on Abstractive Summarization Techniques in Indian Languages
    Sunitha, C.
    Jaya, A.
    Ganesh, Amal
    FOURTH INTERNATIONAL CONFERENCE ON RECENT TRENDS IN COMPUTER SCIENCE & ENGINEERING (ICRTCSE 2016), 2016, 87 : 25 - 31
  • [25] Salient phonetic features of Indian languages in speech technology
    Bhaskararao, Peri
    SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2011, 36 (05): : 587 - 599
  • [26] Effect of GCI perturbation on speech quality in Indian languages
    Lehana, PK
    Pandey, PC
    IEEE TENCON 2003: CONFERENCE ON CONVERGENT TECHNOLOGIES FOR THE ASIA-PACIFIC REGION, VOLS 1-4, 2003, : 959 - 963
  • [27] IndicSpeech: Text-to-Speech Corpus for Indian Languages
    Srivastava, Nimisha
    Mukhopadhyay, Rudrabha
    Prajwal, K. R.
    Jawahar, C., V
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 6417 - 6422
  • [28] Salient phonetic features of Indian languages in speech technology
    PERI BHASKARARAO
    Sadhana, 2011, 36 : 587 - 599
  • [29] Multilingual speech mode classification model for Indian languages
    Tripathi, Kumud
    Rao, K. Sreenivasa
    2020 TWENTY SIXTH NATIONAL CONFERENCE ON COMMUNICATIONS (NCC 2020), 2020,
  • [30] Speech interfaces in Indian languages for access to Internet resources
    Jaywant, R
    Prasad, GD
    Ramani, S
    PROCEEDINGS OF THE ICCC 2002: 15TH INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION, VOLS 1 AND 2: REDEFINING INTERNET IN THE CONTEXT OF PERVASIVE COMPUTING, 2002, : 817 - 827