Deep Learning Techniques in Tandem with Signal Processing Cues for Phonetic Segmentation for Text to Speech Synthesis in Indian Languages

Cited by: 12
Authors
Baby, Arun [1]
Prakash, Jeena J. [1]
Vignesh, Rupak [1]
Murthy, Hema A. [1]
Affiliations
[1] Indian Inst Technol Madras, Dept Comp Sci & Engn, Chennai, Tamil Nadu, India
Keywords
Deep Neural Networks; Convolutional Neural Networks; phonetic segmentation; signal processing cues
DOI
10.21437/Interspeech.2017-666
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Automatic detection of phoneme boundaries is an important sub-task in building speech processing applications, especially text-to-speech synthesis (TTS) systems. The main drawback of Gaussian mixture model-hidden Markov model (GMM-HMM) based forced alignment is that the phoneme boundaries are not explicitly modeled. In an earlier work, we proposed the use of signal processing cues in tandem with GMM-HMM based forced alignment for boundary correction when building Indian language TTS systems. In this paper, we capitalise on robust acoustic modeling techniques such as deep neural networks (DNN) and convolutional deep neural networks (CNN). The GMM-HMM based forced alignment is replaced by DNN-HMM/CNN-HMM based forced alignment. Signal processing cues are then used to correct the segment boundaries obtained from the DNN-HMM/CNN-HMM segmentation. TTS systems built using these boundaries show a relative improvement in synthesis quality.
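
The boundary-correction step described in the abstract can be illustrated with a minimal sketch. The abstract does not say which signal processing cues are used or how a boundary is moved, so everything here is an assumption made for illustration: the cue locations are taken as a precomputed list of times in seconds, each forced-alignment boundary is snapped to the nearest cue, and the hypothetical parameter max_shift limits how far a boundary may move. The function name correct_boundaries is likewise hypothetical and is not the authors' implementation.

```python
from bisect import bisect_left


def correct_boundaries(hmm_boundaries, cue_times, max_shift=0.02):
    """Snap each forced-alignment boundary to the nearest cue time.

    hmm_boundaries: boundary times (seconds) from DNN-HMM/CNN-HMM alignment.
    cue_times: candidate boundary times (seconds) from a signal processing
        cue detector (assumed to be computed elsewhere).
    max_shift: assumed tolerance; a boundary is left unchanged when no cue
        lies within this many seconds of it.
    """
    cues = sorted(cue_times)
    corrected = []
    for boundary in hmm_boundaries:
        # Only the cues immediately before and after the boundary can be nearest.
        i = bisect_left(cues, boundary)
        candidates = cues[max(i - 1, 0):i + 1]
        nearest = min(candidates, key=lambda c: abs(c - boundary), default=None)
        if nearest is not None and abs(nearest - boundary) <= max_shift:
            corrected.append(nearest)
        else:
            corrected.append(boundary)
    return corrected


if __name__ == "__main__":
    aligned = [0.10, 0.25, 0.40]   # illustrative values, not data from the paper
    cues = [0.11, 0.30, 0.405]
    print(correct_boundaries(aligned, cues))
```

In this sketch the second boundary is kept as-is because its nearest cue is 50 ms away, which exceeds the assumed 20 ms tolerance. A real system would also need to make sure corrected boundaries stay in order.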
Pages: 3817-3821
Page count: 5