Hidden-Markov-model based statistical parametric speech synthesis for Marathi with optimal number of hidden states

被引:0
|
作者
Suraj Pandurang Patil
Swapnil Laxman Lahudkar
机构
[1] JSPMs Rajarshi Shahu College of Engineering,
[2] JSPM’s Imperial College of Engineering and Research,undefined
来源
International Journal of Speech Technology | 2019年 / 22卷
关键词
Speech Synthesis; Hidden Markov Model; Context-dependent HMM; HMM Toolkit;
D O I
暂无
中图分类号
学科分类号
摘要
Hidden Markov Model and Deep Neural Networks based Statistical Parametric Speech Synthesis systems, gain a significant attention from researchers because of their flexibility in generating speech waveforms in diverse voice qualities as well as in styles. This paper describes HMM-based speech synthesis system (SPSS) for the Marathi language. In proposed synthesis method, speech parameter trajectories used for synthesis are generated from the trained hidden Markov models (HMM). We have recorded our database of 5300 phonetically balanced Marathi sentences to train the context-dependent HMM with five, seven and nine hidden states. The subjective quality measures (MOS and PWP) shows that the HMMs with seven hidden states are capable of giving an adequate quality of synthesized speech as compared to five state and with less time complexity than seven state HMMs. The contextual features used for experimentation are inclusive of a position of an observed phoneme in a respective syllable, word, and sentence.
引用
收藏
页码:93 / 98
页数:5
相关论文
共 50 条
  • [21] Statistical brand switching model: an Hidden Markov approach
    Kumaraswamy, K.
    Bhatracharyulu, N. Ch.
    OPSEARCH, 2023, 60 (02) : 942 - 950
  • [22] Statistical brand switching model: an Hidden Markov approach
    K. Kumaraswamy
    N. Ch. Bhatracharyulu
    OPSEARCH, 2023, 60 : 942 - 950
  • [23] Time-Inhomogeneous Hidden Bernoulli Model: An alternative to Hidden Markov Model for automatic speech recognition
    Kabudian, Jahanshah
    Homayounpour, M. Mehdi
    Ahadi, S. Mohammad
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4101 - +
  • [24] Improved detection algorithm for copy number variations based on hidden Markov model
    Hai Yang
    Daming Zhu
    Multimedia Tools and Applications, 2020, 79 : 9237 - 9253
  • [25] Improved detection algorithm for copy number variations based on hidden Markov model
    Yang, Hai
    Zhu, Daming
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (13-14) : 9237 - 9253
  • [26] A Hidden Markov Model for Persian Part-of-Speech Tagging
    Okhovvat, Morteza
    Bidgoli, Behrouz Minaei
    WORLD CONFERENCE ON INFORMATION TECHNOLOGY (WCIT-2010), 2011, 3
  • [27] Automatic Urdu Speech Recognition Using Hidden Markov Model
    Asadullah
    Shaukat, Arslan
    Ali, Hazrat
    Akram, Usman
    2016 INTERNATIONAL CONFERENCE ON IMAGE, VISION AND COMPUTING (ICIVC 2016), 2016, : 135 - 139
  • [28] Deterministically annealed design of hidden Markov model speech recognizers
    Rao, AV
    Rose, K
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (02): : 111 - 126
  • [29] A fused hidden Markov model with application to bimodal speech processing
    Pan, H
    Levinson, SE
    Huang, TS
    Liang, ZP
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2004, 52 (03) : 573 - 581
  • [30] Improved hidden Markov model for speech recognition and POS tagging
    Yuan Li-chi
    JOURNAL OF CENTRAL SOUTH UNIVERSITY, 2012, 19 (02) : 511 - 516