Hidden-Markov-model based statistical parametric speech synthesis for Marathi with optimal number of hidden states

被引:0
|
作者
Suraj Pandurang Patil
Swapnil Laxman Lahudkar
机构
[1] JSPMs Rajarshi Shahu College of Engineering,
[2] JSPM’s Imperial College of Engineering and Research,undefined
来源
International Journal of Speech Technology | 2019年 / 22卷
关键词
Speech Synthesis; Hidden Markov Model; Context-dependent HMM; HMM Toolkit;
D O I
暂无
中图分类号
学科分类号
摘要
Hidden Markov Model and Deep Neural Networks based Statistical Parametric Speech Synthesis systems, gain a significant attention from researchers because of their flexibility in generating speech waveforms in diverse voice qualities as well as in styles. This paper describes HMM-based speech synthesis system (SPSS) for the Marathi language. In proposed synthesis method, speech parameter trajectories used for synthesis are generated from the trained hidden Markov models (HMM). We have recorded our database of 5300 phonetically balanced Marathi sentences to train the context-dependent HMM with five, seven and nine hidden states. The subjective quality measures (MOS and PWP) shows that the HMMs with seven hidden states are capable of giving an adequate quality of synthesized speech as compared to five state and with less time complexity than seven state HMMs. The contextual features used for experimentation are inclusive of a position of an observed phoneme in a respective syllable, word, and sentence.
引用
收藏
页码:93 / 98
页数:5
相关论文
共 50 条
  • [41] Gait Analysis based on a Hidden Markov Model
    Bae, Joonbum
    2012 12TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS), 2012, : 1025 - 1029
  • [42] Gait Identification Based on Hidden Markov Model
    Zhao, XiLing
    Shang, XinHua
    2012 2ND INTERNATIONAL CONFERENCE ON APPLIED ROBOTICS FOR THE POWER INDUSTRY (CARPI), 2012, : 812 - 815
  • [43] Spectral matching based on hidden Markov model
    Fu, Jing
    Shu, Ning
    Kong, Xiangbin
    REMOTE SENSING OF THE ENVIRONMENT: THE 17TH CHINA CONFERENCE ON REMOTE SENSING, 2011, 8203
  • [44] Morphology Analysis for Hidden Markov Model based Indonesian Part-of-Speech Tagger
    Muljono
    Afini, Umriya
    Supriyanto, Catur
    2017 1ST INTERNATIONAL CONFERENCE ON INFORMATICS AND COMPUTATIONAL SCIENCES (ICICOS), 2017, : 237 - 240
  • [45] Identifying heterogeneous diffusion states in the cytoplasm by a hidden Markov model
    Janczura, Joanna
    Balcerek, Michal
    Burnecki, Krzysztof
    Sabri, Adal
    Weiss, Matthias
    Krapf, Diego
    NEW JOURNAL OF PHYSICS, 2021, 23 (05):
  • [46] Hidden Markov Convolutive Mixture Model for Pitch Contour Analysis of Speech
    Yoshizato, Kota
    Kameoka, Hirokazu
    Saito, Daisuke
    Sagayama, Shigeki
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 390 - 393
  • [47] Speech recognition using hybrid hidden Markov model and NN classifier
    Kundu A.
    Bayya A.
    International Journal of Speech Technology, 1998, 2 (3) : 227 - 240
  • [48] FINITE REGISTER LENGTH EFFECTS IN A HIDDEN MARKOV MODEL SPEECH RECOGNIZER
    YANG, WJ
    WANG, HC
    SPEECH COMMUNICATION, 1990, 9 (03) : 239 - 245
  • [49] Speech Recognition for English to Indonesian Translator Using Hidden Markov Model
    Muhammad, Hariz Zakka
    Nasrun, Muhammad
    Setianingsih, Casi
    Murti, Muhammad Ary
    2018 INTERNATIONAL CONFERENCE ON SIGNALS AND SYSTEMS (ICSIGSYS), 2018, : 255 - 260
  • [50] CONTEXTUAL VECTOR QUANTIZATION FOR SPEECH RECOGNITION WITH DISCRETE HIDDEN MARKOV MODEL
    HUO, QA
    CHAN, CK
    PATTERN RECOGNITION, 1995, 28 (04) : 513 - 517