Improving the modeling of the noise part in the Harmonic plus Noise model of speech

被引:18
|
作者
Pantazis, Yannis [1 ,2 ]
Stylianou, Yannis [1 ,2 ]
机构
[1] FORTH, Inst Comp Sci, Iraklion, Greece
[2] Univ Crete, Dept Comp Sci, Multimedia Informat Lab, Iraklion, Greece
来源
2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12 | 2008年
关键词
speech synthesis; noise modeling; time envelope; energy distribution;
D O I
10.1109/ICASSP.2008.4518683
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Harmonic + Noise model (HNM) is a hybrid model of speech with a harmonic component and a noise component. While the harmonic part describes efficiently the periodicities in speech signals (voiced parts), modeling of the noise part introduces artifacts primarily because of the specific time-domain characteristics of noise in voiced speech. In this paper, we concentrated on the modeling of noise in voiced frames. To model the temporal characteristics of noise, we study three time envelopes in the context of HNM; Triangular envelope, Hilbert envelope and Energy envelope. Listening tests showed a clear preference for the Energy envelope and Hilbert envelope for male voices and to a lesser extent the same conclusions can be drawn for female voices.
引用
收藏
页码:4609 / +
页数:2
相关论文
共 50 条
  • [41] Speech Perception in Noise with a Harmonic Complex Excited Vocoder
    Churchill, Tyler H.
    Kan, Alan
    Goupell, Matthew J.
    Ihlefeld, Antje
    Litovsky, Ruth Y.
    JARO-JOURNAL OF THE ASSOCIATION FOR RESEARCH IN OTOLARYNGOLOGY, 2014, 15 (02): : 265 - 278
  • [42] Improving performance of spectral subtraction in speech recognition using a model for additive noise
    Yoma, NB
    McInnes, FR
    Jack, MA
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1998, 6 (06): : 579 - 582
  • [43] Time-domain deterministic plus noise model based hybrid source modeling for statistical parametric speech synthesis
    Narendra, N. P.
    Rao, K. Sreenivasa
    SPEECH COMMUNICATION, 2016, 77 : 65 - 83
  • [44] WEIGHTED CODEBOOK MAPPING FOR NOISY SPEECH ENHANCEMENT USING HARMONIC-NOISE MODEL
    Zavarehei, Esfandiar
    Vaseghi, Saeed
    Yan, Qin
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 253 - 256
  • [45] Efficient bandwidth extension of musical signals using a differentiable harmonic plus noise model
    Pierre-Amaury Grumiaux
    Mathieu Lagrange
    EURASIP Journal on Audio, Speech, and Music Processing, 2023
  • [46] Efficient bandwidth extension of musical signals using a differentiable harmonic plus noise model
    Grumiaux, Pierre-Amaury
    Lagrange, Mathieu
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2023, 2023 (01)
  • [47] A VERY LOW BIT RATE CODEC FOR WIDE BAND SPEECH BASED ON A LONG-TERM PERCEPTUAL HARMONIC PLUS NOISE MODEL
    Ben Ali, Faten
    Djaziri-Larbi, Sonia
    2016 INTERNATIONAL SYMPOSIUM ON SIGNAL, IMAGE, VIDEO AND COMMUNICATIONS (ISIVC), 2016, : 71 - 76
  • [48] A Voice Conversion System Based on the Harmonic plus Noise Excitation and Gaussian Mixture Model
    Wu Lifang
    Zhang Linghua
    PROCEEDINGS OF THE 2012 SECOND INTERNATIONAL CONFERENCE ON INSTRUMENTATION & MEASUREMENT, COMPUTER, COMMUNICATION AND CONTROL (IMCCC 2012), 2012, : 1575 - 1578
  • [49] A glimpsing model of speech perception in noise
    Cooke, Martin
    Journal of the Acoustical Society of America, 2006, 119 (03): : 1562 - 1573
  • [50] A glimpsing model of speech perception in noise
    Cooke, M
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2006, 119 (03): : 1562 - 1573