Improving the modeling of the noise part in the Harmonic plus Noise model of speech

Cited by: 18
Authors
Pantazis, Yannis [1 ,2 ]
Stylianou, Yannis [1 ,2 ]
Affiliations
[1] FORTH, Inst Comp Sci, Iraklion, Greece
[2] Univ Crete, Dept Comp Sci, Multimedia Informat Lab, Iraklion, Greece
Source
2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12 | 2008
Keywords
speech synthesis; noise modeling; time envelope; energy distribution;
DOI
10.1109/ICASSP.2008.4518683
Chinese Library Classification
O42 [Acoustics];
Discipline classification codes
070206 ; 082403 ;
Abstract
The Harmonic + Noise model (HNM) is a hybrid model of speech with a harmonic component and a noise component. While the harmonic part efficiently describes the periodicities in speech signals (voiced parts), modeling of the noise part introduces artifacts, primarily because of the specific time-domain characteristics of noise in voiced speech. In this paper, we concentrate on the modeling of noise in voiced frames. To model the temporal characteristics of noise, we study three time envelopes in the context of HNM: the Triangular envelope, the Hilbert envelope, and the Energy envelope. Listening tests showed a clear preference for the Energy envelope and the Hilbert envelope for male voices; to a lesser extent, the same conclusions can be drawn for female voices.
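As a rough illustration of what a time envelope does in this setting, the sketch below computes two of the envelope types named in the abstract for a toy signal and uses one of them to shape white noise. It is not the authors' implementation. The Hilbert envelope is taken as the magnitude of the analytic signal, which is the standard definition. The energy envelope is assumed here to be a moving average of the squared signal, and the window length, the toy signal, and all function names are hypothetical choices. The Triangular envelope is left out because the abstract does not give its parameters.

```python
import numpy as np

def hilbert_envelope(x):
    """Magnitude of the analytic signal, built with the FFT."""
    n = len(x)
    spectrum = np.fft.fft(x)
    # Keep DC (and Nyquist for even n), double positive frequencies,
    # zero negative frequencies.
    weights = np.zeros(n)
    weights[0] = 1.0
    if n % 2 == 0:
        weights[n // 2] = 1.0
        weights[1:n // 2] = 2.0
    else:
        weights[1:(n + 1) // 2] = 2.0
    return np.abs(np.fft.ifft(spectrum * weights))

def energy_envelope(x, half_width):
    """Moving average of the squared signal (assumed definition)."""
    length = 2 * half_width + 1
    window = np.ones(length) / length
    return np.convolve(x ** 2, window, mode="same")

def modulate_noise(envelope, rng):
    """Shape unit-variance white noise with a time envelope."""
    return envelope * rng.standard_normal(len(envelope))

# Toy frame (hypothetical): 2 kHz carrier with 100 Hz amplitude
# modulation, 20 ms at a 16 kHz sampling rate.
fs = 16000
t = np.arange(320) / fs
am = 1.0 + 0.5 * np.cos(2 * np.pi * 100 * t)
x = am * np.cos(2 * np.pi * 2000 * t)

env_h = hilbert_envelope(x)
env_e = energy_envelope(x, half_width=8)
noise = modulate_noise(env_h, np.random.default_rng(0))
```

For this toy frame the Hilbert envelope recovers the slow amplitude modulation `am`, which is the kind of temporal structure a noise envelope is meant to impose on the synthesized noise part.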
Pages: 4609 / +
Page count: 2
Related papers
50 in total
  • [1] Modeling speech based on harmonic plus noise models
    Stylianou, Y
    NONLINEAR SPEECH MODELING AND APPLICATIONS, 2005, 3445 : 244 - 260
  • [2] Speech synthesis method with a harmonic plus noise model
    Ishikawa, Y
    Maruyama, I
    Hase, T
    2002 INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS, DIGEST OF TECHNICAL PAPERS, 2002, : 238 - 239
  • [3] Phase and transient modeling for harmonic plus noise speech coding
    Yu, EWM
    Chan, CF
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1467 - 1470
  • [4] On the implementation of the Harmonic plus Noise Model for concatenative speech synthesis
    Stylianou, Y
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 957 - 960
  • [5] Enhancement of esophagus speech using harmonic plus noise model
    Lehana, PK
    Gupta, RK
    Kumari, S
    TENCON 2004 - 2004 IEEE REGION 10 CONFERENCE, VOLS A-D, PROCEEDINGS: ANALOG AND DIGITAL TECHNIQUES IN ELECTRICAL ENGINEERING, 2004, : A669 - A672
  • [6] Applying the harmonic plus noise model in concatenative speech synthesis
    Stylianou, Y
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (01): : 21 - 29
  • [7] On the harmonic-plus-noise decomposition of speech
    Hu Qi
    Liang Mangui
    2006 8TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-4, 2006, : 736 - +
  • [8] A Long-Term Harmonic plus Noise Model for Speech Signals
    Ben Ali, Faten
    Girin, Laurent
    Larbi, Sonia Djaziri
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 60 - +
  • [9] Emotional speech analysis using harmonic plus noise model and Gaussian mixture model
    Singh, Jang Bahadur
    Lehana, Parveen Kumar
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2019, 22 (03) : 483 - 496