Improving the modeling of the noise part in the Harmonic plus Noise model of speech

被引:18
|
作者
Pantazis, Yannis [1 ,2 ]
Stylianou, Yannis [1 ,2 ]
机构
[1] FORTH, Inst Comp Sci, Iraklion, Greece
[2] Univ Crete, Dept Comp Sci, Multimedia Informat Lab, Iraklion, Greece
来源
2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12 | 2008年
关键词
speech synthesis; noise modeling; time envelope; energy distribution;
D O I
10.1109/ICASSP.2008.4518683
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Harmonic + Noise model (HNM) is a hybrid model of speech with a harmonic component and a noise component. While the harmonic part describes efficiently the periodicities in speech signals (voiced parts), modeling of the noise part introduces artifacts primarily because of the specific time-domain characteristics of noise in voiced speech. In this paper, we concentrated on the modeling of noise in voiced frames. To model the temporal characteristics of noise, we study three time envelopes in the context of HNM; Triangular envelope, Hilbert envelope and Energy envelope. Listening tests showed a clear preference for the Energy envelope and Hilbert envelope for male voices and to a lesser extent the same conclusions can be drawn for female voices.
引用
收藏
页码:4609 / +
页数:2
相关论文
共 50 条
  • [22] Compression of a Slovak Speech Database Using Harmonic, Noise and Transient Model
    Nagy, Martin Turi
    Rozinaj, Gregor
    PROCEEDINGS ELMAR-2010, 2010, : 363 - 366
  • [23] A Deep Neural Network Based Harmonic Noise Model for Speech Enhancement
    Ouyang, Zhiheng
    Yu, Hongjiang
    Zhu, Wei-Ping
    Champagne, Benoit
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3224 - 3228
  • [24] DETECTING PATHOLOGICAL SPEECH USING CONTOUR MODELING OF HARMONIC-TO-NOISE RATIO
    Lee, Jung-Won
    Kim, Samuel
    Kang, Hong-Goo
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [25] Synthesis of Dog's Calls using Harmonic plus Noise model
    Gupta, Rakesh K.
    Kumari, Santoresh
    Lehana, Parveen K.
    WMSCI 2005: 9th World Multi-Conference on Systemics, Cybernetics and Informatics, Vol 5, 2005, : 233 - 236
  • [26] Improved Harmonic plus Noise Model for vocal and musical instrument sounds
    Dubnov, S
    VIRTUAL, SYNTHETIC, AND ENTERTAINMENT AUDIO, 2002, : 233 - 238
  • [27] Interactive Speech and Noise Modeling for Speech Enhancement
    Zheng, Chengyu
    Peng, Xiulian
    Zhang, Yuan
    Srinivasan, Sriram
    Lu, Yan
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 14549 - 14557
  • [28] Harmonic to noise ratio improvement in oesophageal speech
    Oleagordia-Ruiz, Ibon
    Garcia-Zapirain, Begonya
    TECHNOLOGY AND HEALTH CARE, 2015, 23 (03) : 359 - 368
  • [29] The harmonic and noise information of the glottal pulses in speech
    Sousa, Ricardo
    Ferreira, Anibal
    Alku, Paavo
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2014, 10 : 137 - 143
  • [30] SPEECH ENHANCEMENT IN CAR NOISE ENVIRONMENT BASED ON AN ANALYSIS-SYNTHESIS APPROACH USING HARMONIC NOISE MODEL
    Chen, R. F.
    Chan, C. F.
    So, H. C.
    Lee, Jonathan S. C.
    Leung, C. Y.
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4413 - +