Improving the modeling of the noise part in the Harmonic plus Noise model of speech

被引：18

作者：

Pantazis, Yannis ^{[1
,2
]}

Stylianou, Yannis ^{[1
,2
]}

机构：

[1] FORTH, Inst Comp Sci, Iraklion, Greece

[2] Univ Crete, Dept Comp Sci, Multimedia Informat Lab, Iraklion, Greece

来源：

2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12 | 2008年

关键词：

speech synthesis; noise modeling; time envelope; energy distribution;

D O I：

10.1109/ICASSP.2008.4518683

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Harmonic + Noise model (HNM) is a hybrid model of speech with a harmonic component and a noise component. While the harmonic part describes efficiently the periodicities in speech signals (voiced parts), modeling of the noise part introduces artifacts primarily because of the specific time-domain characteristics of noise in voiced speech. In this paper, we concentrated on the modeling of noise in voiced frames. To model the temporal characteristics of noise, we study three time envelopes in the context of HNM; Triangular envelope, Hilbert envelope and Energy envelope. Listening tests showed a clear preference for the Energy envelope and Hilbert envelope for male voices and to a lesser extent the same conclusions can be drawn for female voices.

引用

页码：4609 / +

页数：2

共 50 条

[21] Low bit-rate speech codec based on a long-term harmonic plus noise model
1600, Audio Engineering Society (64):
[22] Compression of a Slovak Speech Database Using Harmonic, Noise and Transient Model
Nagy, Martin Turi
Rozinaj, Gregor
PROCEEDINGS ELMAR-2010, 2010, : 363 - 366
[23] A Deep Neural Network Based Harmonic Noise Model for Speech Enhancement
Ouyang, Zhiheng
Yu, Hongjiang
Zhu, Wei-Ping
Champagne, Benoit
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3224 - 3228
[24] DETECTING PATHOLOGICAL SPEECH USING CONTOUR MODELING OF HARMONIC-TO-NOISE RATIO
Lee, Jung-Won
Kim, Samuel
Kang, Hong-Goo
2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
[25] Synthesis of Dog's Calls using Harmonic plus Noise model
Gupta, Rakesh K.
Kumari, Santoresh
Lehana, Parveen K.
WMSCI 2005: 9th World Multi-Conference on Systemics, Cybernetics and Informatics, Vol 5, 2005, : 233 - 236
[26] Improved Harmonic plus Noise Model for vocal and musical instrument sounds
Dubnov, S
VIRTUAL, SYNTHETIC, AND ENTERTAINMENT AUDIO, 2002, : 233 - 238
[27] Interactive Speech and Noise Modeling for Speech Enhancement
Zheng, Chengyu
Peng, Xiulian
Zhang, Yuan
Srinivasan, Sriram
Lu, Yan
THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 14549 - 14557
[28] Harmonic to noise ratio improvement in oesophageal speech
Oleagordia-Ruiz, Ibon
Garcia-Zapirain, Begonya
TECHNOLOGY AND HEALTH CARE, 2015, 23 (03) : 359 - 368
[29] The harmonic and noise information of the glottal pulses in speech
Sousa, Ricardo
Ferreira, Anibal
Alku, Paavo
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2014, 10 : 137 - 143
[30] SPEECH ENHANCEMENT IN CAR NOISE ENVIRONMENT BASED ON AN ANALYSIS-SYNTHESIS APPROACH USING HARMONIC NOISE MODEL
Chen, R. F.
Chan, C. F.
So, H. C.
Lee, Jonathan S. C.
Leung, C. Y.
2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4413 - +

← 1 2 3 4 5 →