Improving the modeling of the noise part in the Harmonic plus Noise model of speech

Cited by: 18

Authors:

Pantazis, Yannis [1,2]

Stylianou, Yannis [1,2]

Affiliations:

[1] FORTH, Inst Comp Sci, Iraklion, Greece

[2] Univ Crete, Dept Comp Sci, Multimedia Informat Lab, Iraklion, Greece

Source:

2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12 | 2008

Keywords:

speech synthesis; noise modeling; time envelope; energy distribution

DOI:

10.1109/ICASSP.2008.4518683

Chinese Library Classification (CLC) number:

O42 [Acoustics]

Discipline classification codes:

070206; 082403

Abstract:

The Harmonic + Noise model (HNM) is a hybrid model of speech with a harmonic component and a noise component. While the harmonic part efficiently describes the periodicities in speech signals (voiced parts), modeling of the noise part introduces artifacts, primarily because of the specific time-domain characteristics of noise in voiced speech. In this paper, we concentrate on the modeling of noise in voiced frames. To model the temporal characteristics of noise, we study three time envelopes in the context of HNM: the Triangular envelope, the Hilbert envelope, and the Energy envelope. Listening tests showed a clear preference for the Energy envelope and the Hilbert envelope for male voices; to a lesser extent, the same conclusions can be drawn for female voices.
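The abstract names the three time envelopes but gives no formulas, so the following is only an illustrative sketch of what such envelopes might look like, not the authors' definitions. The function names, the `peak` and `floor` parameters of the triangular shape, the moving-average window `half_width`, and the normalization in `shape_noise` are all assumptions made for this example. The Hilbert envelope is computed as the magnitude of the analytic signal using a direct DFT, which is slow but keeps the sketch dependency-free.

```python
import cmath
import math


def triangular_envelope(n, peak, floor=0.2):
    """Piecewise-linear shape: `floor` at both ends, 1.0 at index `peak`.

    Hypothetical parametrization; requires 0 < peak < n - 1.
    """
    env = []
    for t in range(n):
        if t <= peak:
            frac = t / peak
        else:
            frac = (n - 1 - t) / (n - 1 - peak)
        env.append(floor + (1.0 - floor) * frac)
    return env


def hilbert_envelope(x):
    """Magnitude of the analytic signal of x, via a direct O(N^2) DFT."""
    n = len(x)
    spec = [sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
            for k in range(n)]
    for k in range(n):
        if k == 0 or (n % 2 == 0 and k == n // 2):
            gain = 1.0   # keep DC (and Nyquist when n is even)
        elif k < (n + 1) // 2:
            gain = 2.0   # double the positive frequencies
        else:
            gain = 0.0   # discard the negative frequencies
        spec[k] *= gain
    return [abs(sum(spec[k] * cmath.exp(2j * math.pi * k * t / n)
                    for k in range(n)) / n)
            for t in range(n)]


def energy_envelope(x, half_width):
    """Moving average of x[t]**2 over up to 2*half_width + 1 samples.

    The window is truncated at the signal edges (an assumption).
    """
    n = len(x)
    env = []
    for t in range(n):
        lo, hi = max(0, t - half_width), min(n, t + half_width + 1)
        env.append(sum(v * v for v in x[lo:hi]) / (hi - lo))
    return env


def shape_noise(noise, envelope):
    """Scale each noise sample by the envelope, normalized to a peak of 1."""
    peak = max(envelope) or 1.0
    return [v * e / peak for v, e in zip(noise, envelope)]
```

In use, one of the envelope functions would be evaluated on a voiced frame (or its noise residual) and the result passed to `shape_noise` together with a synthetic noise frame of the same length, so that the noise energy follows the chosen time-domain shape.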


Pages: 4609 / +

Page count: 2
