Improving the modeling of the noise part in the Harmonic plus Noise model of speech

Cited by: 18

Authors:

Pantazis, Yannis [1,2]

Stylianou, Yannis [1,2]

Affiliations:

[1] FORTH, Inst Comp Sci, Iraklion, Greece

[2] Univ Crete, Dept Comp Sci, Multimedia Informat Lab, Iraklion, Greece

Source:

2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12 | 2008

Keywords:

speech synthesis; noise modeling; time envelope; energy distribution;

DOI:

10.1109/ICASSP.2008.4518683

Chinese Library Classification:

O42 [Acoustics];

Subject classification codes:

070206 ; 082403 ;

Abstract:

The Harmonic + Noise Model (HNM) is a hybrid model of speech with a harmonic component and a noise component. While the harmonic part efficiently describes the periodicities in speech signals (voiced parts), modeling of the noise part introduces artifacts, primarily because of the specific time-domain characteristics of noise in voiced speech. In this paper, we concentrate on the modeling of noise in voiced frames. To model the temporal characteristics of noise, we study three time envelopes in the context of HNM: the triangular envelope, the Hilbert envelope, and the energy envelope. Listening tests showed a clear preference for the energy envelope and the Hilbert envelope for male voices; to a lesser extent, the same conclusions can be drawn for female voices.
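The three time envelopes named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the `peak` and `half_width` parameters, the moving-average form of the energy envelope, and the peak normalization in `modulate_noise` are all assumptions made for illustration, and NumPy is assumed to be available.

```python
import numpy as np


def triangular_envelope(n, peak=0.5):
    """Piecewise-linear envelope: 0 at the frame edges, 1 at `peak`.

    `peak` is a hypothetical parameter giving the apex position as a
    fraction of the frame (0 < peak < 1); n must be at least 2.
    """
    t = np.arange(n) / (n - 1)
    rise = t / peak
    fall = (1.0 - t) / (1.0 - peak)
    return np.minimum(rise, fall)


def hilbert_envelope(x):
    """Magnitude of the analytic signal, built with the FFT."""
    n = len(x)
    spectrum = np.fft.fft(x)
    # Keep DC (and Nyquist for even n), double positive frequencies,
    # zero out negative frequencies.
    weights = np.zeros(n)
    weights[0] = 1.0
    if n % 2 == 0:
        weights[n // 2] = 1.0
        weights[1:n // 2] = 2.0
    else:
        weights[1:(n + 1) // 2] = 2.0
    return np.abs(np.fft.ifft(spectrum * weights))


def energy_envelope(x, half_width=3):
    """Moving average of squared samples over 2*half_width + 1 points.

    The window length is an assumed illustration value.
    """
    size = 2 * half_width + 1
    kernel = np.ones(size) / size
    return np.convolve(np.asarray(x, dtype=float) ** 2, kernel, mode="same")


def modulate_noise(noise, envelope):
    """Shape a noise frame with an envelope scaled to a peak of 1 (assumed)."""
    return noise * envelope / np.max(envelope)


# Example: shape one frame of white noise with each envelope.
rng = np.random.default_rng(0)
frame_len = 64
residual = rng.standard_normal(frame_len)  # stand-in for a noise residual
white = rng.standard_normal(frame_len)

shaped = {
    "triangular": modulate_noise(white, triangular_envelope(frame_len)),
    "hilbert": modulate_noise(white, hilbert_envelope(residual)),
    "energy": modulate_noise(white, energy_envelope(residual)),
}
```

In this sketch the triangular envelope depends only on the frame length, whereas the Hilbert and energy envelopes are estimated from a residual signal, so they follow the actual energy distribution of the frame.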

Citation:

Pages: 4609 / +

Number of pages: 2

50 items in total

[1] Modeling speech based on harmonic plus noise models
Stylianou, Y
NONLINEAR SPEECH MODELING AND APPLICATIONS, 2005, 3445 : 244 - 260
[2] Speech synthesis method with a harmonic plus noise model
Ishikawa, Y
Maruyama, I
Hase, T
2002 INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS, DIGEST OF TECHNICAL PAPERS, 2002, : 238 - 239
[3] Phase and transient modeling for harmonic plus noise speech coding
Yu, EWM
Chan, CF
2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1467 - 1470
[4] On the implementation of the Harmonic plus Noise Model for concatenative speech synthesis
Stylianou, Y
2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 957 - 960
[5] Enhancement of esophagus speech using harmonic plus noise model
Lehana, PK
Gupta, RK
Kumari, S
TENCON 2004 - 2004 IEEE REGION 10 CONFERENCE, VOLS A-D, PROCEEDINGS: ANALOG AND DIGITAL TECHNIQUES IN ELECTRICAL ENGINEERING, 2004, : A669 - A672
[6] Applying the harmonic plus noise model in concatenative speech synthesis
Stylianou, Y
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (01): : 21 - 29
[7] On the harmonic-plus-noise decomposition of speech
Hu Qi
Liang Mangui
2006 8TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-4, 2006, : 736 - +
[8] A Long-Term Harmonic plus Noise Model for Speech Signals
Ben Ali, Faten
Girin, Laurent
Larbi, Sonia Djaziri
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 60 - +
[9] Emotional speech analysis using harmonic plus noise model and Gaussian mixture model
Singh, Jang Bahadur
Lehana, Parveen Kumar
INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2019, 22 (03) : 483 - 496