Assessment of Disordered Voices Using Empirical Mode Decomposition in the Log-Spectral Domain

被引：0

作者：

Kacha, A. ^{[1
]}

Grenez, F. ^{[1
]}

Schoentgen, J. ^{[1
]}

机构：

[1] Univ Jijel, Lab Phys Rayonnement & Applicat, Jijel, Algeria

来源：

13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3 | 2012年

关键词：

Disordered voices; empirical mode decomposition; harmonic-to-noise ratio;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Empirical mode decomposition (EMD) algorithm is proposed as an alternative to decompose the log of the magnitude spectrum of the speech signal into its harmonic, envelope and noise components and the harmonic-to-noise ratio is used to summarize the degree of disturbance in the speech signal. The empirical mode decomposition algorithm is a tool for the analysis of multi-component signals. The analysis method does not require a priori fixed basis function like conventional analysis methods (e.g. Fourier transform and wavelet transform). The proposed method is tested on synthetic vowels and natural speech. The corpus of synthetic vowels comprises 48 stimuli of synthetic sounds [a] that combine three values of vocal frequency, four levels of jitter frequency and four levels of additive noise. The corpora of natural speech comprise a concatenation of the vowel [a] with two Dutch sentences produced by 28 normophonic and 223 speakers with different degrees of dysphonia.

引用

页码：66 / 69

页数：4

共 50 条

[1] Multiband vocal dysperiodicities analysis using empirical mode decomposition in the log-spectral domain
Kacha, Abdellah
Grenez, Francis
Schoentgen, Jean
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2015, 17 : 11 - 20
[2] Bivariate Empirical Mode Decomposition of Speech Signals for Disordered Voices Assessment
Boubekiria, Kawther
Kacha, Abdellah
CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2025,
[3] Empirical Mode Decomposition-Based Spectral Acoustic Cues for Disordered Voices Analysis
Kacha, Abdellah
Grenez, Francis
Schoentgen, Jean
14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3599 - 3603
[4] NOISE ESTIMATION USING A CONSTRAINED SEQUENTIAL HMM IN LOG-SPECTRAL DOMAIN
Ying, Dongwen
Lu, Xugang
Li, Junfeng
Yan, Yonghong
Dang, Jianwu
Soong, Frank
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4553 - 4556
[5] Robust Speech Recognition Using MLP Neural Network in Log-Spectral Domain
Ghaemmaghami, Masoumeh P.
Sameti, Hossein
Razzazi, Farbod
BabaAli, Bagher
Dabbaghchian, Saeed
2009 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT 2009), 2009, : 467 - +
[6] Accurate compensation in the log-spectral domain for noisy speech recognition
Afify, M
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (03): : 388 - 398
[7] Noise Estimation Using a Constrained Sequential Hidden Markov Model in the Log-Spectral Domain
Ying, Dongwen
Yan, Yonghong
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (06): : 1145 - 1157
[8] Empirical mode decomposition. Spectral properties in normal and pathological voices
Torres, M. E.
Schlotthauer, G.
Rufiner, H. L.
Jackson-Menaldi, M. C.
4TH EUROPEAN CONFERENCE OF THE INTERNATIONAL FEDERATION FOR MEDICAL AND BIOLOGICAL ENGINEERING, 2009, 22 (1-3): : 252 - 255
[9] MODULATION-DOMAIN SPEECH ENHANCEMENT USING A KALMAN FILTER WITH A BAYESIAN UPDATE OF SPEECH AND NOISE IN THE LOG-SPECTRAL DOMAIN
Dionelis, Nikolaos
Brookes, Mike
2017 HANDS-FREE SPEECH COMMUNICATIONS AND MICROPHONE ARRAYS (HSCMA 2017), 2017, : 111 - 115
[10] Log-Spectral Amplitude and Spectral Polarity Estimation in Short-Time Discrete Cosine Transform Domain
Shi, Sisi
Paliwal, Kuldip K.
Busch, Andrew
IEEE ACCESS, 2023, 11 : 34456 - 34475

← 1 2 3 4 5 →