Assessment of Disordered Voices Using Empirical Mode Decomposition in the Log-Spectral Domain

Times cited: 0
Authors
Kacha, A. [1]
Grenez, F. [1]
Schoentgen, J. [1]
Affiliations
[1] Univ Jijel, Lab Phys Rayonnement & Applicat, Jijel, Algeria
Source
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3 | 2012
Keywords
Disordered voices; empirical mode decomposition; harmonic-to-noise ratio
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The empirical mode decomposition (EMD) algorithm is proposed as an alternative method for decomposing the log-magnitude spectrum of the speech signal into its harmonic, envelope and noise components. The harmonic-to-noise ratio is then used to summarize the degree of disturbance in the speech signal. EMD is a tool for the analysis of multi-component signals. Unlike conventional analysis methods (e.g. the Fourier transform and the wavelet transform), it does not require basis functions that are fixed a priori. The proposed method is tested on synthetic vowels and natural speech. The corpus of synthetic vowels comprises 48 synthetic [a] stimuli that combine three values of vocal frequency, four levels of jitter frequency and four levels of additive noise (3 × 4 × 4 = 48). The corpora of natural speech comprise a concatenation of the vowel [a] with two Dutch sentences produced by 28 normophonic speakers and 223 speakers with different degrees of dysphonia.
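To make the idea of a basis-free decomposition concrete, the following is a minimal sketch of EMD sifting applied to a toy log-magnitude spectrum. It is not the authors' implementation. The toy spectrum (slow envelope plus harmonic ripple plus noise), the piecewise-linear envelopes (standard EMD usually uses cubic splines), the fixed number of sifting passes, and all function names are assumptions made for illustration. The abstract does not say how the resulting modes are grouped into harmonic, envelope and noise components, nor how the harmonic-to-noise ratio is computed, so neither step is shown.

```python
import math
import random


def local_extrema(x):
    """Return indices of interior local maxima and minima of the sequence x."""
    maxima, minima = [], []
    for i in range(1, len(x) - 1):
        if x[i - 1] < x[i] >= x[i + 1]:
            maxima.append(i)
        elif x[i - 1] > x[i] <= x[i + 1]:
            minima.append(i)
    return maxima, minima


def envelope(x, idx):
    """Piecewise-linear envelope through x at idx (assumption: linear, not
    cubic-spline, interpolation; first/last extremum value held at the ends)."""
    knots = [0] + idx + [len(x) - 1]
    vals = [x[idx[0]]] + [x[i] for i in idx] + [x[idx[-1]]]
    env, k = [], 0
    for n in range(len(x)):
        while k < len(knots) - 2 and n > knots[k + 1]:
            k += 1
        n0, n1 = knots[k], knots[k + 1]
        t = (n - n0) / (n1 - n0)
        env.append(vals[k] + t * (vals[k + 1] - vals[k]))
    return env


def sift(x, n_sift=10):
    """Extract one mode by repeatedly subtracting the mean of the envelopes.
    A fixed number of passes is an assumption; other stopping rules exist."""
    h = list(x)
    for _ in range(n_sift):
        maxima, minima = local_extrema(h)
        if len(maxima) < 2 or len(minima) < 2:
            break
        upper = envelope(h, maxima)
        lower = envelope(h, minima)
        h = [v - 0.5 * (u + l) for v, u, l in zip(h, upper, lower)]
    return h


def emd(x, max_imfs=6):
    """Decompose x into a list of modes (fastest first) and a residue."""
    residue, imfs = list(x), []
    for _ in range(max_imfs):
        maxima, minima = local_extrema(residue)
        if len(maxima) < 2 or len(minima) < 2:
            break
        imf = sift(residue)
        imfs.append(imf)
        residue = [r - c for r, c in zip(residue, imf)]
    return imfs, residue


# Toy log-magnitude spectrum over 512 frequency bins (hypothetical values).
rng = random.Random(0)
n_bins = 512
log_spec = [
    2.0 * math.cos(2 * math.pi * k / n_bins)   # slow spectral envelope
    + 1.0 * math.cos(2 * math.pi * k / 16)     # harmonic ripple, 16-bin period
    + 0.2 * rng.uniform(-1.0, 1.0)             # additive noise
    for k in range(n_bins)
]

imfs, residue = emd(log_spec)
print("number of modes extracted:", len(imfs))
```

By construction, the extracted modes and the residue sum back to the input, so no information is lost by the decomposition. Which modes carry the noise, the harmonic ripple and the envelope depends on the signal and has to be decided by a separate criterion.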
Pages: 66-69
Number of pages: 4
Related papers
50 in total
  • [1] Multiband vocal dysperiodicities analysis using empirical mode decomposition in the log-spectral domain
    Kacha, Abdellah
    Grenez, Francis
    Schoentgen, Jean
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2015, 17 : 11 - 20
  • [2] Bivariate Empirical Mode Decomposition of Speech Signals for Disordered Voices Assessment
    Boubekiria, Kawther
    Kacha, Abdellah
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2025,
  • [3] Empirical Mode Decomposition-Based Spectral Acoustic Cues for Disordered Voices Analysis
    Kacha, Abdellah
    Grenez, Francis
    Schoentgen, Jean
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3599 - 3603
  • [4] NOISE ESTIMATION USING A CONSTRAINED SEQUENTIAL HMM IN LOG-SPECTRAL DOMAIN
    Ying, Dongwen
    Lu, Xugang
    Li, Junfeng
    Yan, Yonghong
    Dang, Jianwu
    Soong, Frank
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4553 - 4556
  • [5] Robust Speech Recognition Using MLP Neural Network in Log-Spectral Domain
    Ghaemmaghami, Masoumeh P.
    Sameti, Hossein
    Razzazi, Farbod
    BabaAli, Bagher
    Dabbaghchian, Saeed
    2009 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT 2009), 2009, : 467 - +
  • [6] Accurate compensation in the log-spectral domain for noisy speech recognition
    Afify, M
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (03) : 388 - 398
  • [7] Noise Estimation Using a Constrained Sequential Hidden Markov Model in the Log-Spectral Domain
    Ying, Dongwen
    Yan, Yonghong
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (06) : 1145 - 1157
  • [8] Empirical mode decomposition. Spectral properties in normal and pathological voices
    Torres, M. E.
    Schlotthauer, G.
    Rufiner, H. L.
    Jackson-Menaldi, M. C.
    4TH EUROPEAN CONFERENCE OF THE INTERNATIONAL FEDERATION FOR MEDICAL AND BIOLOGICAL ENGINEERING, 2009, 22 (1-3) : 252 - 255
  • [9] MODULATION-DOMAIN SPEECH ENHANCEMENT USING A KALMAN FILTER WITH A BAYESIAN UPDATE OF SPEECH AND NOISE IN THE LOG-SPECTRAL DOMAIN
    Dionelis, Nikolaos
    Brookes, Mike
    2017 HANDS-FREE SPEECH COMMUNICATIONS AND MICROPHONE ARRAYS (HSCMA 2017), 2017, : 111 - 115
  • [10] Log-Spectral Amplitude and Spectral Polarity Estimation in Short-Time Discrete Cosine Transform Domain
    Shi, Sisi
    Paliwal, Kuldip K.
    Busch, Andrew
    IEEE ACCESS, 2023, 11 : 34456 - 34475