A better decomposition of speech obtained using modified Empirical Mode Decomposition

被引:27
|
作者
Sharma, Rajib [1 ]
Prasanna, S. R. Mahadeva [1 ]
机构
[1] Indian Inst Technol Guwahati, Dept Elect & Elect Engn, Gauhati 781039, India
关键词
EMD; IPs; Mode mixing; Dyadic filterbank; LP; Formants; NOISE; SEPARATION; EXTRACTION; FREQUENCY; DATABASE; DESIGN;
D O I
10.1016/j.dsp.2016.07.012
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The objective of this work is to obtain meaningful time domain components, or Intrinsic Mode Functions (IMFs), of the speech signal, using Empirical Mode Decomposition (EMD), with reduced mode mixing, and in a time-efficient manner. This work focuses on two aspects - firstly, extracting IMFs of the speech signal which can better reflect its higher frequency spectrum; and secondly, to get a better representation and distribution of the vocal tract resonances of the speech signal in its IMFs, compared to that obtained from standard EMD. To this effect, modifications are proposed to the EMD algorithm for processing speech signals, based on the critical nature of the interpolation points (IPs) used for cubic spline interpolation in EMD. The effect of using different sets of IPs, other than the extrema of the residue - as used in standard EMD - is analyzed. It is found that having more IPs is beneficial only upto a certain limit, after which the characteristic dyadic filterbank nature of EMD breaks down. For certain sets of IPs, these modified EMD processes perform better than EMD, giving better frequency separability between the IMFs, and an enhanced representation of the higher frequency content of the signal. A detailed study of the distribution of the formants, in the IMFs of the speech signal, is done using Linear Prediction (LP) analysis of the IMFs. It is found that the IMFs of the EMD variants have a far better distribution of the formants structure within them, with reduced overlapping amongst their filter spectrums, compared to that of standard EMD. Henceforth, when subjected to the task of formants estimation of voiced speech, using LP analysis, the IMFs of the modified EMD processes cumulatively exhibit a superior performance than that of standard EMD, or the speech signal itself, under both dean and noisy conditions. (C) 2016 Elsevier Inc. All rights reserved.
引用
收藏
页码:26 / 39
页数:14
相关论文
共 50 条
  • [1] Empirical Mode Decomposition for adaptive AM-FM analysis of Speech: A Review
    Sharma, Rajib
    Vignolo, Leandro
    Schlotthauer, Gaston
    Colominas, M. A.
    Rufiner, H. Leonardo
    Prasanna, S. R. M.
    SPEECH COMMUNICATION, 2017, 88 : 39 - 64
  • [2] Speech vs Music Discrimination using Empirical Mode Decomposition
    Khonglah, Banriskhem K.
    Sharma, Rajib
    Prasanna, S. R. Mahadeva
    2015 TWENTY FIRST NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2015,
  • [3] Image decomposition based on modified Bidimensional Empirical Mode Decomposition
    Ben Arfia, Faten
    Ben Messaoud, Mohamed
    Abid, Mohamed
    THIRD INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2011), 2011, 8009
  • [4] Dysfluent Speech Classification Using Variational Mode Decomposition and Complete Ensemble Empirical Mode Decomposition Techniques With NGCU-Based RNN
    Vinay, N. A.
    Vidyasagar, K. N.
    Rohith, S.
    Supreeth, S.
    Prasad, S. N.
    Kumar, S. Pramod
    Bharathi, S. H.
    IEEE ACCESS, 2024, 12 : 174934 - 174953
  • [5] Characterizing Glottal Activity from Speech using Empirical Mode Decomposition
    Sharma, Rajib
    Prasanna, S. R. Mahadeva
    2015 TWENTY FIRST NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2015,
  • [6] Speech enhancement using empirical mode decomposition and the Teager-Kaiser energy operator
    Khaldi, Kais
    Boudraa, Abdel-Ouahab
    Komaty, Ali
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2014, 135 (01): : 451 - 459
  • [7] Empirical Mode Decomposition using the Second Derivative
    Park, Min-Su
    Kim, Donghoh
    Oh, Hee-Seok
    KOREAN JOURNAL OF APPLIED STATISTICS, 2013, 26 (02) : 335 - 347
  • [8] Using Empirical Mode Decomposition for Ground Filtering
    Ozcan, Abdullah H.
    Unsalan, Cem
    2015 7TH INTERNATIONAL CONFERENCE ON RECENT ADVANCES IN SPACE TECHNOLOGIES (RAST), 2015, : 317 - 321
  • [9] The inner structure of empirical mode decomposition
    Wang, Yung-Hung
    Young, Hsu-Wen Vincent
    Lo, Men-Tzung
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2016, 462 : 1003 - 1017
  • [10] Detection of the Glottal Closure Instants Using Empirical Mode Decomposition
    Sharma, Rajib
    Prasanna, S. R. M.
    Leonardo Rufiner, Hugo
    Schlotthauer, Gaston
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2018, 37 (08) : 3412 - 3440