Characterizing Glottal Activity from Speech using Empirical Mode Decomposition

被引:0
|
作者
Sharma, Rajib [1 ]
Prasanna, S. R. Mahadeva [1 ]
机构
[1] Indian Inst Technol Guwahati, Dept Elect & Elect Engn, Gauhati 781039, India
来源
2015 TWENTY FIRST NATIONAL CONFERENCE ON COMMUNICATIONS (NCC) | 2015年
关键词
Glottal activity; EMD; IMF; GIMF; GAD; pitch;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Glottal activity is an important aspect of speech production that results in voiced speech, and localizing such regions for computing various parameters of the excitation source is useful in many speech processing applications. The aim of this paper is to investigate the ability of Empirical Mode Decomposition (EMD) and its noise assisted variants, in characterizing glottal activity from the speech signal. A pair of consecutive Intrinsic Mode Functions (IMFs), obtained from the decomposition is found to reflect the periodic nature of different voiced regions of the speech signal. This IMF pair is utilized to construct a signal, named the Glottal Intrinsic Mode Function (GIMF), which represents most of the voiced speech regions. To measure the capability of the GIMF in representing the glottal activity, it is applied to the tasks of Glottal Activity Detection (GAD), pitch frequency (F-0) tracking and detecting pitch markers. The results ascertain the capability of EMD in localizing Glottal activity within a small subset of IMFs, and suggest the possibility of accurately extracting source-information from voiced speech with simple signal processing procedures.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Detection of the Glottal Closure Instants Using Empirical Mode Decomposition
    Sharma, Rajib
    Prasanna, S. R. M.
    Leonardo Rufiner, Hugo
    Schlotthauer, Gaston
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2018, 37 (08) : 3412 - 3440
  • [2] Speech vs Music Discrimination using Empirical Mode Decomposition
    Khonglah, Banriskhem K.
    Sharma, Rajib
    Prasanna, S. R. Mahadeva
    2015 TWENTY FIRST NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2015,
  • [3] A better decomposition of speech obtained using modified Empirical Mode Decomposition
    Sharma, Rajib
    Prasanna, S. R. Mahadeva
    DIGITAL SIGNAL PROCESSING, 2016, 58 : 26 - 39
  • [4] Empirical Mode Decomposition: A way for finding Pitch (Stuttered speech signal)
    Raju, N.
    Neelamegam, P.
    RESEARCH JOURNAL OF PHARMACEUTICAL BIOLOGICAL AND CHEMICAL SCIENCES, 2016, 7 (06): : 1030 - 1036
  • [5] Adaptive Compressive Sensing of Speech Signals based on Empirical Mode Decomposition
    Wang, Shi-kui
    Shao, Yu-feng
    2014 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATION AND SENSOR NETWORK (WCSN), 2014, : 444 - 447
  • [6] An Adaptive Speech Enhancement Approach Based on DCT and Empirical Mode Decomposition
    Rao, S. Nageswara
    Sankar, K. Jaya
    Naidu, C. D.
    2016 INTERNATIONAL CONFERENCE ON COMMUNICATION AND SIGNAL PROCESSING (ICCSP), VOL. 1, 2016, : 581 - 586
  • [7] Single Channel speech separation based on empirical mode decomposition and Hilbert Transform
    Krishna, Prasanna Kumar Mundodu
    Ramaswamy, Kumaraswamy
    IET SIGNAL PROCESSING, 2017, 11 (05) : 579 - 586
  • [8] Removal of Artifacts in ECG using Empirical Mode Decomposition
    Anapagamini, S. A.
    Rajavel, R.
    2013 INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND SIGNAL PROCESSING (ICCSP), 2013, : 288 - 292
  • [9] Dysfluent Speech Classification Using Variational Mode Decomposition and Complete Ensemble Empirical Mode Decomposition Techniques With NGCU-Based RNN
    Vinay, N. A.
    Vidyasagar, K. N.
    Rohith, S.
    Supreeth, S.
    Prasad, S. N.
    Kumar, S. Pramod
    Bharathi, S. H.
    IEEE ACCESS, 2024, 12 : 174934 - 174953
  • [10] Single-channel speech separation using empirical mode decomposition and multi pitch information with estimation of number of speakers
    Prasanna Kumar M.K.
    Kumaraswamy R.
    International Journal of Speech Technology, 2017, 20 (01) : 109 - 125