Characterizing Glottal Activity from Speech using Empirical Mode Decomposition

被引：0

作者：

Sharma, Rajib ^{[1
]}

Prasanna, S. R. Mahadeva ^{[1
]}

机构：

[1] Indian Inst Technol Guwahati, Dept Elect & Elect Engn, Gauhati 781039, India

来源：

2015 TWENTY FIRST NATIONAL CONFERENCE ON COMMUNICATIONS (NCC) | 2015年

关键词：

Glottal activity; EMD; IMF; GIMF; GAD; pitch;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Glottal activity is an important aspect of speech production that results in voiced speech, and localizing such regions for computing various parameters of the excitation source is useful in many speech processing applications. The aim of this paper is to investigate the ability of Empirical Mode Decomposition (EMD) and its noise assisted variants, in characterizing glottal activity from the speech signal. A pair of consecutive Intrinsic Mode Functions (IMFs), obtained from the decomposition is found to reflect the periodic nature of different voiced regions of the speech signal. This IMF pair is utilized to construct a signal, named the Glottal Intrinsic Mode Function (GIMF), which represents most of the voiced speech regions. To measure the capability of the GIMF in representing the glottal activity, it is applied to the tasks of Glottal Activity Detection (GAD), pitch frequency (F-0) tracking and detecting pitch markers. The results ascertain the capability of EMD in localizing Glottal activity within a small subset of IMFs, and suggest the possibility of accurately extracting source-information from voiced speech with simple signal processing procedures.

引用

页数：6

共 50 条

[1] Detection of the Glottal Closure Instants Using Empirical Mode Decomposition
Sharma, Rajib
Prasanna, S. R. M.
Leonardo Rufiner, Hugo
Schlotthauer, Gaston
CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2018, 37 (08) : 3412 - 3440
[2] Speech vs Music Discrimination using Empirical Mode Decomposition
Khonglah, Banriskhem K.
Sharma, Rajib
Prasanna, S. R. Mahadeva
2015 TWENTY FIRST NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2015,
[3] A better decomposition of speech obtained using modified Empirical Mode Decomposition
Sharma, Rajib
Prasanna, S. R. Mahadeva
DIGITAL SIGNAL PROCESSING, 2016, 58 : 26 - 39
[4] Empirical Mode Decomposition: A way for finding Pitch (Stuttered speech signal)
Raju, N.
Neelamegam, P.
RESEARCH JOURNAL OF PHARMACEUTICAL BIOLOGICAL AND CHEMICAL SCIENCES, 2016, 7 (06): : 1030 - 1036
[5] Adaptive Compressive Sensing of Speech Signals based on Empirical Mode Decomposition
Wang, Shi-kui
Shao, Yu-feng
2014 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATION AND SENSOR NETWORK (WCSN), 2014, : 444 - 447
[6] An Adaptive Speech Enhancement Approach Based on DCT and Empirical Mode Decomposition
Rao, S. Nageswara
Sankar, K. Jaya
Naidu, C. D.
2016 INTERNATIONAL CONFERENCE ON COMMUNICATION AND SIGNAL PROCESSING (ICCSP), VOL. 1, 2016, : 581 - 586
[7] Single Channel speech separation based on empirical mode decomposition and Hilbert Transform
Krishna, Prasanna Kumar Mundodu
Ramaswamy, Kumaraswamy
IET SIGNAL PROCESSING, 2017, 11 (05) : 579 - 586
[8] Removal of Artifacts in ECG using Empirical Mode Decomposition
Anapagamini, S. A.
Rajavel, R.
2013 INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND SIGNAL PROCESSING (ICCSP), 2013, : 288 - 292
[9] Dysfluent Speech Classification Using Variational Mode Decomposition and Complete Ensemble Empirical Mode Decomposition Techniques With NGCU-Based RNN
Vinay, N. A.
Vidyasagar, K. N.
Rohith, S.
Supreeth, S.
Prasad, S. N.
Kumar, S. Pramod
Bharathi, S. H.
IEEE ACCESS, 2024, 12 : 174934 - 174953
[10] Single-channel speech separation using empirical mode decomposition and multi pitch information with estimation of number of speakers
Prasanna Kumar M.K.
Kumaraswamy R.
International Journal of Speech Technology, 2017, 20 (01) : 109 - 125

← 1 2 3 4 5 →