Detection of Glottal Activity Using Different Attributes of Source Information

被引:17
|
作者
Adiga, Nagaraj [1 ]
Prasanna, S. R. M. [1 ]
机构
[1] Indian Inst Technol Guwahati, Dept Elect & Elect Engn, Gauhati 781039, India
关键词
Glottal activity; higher-order statistics; normalized autocorrelation peak strength; strength of excitation; LINEAR PREDICTION; EPOCH EXTRACTION;
D O I
10.1109/LSP.2015.2461008
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The major activity during speech production is glottal activity and is earlier detected using strength of excitation (SoE). This work uses the normalized autocorrelation peak strength (NAPS) and higher order statistics (HOS) as additional features for detecting glottal activity. The three features, namely, SoE, NAPS, and HOS, are, respectively indicators of different attributes of glottal activity, namely, energy, periodicity, and asymmetrical nature of the resulting source signal. The effectiveness of these features is analyzed using the differential electroglottograph signal, zero-frequency filtered signal, and integrated linear prediction residual, as representatives of source signal. The combination of glottal activity information from the three features outperforms any single of them, demonstrating different information represented by each of these features.
引用
收藏
页码:2107 / 2111
页数:5
相关论文
共 26 条
  • [21] Vowel Onset Point Detection Using Source, Spectral Peaks, and Modulation Spectrum Energies
    Prasanna, S. R. Mahadeva
    Reddy, B. V. Sandeep
    Krishnamoorthy, P.
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (04): : 556 - 565
  • [22] Subpixel target detection in hyperspectral data using higher order statistics source separation algorithms
    Robila, S
    Computational Imaging III, 2005, 5674 : 424 - 431
  • [23] A speech feature extraction method using complexity measure for voice activity detection in WGN
    Huang, Heyun
    Lin, Fuhuei
    SPEECH COMMUNICATION, 2009, 51 (09) : 714 - 723
  • [24] Robust voice activity detection using higher-order statistics in the LPC residual domain
    Nemer, E
    Goubran, R
    Mahmoud, S
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (03): : 217 - 231
  • [25] Spike detection in human muscle sympathetic nerve activity using the kurtosis of stationary wavelet transform coefficients
    Brychta, Robert J.
    Shiavi, Richard
    Robertson, David
    Diedrich, Andre
    JOURNAL OF NEUROSCIENCE METHODS, 2007, 160 (02) : 359 - 367
  • [26] Analysis and Detection of Phonation Modes in Singing Voice using Excitation Source Features and Single Frequency Filtering Cepstral Coefficients (SFFCC)
    Kadiri, Sudarsana Reddy
    Yegnanarayana, B.
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 441 - 445