Detection of Glottal Activity Using Different Attributes of Source Information

被引:17
作者
Adiga, Nagaraj [1 ]
Prasanna, S. R. M. [1 ]
机构
[1] Indian Inst Technol Guwahati, Dept Elect & Elect Engn, Gauhati 781039, India
关键词
Glottal activity; higher-order statistics; normalized autocorrelation peak strength; strength of excitation; LINEAR PREDICTION; EPOCH EXTRACTION;
D O I
10.1109/LSP.2015.2461008
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The major activity during speech production is glottal activity and is earlier detected using strength of excitation (SoE). This work uses the normalized autocorrelation peak strength (NAPS) and higher order statistics (HOS) as additional features for detecting glottal activity. The three features, namely, SoE, NAPS, and HOS, are, respectively indicators of different attributes of glottal activity, namely, energy, periodicity, and asymmetrical nature of the resulting source signal. The effectiveness of these features is analyzed using the differential electroglottograph signal, zero-frequency filtered signal, and integrated linear prediction residual, as representatives of source signal. The combination of glottal activity information from the three features outperforms any single of them, demonstrating different information represented by each of these features.
引用
收藏
页码:2107 / 2111
页数:5
相关论文
共 14 条
[1]  
Ananthapadmanabha T. V., 1984, ACOUSTIC ANAL VOICE
[2]  
[Anonymous], P INTERSPEECH
[3]  
[Anonymous], 2011, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, DOI DOI 10.1111/J.1096-3642.2009.00621.X
[4]   Voiced/Nonvoiced Detection Based on Robustness of Voiced Epochs [J].
Dhananjaya, N. ;
Yegnanarayana, B. .
IEEE SIGNAL PROCESSING LETTERS, 2010, 17 (03) :273-276
[5]  
Kominek J, 2004, P 5 ISCA WORKSH SPEE
[6]   2-CHANNEL SPEECH ANALYSIS [J].
KRISHNAMURTHY, AK ;
CHILDERS, DG .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1986, 34 (04) :730-743
[7]   LINEAR PREDICTION - TUTORIAL REVIEW [J].
MAKHOUL, J .
PROCEEDINGS OF THE IEEE, 1975, 63 (04) :561-580
[8]   Epoch Extraction From Speech Signals [J].
Murty, K. Sri Rama ;
Yegnanarayana, B. .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (08) :1602-1613
[9]   Characterization of Glottal Activity From Speech Signals [J].
Murty, K. Sri Rama ;
Yegnanarayana, B. ;
Joseph, M. Anand .
IEEE SIGNAL PROCESSING LETTERS, 2009, 16 (06) :469-472
[10]   Robust voice activity detection using higher-order statistics in the LPC residual domain [J].
Nemer, E ;
Goubran, R ;
Mahmoud, S .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (03) :217-231