USE OF PITCH CONTINUITY FOR ROBUST SPEECH ACTIVITY DETECTION

Cited by: 0
Authors
Shao, Yiwen [1, 2]
Lin, Qiguang [1]
Affiliations
[1] Baihu Technology Co., Ltd., Guangzhou, Guangdong, People's Republic of China
[2] Johns Hopkins University, Center for Language and Speech Processing, Baltimore, MD 21218, USA
Source
2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2018
Keywords
autocorrelation function; speech activity detection; pitch continuity; pitch detection
DOI
Not available
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Speech activity detection (SAD) is an important component of many speech processing applications and has been studied extensively in recent years. Pitch continuity, a significant characteristic of speech, has nevertheless not been used successfully in existing SAD methods. In this work, we propose a novel way to integrate pitch continuity with pitch-related features. The method is implemented within the Combo-SAD approach: we examine three consecutive frames and assume, because of pitch continuity, that they all have the same pitch as the center frame. The corresponding feature values are recomputed at the adjusted pitch location and then used in the final expression. The new combo feature is evaluated with various types of additive noise at different signal-to-noise ratios (SNRs). The results show that the new feature leads to better SAD performance, with up to a 39.3% relative improvement in miss rate compared to Combo-SAD. We also introduce a novel variant of the underlying autocorrelation function and illustrate how it can improve the accuracy of pitch detection.
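The three-frame idea in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation. It uses a plain normalized autocorrelation rather than the novel variant the abstract mentions, and it averages the three values as a stand-in for the paper's "final expression", which this record does not give. The function names, the 60-400 Hz pitch search range, the 8 kHz sample rate, and the 320-sample frame length are all assumptions made for the example.

```python
import numpy as np


def normalized_autocorr(frame):
    """Normalized autocorrelation of one frame (the value at lag 0 is 1)."""
    frame = frame - np.mean(frame)
    full = np.correlate(frame, frame, mode="full")
    acf = full[len(frame) - 1:]          # keep non-negative lags only
    if acf[0] <= 0:                      # silent frame: no periodicity
        return np.zeros_like(acf)
    return acf / acf[0]


def pitch_lag(acf, sample_rate, f_min=60.0, f_max=400.0):
    """Lag (in samples) of the autocorrelation peak within an assumed pitch range."""
    lo = int(sample_rate / f_max)
    hi = int(sample_rate / f_min)
    return lo + int(np.argmax(acf[lo:hi + 1]))


def continuity_feature(prev_frame, center_frame, next_frame, sample_rate):
    """Evaluate all three frames at the center frame's pitch lag.

    Assumption: the three values are simply averaged; the paper's actual
    combination rule is not stated in the abstract.
    """
    acfs = [normalized_autocorr(f) for f in (prev_frame, center_frame, next_frame)]
    lag = pitch_lag(acfs[1], sample_rate)
    return float(np.mean([a[lag] for a in acfs])), lag


sample_rate = 8000
frame_len = 320                          # 40 ms frames (assumed)
n = np.arange(3 * frame_len)

# Synthetic "voiced" signal: a steady 200 Hz tone, i.e. a 40-sample period.
voiced = np.sin(2 * np.pi * 200 * n / sample_rate)
voiced_score, voiced_lag = continuity_feature(
    voiced[:frame_len], voiced[frame_len:2 * frame_len], voiced[2 * frame_len:],
    sample_rate)

# White noise has no pitch that persists across frames.
noise = np.random.default_rng(0).standard_normal(3 * frame_len)
noise_score, _ = continuity_feature(
    noise[:frame_len], noise[frame_len:2 * frame_len], noise[2 * frame_len:],
    sample_rate)

print(voiced_lag, voiced_score > noise_score)
```

In this toy setup the steady tone keeps a high autocorrelation value at the center frame's lag in all three frames, while a spurious peak in noise is unlikely to recur at the same lag in the neighboring frames. That is the intuition behind using pitch continuity as a cue for speech.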
Pages: 5534-5538
Page count: 5