Robust Voice Activity Detection Against Non Homogeneous Noisy Environments

被引:0
作者
Chelloug, Charaf Eddine [1 ]
Farrouki, Atef [1 ]
机构
[1] Univ Constantine, Lab SISCOM, Constantine, Algeria
来源
2018 INTERNATIONAL CONFERENCE ON SIGNAL, IMAGE, VISION AND THEIR APPLICATIONS (SIVA) | 2018年
关键词
Voice Activity Detection; False Acceptance Rate; Full band Energy; adaptive detection; Microcontroller unit (MCU);
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this work, an improved voice activity detection (VAD) algorithm is presented to deal with non stationary noisy environments. The proposed approach is based on adaptive thresholding to regulate the False Acceptance (FA) ratio in absence of active voice. Sequential hypothesis tests, using full band energy, have been carrying out to reject or to classify the frame under processing as a voiced segment. The main advantage of the proposed technique consists of its capability to automatically update the level of background noise, by taking into account the current environment. Performances and real time behavior have been analyzed and compared to the modern standard G.729-B, by implementing the proposed VAD architecture on a Micro Controller Unit-based processing system.
引用
收藏
页数:6
相关论文
共 10 条
[1]  
Bao X., 2013, P IET INT SIGN PROC, P1
[2]   Voice activity detection based on multiple statistical models [J].
Chang, Joon-Hyuk ;
Kim, Nam Soo ;
Mitra, Sanjit K. .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2006, 54 (06) :1965-1976
[3]  
Chelloug CE, 2016, INT WIREL COMMUN, P139, DOI 10.1109/IWCMC.2016.7577047
[4]   Likelihood ratio sign test for voice activity detection [J].
Deng, S. ;
Han, J. .
IET SIGNAL PROCESSING, 2012, 6 (04) :306-312
[5]  
Etellisi Ehab, 2011, SCIRES COMMUNICATION, V3, P185
[6]  
Hu Y., 2006, IEEE INT C AC SPEECH, P588
[7]  
ITU, 1996, SIL COMPR SCHEM G 72
[8]   Voice Activity Detection in Presence of Transient Noise Using Spectral Clustering [J].
Mousazadeh, Saman ;
Cohen, Israel .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (06) :1261-1271
[9]  
Muralishankar R, 2010, IEEE ICC
[10]   Voice Activity Detection Based on an Unsupervised Learning Framework [J].
Ying, Dongwen ;
Yan, Yonghong ;
Dang, Jianwu ;
Soong, Frank K. .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (08) :2624-2632