Noise Robust Voice Activity Detection Using Features Extracted From the Time-Domain Autocorrelation Function

被引:0
|
作者
Ghaemmaghami, Houman [1 ]
Baker, Brendan [1 ]
Vogt, Robbie [1 ]
Sridharan, Sridha [1 ]
机构
[1] Queensland Univ Technol, Speech & Audio Res Lab, Brisbane, Qld 4001, Australia
来源
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4 | 2010年
关键词
voice activity detection; high noise; autocorrelation; zero-crossing rate; time-domain analysis; SPEECH;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper presents a method of voice activity detection (VAD) for high noise scenarios, using a noise robust voiced speech detection feature. The developed method is based on the fusion of two systems. The first system utilises the maximum peak of the normalised time-domain autocorrelation function (MaxPeak). The second system uses a novel combination of cross-correlation and zero-crossing rate of the normalised autocorrelation to approximate a measure of signal pitch and periodicity (CrossCorr) that is hypothesised to be noise robust. The score outputs by the two systems are then merged using weighted sum fusion to create the proposed autocorrelation zero-crossing rate (AZR) VAD. Accuracy of AZR was compared to state-of-the-art and standardised VAD methods and was shown to outperform the best performing system with an average relative improvement of 24.8% in half-total error rate (HTER) on the QUT-NOISE-TIMIT database created using real recordings from high-noise environments.
引用
收藏
页码:3118 / 3121
页数:4
相关论文
共 50 条
  • [1] NOISE ROBUST VOICE ACTIVITY DETECTION USING NORMAL PROBABILITY TESTING AND TIME-DOMAIN HISTOGRAM ANALYSIS
    Ghaemmaghami, Houman
    Dean, David
    Sridharan, Sridha
    McCowan, Iain
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4470 - 4473
  • [2] Spontaneous State Detection Using Time-Frequency and Time-Domain Features Extracted From Stereo-Electroencephalography Traces
    Ye, Huanpeng
    Fan, Zhen
    Li, Guangye
    Wu, Zehan
    Hu, Jie
    Sheng, Xinjun
    Chen, Liang
    Zhu, Xiangyang
    FRONTIERS IN NEUROSCIENCE, 2022, 16
  • [3] On Noise Robust Voice Activity Detection
    Dekens, Tomas
    Verhelst, Werner
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2660 - 2663
  • [4] Cavitation detection in centrifugal pumps using pressure time-domain features
    Samanipour, Pouya
    Poshtan, Javad
    Sadeghi, Hamed
    TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2017, 25 (05) : 4287 - 4298
  • [5] Robust voice activity detection directed by noise classification
    Saeedi, Jamal
    Ahadi, Seyed Mohammad
    Faez, Karim
    SIGNAL IMAGE AND VIDEO PROCESSING, 2015, 9 (03) : 561 - 572
  • [6] Robust voice activity detection based on noise eigenspace
    Ying, Dongwen
    Shi, Yu
    Lu, Xugang
    Dang, Jianwu
    Soong, Frank
    ACOUSTICAL SCIENCE AND TECHNOLOGY, 2007, 28 (06) : 413 - 423
  • [7] Robust voice activity detection directed by noise classification
    Jamal Saeedi
    Seyed Mohammad Ahadi
    Karim Faez
    Signal, Image and Video Processing, 2015, 9 : 561 - 572
  • [8] A Robust Voice Activity Detection Algorithm in Nonstationary Noise
    Lei, Jianjun
    Yang, Jiachen
    Wang, Jian
    Yang, Zhen
    2009 INTERNATIONAL CONFERENCE ON INDUSTRIAL AND INFORMATION SYSTEMS, PROCEEDINGS, 2009, : 195 - +
  • [9] Time Domain Audio Features for Chainsaw Noise Detection Using WSNs
    Czuni, Laszlo
    Varga, Peter Zoltan
    IEEE SENSORS JOURNAL, 2017, 17 (09) : 2917 - 2924
  • [10] A robust voice activity detection based on noise eigenspace projection
    Ying, Dongwen
    Shi, Yu
    Soong, Frank
    Dang, Jianwu
    Lu, Xugang
    CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4274 : 76 - +