Robust Voice Activity Detection Based on Adaptive Sub-band Energy Sequence Analysis and Harmonic Detection

被引:0
|
作者
Guo, Yanmeng [1 ]
Fu, Qiang [1 ]
Yan, Yonghong [1 ]
机构
[1] Chinese Acad Sci, Inst Acoust, ThinkIT Speech Lab, Beijing 100080, Peoples R China
来源
INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4 | 2007年
关键词
voice activity detection; harmonic structure; noise robustness; automatic speech recognition;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Voice activity detection (VAD) in real-world noise is a very challenging task. In this paper, a two-step methodology is proposed to solve the problem. First, segments with non-stationary components, including speech and dynamic noise, are located using sub-band energy sequence analysis (SESA). Secondly, voice is detected within the selected segments employing the proposed method concerning its harmonic structure. Therefore, speech segments can be accurately detected by this rule-based framework. This algorithm is evaluated in several databases in terms of speech/non-speech discrimination and in terms of word accuracy rate when it is used as the front-end of automatic speech recognition (ASR) system. It provides a more reliable performance over the commonly used standard methods.
引用
收藏
页码:1637 / 1640
页数:4
相关论文
共 50 条
  • [1] ROBUST VOICE ACTIVITY DETECTION BASED ON PITCH AND SUB-BAND ENERGY
    Zhang, Zhihao
    Lin, Jinlong
    SIGMAP 2009: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND MULTIMEDIA APPLICATIONS, 2009, : 44 - 48
  • [2] Sub-band reserved likelihood ratio-based robust voice activity detection
    School of Electronic and Information Engineering, South China University of Technology, Guangzhou
    510641, China
    Huazhong Ligong Daxue Xuebao, 11 (78-82):
  • [3] An Improved Robust Statistical Voice Activity Detection based on Sub-band Periodic Intensity
    He, Weijun
    Feng, Xiaohui
    Zhu, Zhengyu
    Zhou, Weili
    2015 IEEE INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION, 2015, : 2171 - 2175
  • [4] A Voice Activity Detection Algorithm With Sub-band Detection Based on Time-frequency Characteristics of Mandarin
    Wang, Yinfeng
    Huang, Shaoguang
    Wei, Ying
    2013 6TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING (CISP), VOLS 1-3, 2013, : 1287 - 1291
  • [5] Efficient voice activity detection algorithm based on sub-band temporal envelope and sub-band long-term signal variability
    Liu, Bin
    Tao, Jianhua
    Mo, Fuyuan
    Li, Ya
    Wen, Zhengqi
    Liu, Shanfeng
    2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 531 - +
  • [6] Wavelet sub-band features for voice disorder detection and classification
    Gidaye, Girish
    Nirmal, Jagannath
    Ezzine, Kadria
    Frikha, Mondher
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (39-40) : 28499 - 28523
  • [7] Wavelet sub-band features for voice disorder detection and classification
    Girish Gidaye
    Jagannath Nirmal
    Kadria Ezzine
    Mondher Frikha
    Multimedia Tools and Applications, 2020, 79 : 28499 - 28523
  • [8] A Discrete Wavelet Transform-Based Voice Activity Detection and Noise Classification With Sub-Band Selection
    Abdullah, Salinna
    Zamani, Majid
    Demosthenous, Andreas
    2021 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2021,
  • [9] A Replay Speech Detection Algorithm Based on Sub-band Analysis
    Lang Lin
    Wang, Rangding
    Yan Diqun
    INTELLIGENT INFORMATION PROCESSING IX, 2018, 538 : 337 - 345
  • [10] Robust Whisper Activity Detection Using Long-Term Log Energy Variation of Sub-Band Signal
    Meenakshi, G. Nisha
    Ghosh, Prasanta Kumar
    IEEE SIGNAL PROCESSING LETTERS, 2015, 22 (11) : 1859 - 1863