Online Unsupervised Classification With Model Comparison in the Variational Bayes Framework for Voice Activity Detection

Cited by: 8
Authors
Cournapeau, David [1, 2]
Watanabe, Shinji [2]
Nakamura, Atsushi [2]
Kawahara, Tatsuya [1]
Affiliations
[1] Kyoto Univ, Sch Informat, Kyoto 6068501, Japan
[2] NTT Corp, NTT Commun Sci Labs, Kyoto 6190237, Japan
Keywords
Sequential estimation; speech analysis; variational Bayes (VB); voice activity detection (VAD); speech recognition; EM algorithm
DOI
10.1109/JSTSP.2010.2080821
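Chinese Library Classification (CLC) codes
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification codes
0808; 0809
Abstract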
A new online, unsupervised method for voice activity detection (VAD) is proposed. Conventional VAD methods often rely on heuristics to adapt the decision threshold to the estimated SNR. The proposed method is based on the variational Bayes (VB) approach to online expectation maximization (EM), so that it automatically adapts the decision level and the statistical model at the same time. We consider two parallel classifiers, one for the noise-only case and the other for the speech-and-noise case. Both models are trained concurrently and online using the VB framework. The VB framework also provides an explicit approximation of the log evidence, called the free energy, which is used to assess the reliability of the classifier in an online fashion and to decide which model is more appropriate at a given time frame. Experimental evaluations were conducted on the CENSREC-1-C database, which is designed for VAD evaluation. Thanks to the model comparison, the proposed scheme outperforms conventional VAD algorithms, especially in the remote recording condition. It is also shown to be more robust to changes in the noise type.
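The model comparison described in the abstract can be sketched with the standard variational bound on the log evidence. This is a generic illustration, not the paper's exact formulation: the symbols $x_{1:t}$ (features observed up to frame $t$), $z$ (latent class labels), $\theta$ (model parameters), $m$ (model index) and the sign convention for the free energy $\mathcal{F}$ are assumptions made here, and the online free energy used in the paper may be defined differently.

```latex
% Log evidence of model m, decomposed into the free energy and a KL term.
% Since KL >= 0, the free energy is a lower bound on the log evidence.
\ln p(x_{1:t} \mid m)
  = \mathcal{F}_m[q]
  + \mathrm{KL}\!\left( q(z, \theta) \,\middle\|\, p(z, \theta \mid x_{1:t}, m) \right)
  \;\ge\; \mathcal{F}_m[q],
\qquad
\mathcal{F}_m[q]
  = \mathbb{E}_{q}\!\left[ \ln p(x_{1:t}, z, \theta \mid m) \right]
  - \mathbb{E}_{q}\!\left[ \ln q(z, \theta) \right].

% Frame-wise model choice between the two classifiers (assumed notation):
\hat{m}_t
  = \arg\max_{m \in \{\text{noise-only},\ \text{speech-and-noise}\}}
    \mathcal{F}_m^{(t)}.
```

Under this convention, a larger free energy means the model explains the data seen so far better, so the noise-only model would be preferred at frames where its bound is at least as high as that of the speech-and-noise model.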
Pages: 1071-1083
Number of pages: 13