VOICE ACTIVITY DETECTION USING CONVOLUTIVE NON-NEGATIVE SPARSE CODING

被引:0
|
作者
Teng, Peng [1 ]
Jia, Yunde [1 ]
机构
[1] Beijing Inst Technol, Sch Comp, Beijing, Peoples R China
关键词
voice activity detection; convolutive non-negative sparse coding; conditional random fields;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper presents a voice activity detection (VAD) approach using convolutive non-negative sparse coding (CNSC) to improve the detection performance in low signal-to-noise (SNR) conditions. Our idea is to use noise-robust feature for speech signal detection while noise is reduced away. We first use magnitude spectrum as the non-negative and additive low-level representation of audio signals, and learn a speech dictionary from clean speech as well as a noise dictionary from noise samples. Then, the two dictionaries are concatenated to form a global dictionary, and an audio signal is decomposed into coefficient vectors using CNSC on the global dictionary. Only coefficients corresponding to the bases from the speech dictionary are taken as the features for the signal. At last, the activity labels is given by decoding a conditional random field (CRF) which is constructed to model the context of an audio signal for VAD. Experiments demonstrate that our VAD approach has an excellent performance in low SNR conditions.
引用
收藏
页码:7373 / 7377
页数:5
相关论文
共 50 条
  • [31] Dispersion Constraint Based Non-negative Sparse Coding Model
    Xin Wang
    Can Wang
    Li Shang
    Zhan-Li Sun
    Neural Processing Letters, 2016, 43 : 603 - 609
  • [32] A Voice Activity Detection Algorithm Using Sparse Non-negative Matrix Factorization-based Model Learning in Spectro-Temporal Domain
    Mavaddati, S.
    INTERNATIONAL JOURNAL OF ENGINEERING, 2023, 36 (08): : 1478 - 1488
  • [33] Local Region Partitioning for Disguised Face Recognition Using Non-negative Sparse Coding
    Khoa Dang Dang
    Thai Hoang Le
    ADVANCED METHODS FOR COMPUTATIONAL COLLECTIVE INTELLIGENCE, 2013, 457 : 197 - 206
  • [34] Non-Negative Sparse Coding based Scalable Access Control using Fingertip ECG
    Raj, Peter Sam
    Sonowal, Sukanya
    Hatzinakos, Dimitrios
    2014 IEEE/IAPR INTERNATIONAL JOINT CONFERENCE ON BIOMETRICS (IJCB 2014), 2014,
  • [35] Sparse image coding using a 3D non-negative tensor factorization
    Hazan, T
    Polak, S
    Shashua, A
    TENTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS 1 AND 2, PROCEEDINGS, 2005, : 50 - 57
  • [36] Combining Non-negative Matrix Factorization and Sparse Coding for Functional Brain Overlapping Community Detection
    X. Li
    Z. Hu
    H. Wang
    Cognitive Computation, 2018, 10 : 991 - 1005
  • [37] Combining Non-negative Matrix Factorization and Sparse Coding for Functional Brain Overlapping Community Detection
    Li, X.
    Hu, Z.
    Wang, H.
    COGNITIVE COMPUTATION, 2018, 10 (06) : 991 - 1005
  • [38] Image Classification by Non-Negative Sparse Coding, Low-Rank and Sparse Decomposition
    Zhang, Chunjie
    Liu, Jing
    Tian, Qi
    Xu, Changsheng
    Lu, Hanqing
    Ma, Songde
    2011 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2011, : 1673 - 1680
  • [39] LEARNING SPEECH FEATURES IN THE PRESENCE OF NOISE: SPARSE CONVOLUTIVE ROBUST NON-NEGATIVE MATRIX FACTORIZATION
    de Frein, Ruairi
    Rickard, Scott T.
    2009 16TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING, VOLS 1 AND 2, 2009, : 1248 - 1253
  • [40] Sparse coding of human motion trajectories with non-negative matrix factorization
    Vollmer, Christian
    Hellbach, Sven
    Eggert, Julian
    Gross, Horst-Michael
    NEUROCOMPUTING, 2014, 124 : 22 - 32