VOICE ACTIVITY DETECTION USING CONVOLUTIVE NON-NEGATIVE SPARSE CODING

被引:0
|
作者
Teng, Peng [1 ]
Jia, Yunde [1 ]
机构
[1] Beijing Inst Technol, Sch Comp, Beijing, Peoples R China
关键词
voice activity detection; convolutive non-negative sparse coding; conditional random fields;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper presents a voice activity detection (VAD) approach using convolutive non-negative sparse coding (CNSC) to improve the detection performance in low signal-to-noise (SNR) conditions. Our idea is to use noise-robust feature for speech signal detection while noise is reduced away. We first use magnitude spectrum as the non-negative and additive low-level representation of audio signals, and learn a speech dictionary from clean speech as well as a noise dictionary from noise samples. Then, the two dictionaries are concatenated to form a global dictionary, and an audio signal is decomposed into coefficient vectors using CNSC on the global dictionary. Only coefficients corresponding to the bases from the speech dictionary are taken as the features for the signal. At last, the activity labels is given by decoding a conditional random field (CRF) which is constructed to model the context of an audio signal for VAD. Experiments demonstrate that our VAD approach has an excellent performance in low SNR conditions.
引用
收藏
页码:7373 / 7377
页数:5
相关论文
共 50 条
  • [21] NON-NEGATIVE SPARSE CODING FOR HUMAN ACTION RECOGNITION
    Amiri, S. Mohsen
    Nasiopoulos, Panos
    Leung, Victor C. M.
    2012 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2012), 2012, : 1421 - 1424
  • [22] Non-Negative Kernel Sparse Coding for Image Classification
    Zhang, Yungang
    Xu, Tianwei
    Ma, Jieming
    INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING: IMAGE AND VIDEO DATA ENGINEERING, ISCIDE 2015, PT I, 2015, 9242 : 531 - 540
  • [23] Face recognition using localized features based on non-negative sparse coding
    Bhavin J. Shastri
    Martin D. Levine
    Machine Vision and Applications, 2007, 18 : 107 - 122
  • [24] Noise removal using a novel non-negative sparse coding shrinkage technique
    Shang, L
    Huang, DS
    Zheng, CH
    Sun, ZL
    NEUROCOMPUTING, 2006, 69 (7-9) : 874 - 877
  • [25] Face recognition using localized features based on non-negative sparse coding
    Shastri, Bhavin J.
    Levine, Martin D.
    MACHINE VISION AND APPLICATIONS, 2007, 18 (02) : 107 - 122
  • [27] Non-negative Kernel Sparse Coding for the Analysis of Motion Data
    Hosseini, Babak
    Huelsmann, Felix
    Botsch, Mario
    Hammer, Barbara
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2016, PT II, 2016, 9887 : 506 - 514
  • [28] Dispersion Constraint Based Non-negative Sparse Coding Model
    Wang, Xin
    Wang, Can
    Shang, Li
    Sun, Zhan-Li
    NEURAL PROCESSING LETTERS, 2016, 43 (02) : 603 - 609
  • [29] ISAR target recognition based on non-negative sparse coding
    Tang, Ning
    Gao, Xunzhang
    Li, Xiang
    JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2012, 23 (06) : 849 - 857
  • [30] Dispersion constraint based non-negative sparse coding algorithm
    Shang, Li
    Wang, Xin
    Sun, Zhan-Li
    NEUROCOMPUTING, 2016, 188 : 253 - 261