VOICE ACTIVITY DETECTION USING CONVOLUTIVE NON-NEGATIVE SPARSE CODING

被引:0
|
作者
Teng, Peng [1 ]
Jia, Yunde [1 ]
机构
[1] Beijing Inst Technol, Sch Comp, Beijing, Peoples R China
关键词
voice activity detection; convolutive non-negative sparse coding; conditional random fields;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper presents a voice activity detection (VAD) approach using convolutive non-negative sparse coding (CNSC) to improve the detection performance in low signal-to-noise (SNR) conditions. Our idea is to use noise-robust feature for speech signal detection while noise is reduced away. We first use magnitude spectrum as the non-negative and additive low-level representation of audio signals, and learn a speech dictionary from clean speech as well as a noise dictionary from noise samples. Then, the two dictionaries are concatenated to form a global dictionary, and an audio signal is decomposed into coefficient vectors using CNSC on the global dictionary. Only coefficients corresponding to the bases from the speech dictionary are taken as the features for the signal. At last, the activity labels is given by decoding a conditional random field (CRF) which is constructed to model the context of an audio signal for VAD. Experiments demonstrate that our VAD approach has an excellent performance in low SNR conditions.
引用
收藏
页码:7373 / 7377
页数:5
相关论文
共 50 条
  • [41] Constrained non-negative sparse coding using learnt instrument templates for realtime music transcription
    Carabias-Orti, J. J.
    Rodriguez-Serrano, F. J.
    Vera-Candeas, P.
    Canadas-Quesada, F. J.
    Ruiz-Reyes, N.
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2013, 26 (07) : 1671 - 1680
  • [42] Natural image compression using an extended non-negative sparse coding neural network technique
    Shang, L
    Huang, DS
    Zheng, CH
    Sun, ZL
    PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), VOLS 1-5, 2005, : 1866 - 1871
  • [43] Non-negative sparse representation for anomaly detection in hyperspectral imagery
    Wei D.
    Huang S.
    Zhao Y.
    Pang C.
    1600, Chinese Society of Astronautics (45):
  • [44] LINEAR SPATIAL PYRAMID MATCHING USING NON-CONVEX AND NON-NEGATIVE SPARSE CODING FOR IMAGE CLASSIFICATION
    Bao, Chengqiang
    He, Liangtian
    Wang, Yilun
    2015 IEEE CHINA SUMMIT & INTERNATIONAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING, 2015, : 186 - 190
  • [45] RECOGNIZING HUMAN ACTIONS BASED ON SPARSE CODING WITH NON-NEGATIVE AND LOCALITY CONSTRAINTS
    Chen, Yuanbo
    Zhao, Yanyun
    Cai, Anni
    2013 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (IEEE VCIP 2013), 2013,
  • [46] Extended Super Resolution of Hyperspectral Images via Non-negative Sparse Coding
    Pawar, Maneesh
    Venkatesh, K. S.
    SENSING AND IMAGING, 2019, 20 (1):
  • [47] Extended Super Resolution of Hyperspectral Images via Non-negative Sparse Coding
    Maneesh Pawar
    K. S. Venkatesh
    Sensing and Imaging, 2019, 20
  • [48] An Entorhinal-Hippocampal Loop Model Based on Non-negative Sparse Coding
    Zhao K.
    Ren M.
    Journal of The Institution of Engineers (India): Series B, 2025, 106 (01) : 113 - 127
  • [49] ACTIVE-SET NEWTON ALGORITHM FOR NON-NEGATIVE SPARSE CODING OF AUDIO
    Virtanen, Tuomas
    Raj, Bhiksha
    Gemmeke, Jort F.
    Van Hamme, Hugo
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [50] Dispersion Constraint Based Non-Negative Sparse Coding Neural Network Model
    Shang, Li
    Zhou, Yan
    Sun, Zhan-Li
    INTELLIGENT COMPUTING THEORY, 2014, 8588 : 473 - 479