Speech enhancement with a GSC-like structure employing sparse coding

Authors
Li-chun Yang
Yun-tao Qian
Affiliations
[1] Zhejiang University,College of Computer Science and Technology
[2] Zhejiang Wanli University,Intelligent Control Research Institute
Source
Journal of Zhejiang University SCIENCE C | 2014 / Vol. 15
Keywords
Generalized sidelobe canceller; Speech enhancement; Voice activity detection; Dictionary learning; Sparse coding
DOI
Not available
CLC number
TN912.35
Subject classification number
Abstract
Speech communication is often degraded by various types of interfering signals. To improve the quality of the desired signal, the generalized sidelobe canceller (GSC), which uses a reference signal to estimate the interfering signal, has attracted the attention of researchers. However, the interference suppression of the GSC is limited because a small amount of residual desired signal leaks into the reference signal. To overcome this problem, we use sparse coding to suppress the residual desired signal while preserving the reference signal. Sparse coding with a learned dictionary is usually used to reconstruct the desired signal. However, because training samples of the desired signal for dictionary learning are not observable in a real environment, the reconstructed desired signal may contain a large amount of residual interfering signal. In contrast, training samples of the interfering signal for interferer dictionary learning can be obtained during the absence of the desired signal through voice activity detection (VAD). Since the reference signal of an interfering signal is coherent with the interferer dictionary, it can be well reconstructed by sparse coding, while the residual desired signal is removed. The performance of the GSC is improved because the estimate of the interfering signal obtained with the proposed reference signal is more accurate. Simulations and experiments in a real acoustic environment show that the proposed method is effective in suppressing interfering signals.
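The core idea of the abstract, sparse coding a reference signal against an interferer dictionary so that leaked desired signal is suppressed, can be illustrated with a minimal sketch. Everything in it is an assumption for illustration and not the authors' implementation: the dictionary `D` is a fixed set of cosine atoms standing in for a dictionary learned from VAD-selected interference-only frames, the sparse coder is a plain orthogonal matching pursuit (`omp`, a hypothetical helper; the abstract does not name the coder used), and the signals are synthetic.

```python
import numpy as np

def omp(D, x, n_nonzero):
    """Orthogonal matching pursuit: approximate x with at most n_nonzero atoms of D."""
    residual = x.copy()
    support = []
    sol = np.zeros(0)
    for _ in range(n_nonzero):
        # Pick the atom most correlated with the current residual.
        k = int(np.argmax(np.abs(D.T @ residual)))
        if k in support:
            break
        support.append(k)
        # Least-squares fit on the selected atoms, then update the residual.
        sol, *_ = np.linalg.lstsq(D[:, support], x, rcond=None)
        residual = x - D[:, support] @ sol
    coef = np.zeros(D.shape[1])
    coef[support] = sol
    return coef

# Assumed stand-in for a learned interferer dictionary: 16 unit-norm cosine atoms.
n = 64
t = np.arange(n)
freqs = np.arange(1, 17)
D = np.cos(2 * np.pi * np.outer(t, freqs) / n)
D /= np.linalg.norm(D, axis=0)

# Synthetic interferer that is sparse in D (atoms 2 and 9).
interferer = 1.0 * D[:, 2] + 0.5 * D[:, 9]

# Synthetic "leaked desired signal": a small component not modeled by D.
rng = np.random.default_rng(0)
leak = rng.standard_normal(n)
leak *= 0.2 / np.linalg.norm(leak)

# Reference signal as a GSC blocking branch might deliver it: interferer plus leakage.
reference = interferer + leak

# Sparse coding keeps the part explained by the interferer dictionary.
coef = omp(D, reference, n_nonzero=2)
cleaned_reference = D @ coef

err_raw = np.linalg.norm(reference - interferer)
err_coded = np.linalg.norm(cleaned_reference - interferer)
print("selected atoms:", np.flatnonzero(coef))
print("error before sparse coding:", err_raw)
print("error after sparse coding: ", err_coded)
```

In this toy setting the sparse reconstruction is closer to the true interferer than the raw reference, because only the small part of the leakage that falls in the span of the selected atoms survives. A real system would learn the dictionary from data and choose the sparsity level accordingly.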
Pages: 1154 - 1163
Number of pages: 9