Speech enhancement with a GSC-like structure employing sparse coding

被引:0
|
作者
Li-chun Yang
Yun-tao Qian
机构
[1] Zhejiang University,College of Computer Science and Technology
[2] Zhejiang Wanli University,Intelligent Control Research Institute
来源
Journal of Zhejiang University SCIENCE C | 2014年 / 15卷
关键词
Generalized sidelobe canceller; Speech enhancement; Voice activity detection; Dictionary learning; Sparse coding; TN912.35;
D O I
暂无
中图分类号
学科分类号
摘要
Speech communication is often influenced by various types of interfering signals. To improve the quality of the desired signal, a generalized sidelobe canceller (GSC), which uses a reference signal to estimate the interfering signal, is attracting attention of researchers. However, the interference suppression of GSC is limited since a little residual desired signal leaks into the reference signal. To overcome this problem, we use sparse coding to suppress the residual desired signal while preserving the reference signal. Sparse coding with the learned dictionary is usually used to reconstruct the desired signal. As the training samples of a desired signal for dictionary learning are not observable in the real environment, the reconstructed desired signal may contain a lot of residual interfering signal. In contrast, the training samples of the interfering signal during the absence of the desired signal for interferer dictionary learning can be achieved through voice activity detection (VAD). Since the reference signal of an interfering signal is coherent to the interferer dictionary, it can be well restructured by sparse coding, while the residual desired signal will be removed. The performance of GSC will be improved since the estimate of the interfering signal with the proposed reference signal is more accurate than ever. Simulation and experiments on a real acoustic environment show that our proposed method is effective in suppressing interfering signals.
引用
收藏
页码:1154 / 1163
页数:9
相关论文
共 50 条
  • [1] Speech enhancement with a GSC-like structure employing sparse coding
    Yang, Li-chun
    Qian, Yun-tao
    JOURNAL OF ZHEJIANG UNIVERSITY-SCIENCE C-COMPUTERS & ELECTRONICS, 2014, 15 (12): : 1154 - 1163
  • [2] Speech enhancement with a GSC-like structure employing sparse coding
    Li-chun YANG
    Yun-tao QIAN
    Frontiers of Information Technology & Electronic Engineering, 2014, (12) : 1154 - 1163
  • [3] Speech Enhancement With a GSC-Like Structure Employing Eigenvector-Based Transfer Function Ratios Estimation
    Krueger, Alexander
    Warsitz, Ernst
    Haeb-Umbach, Reinhold
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (01): : 206 - 219
  • [4] SPEECH ENHANCEMENT WITH SPARSE CODING IN LEARNED DICTIONARIES
    Sigg, Christian D.
    Dikk, Tomas
    Buhmann, Joachim M.
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4758 - 4761
  • [5] A speech enhancement method employing sparse representation of power spectral density
    Zhao, Yanping
    Zhao, Xiaohui
    Wang, Bo
    Journal of Information and Computational Science, 2013, 10 (06): : 1705 - 1714
  • [6] Spectrum enhancement with sparse coding for robust speech recognition
    He, Yongjun
    Sun, Guanglu
    Han, Jiqing
    DIGITAL SIGNAL PROCESSING, 2015, 43 : 59 - 70
  • [7] Speech enhancement based on Sparse Code Shrinkage employing multiple speech models
    Jancovic, Peter
    Zou, Xin
    Koekueer, Muenevver
    SPEECH COMMUNICATION, 2012, 54 (01) : 108 - 118
  • [8] Speech enhancement via sparse coding with ideal binary mask
    Sun, Juan
    Tang, Yibin
    Jiang, Aimin
    Xu, Ning
    Zhou, Lin
    2014 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2014, : 537 - 540
  • [9] Speech enhancement based on the general transfer function GSC and postfiltering
    Gannot, S
    Cohen, I
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2004, 12 (06): : 561 - 571
  • [10] Threshold reduction for improving sparse coding shrinkage performance in speech enhancement
    Faraji, Neda
    Ahadi, S. M.
    Shariati, S. Saloomeh
    2007 6TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS & SIGNAL PROCESSING, VOLS 1-4, 2007, : 1220 - +