Speech enhancement with a GSC-like structure employing sparse coding

被引:0
作者
Li-chun Yang
Yun-tao Qian
机构
[1] Zhejiang University,College of Computer Science and Technology
[2] Zhejiang Wanli University,Intelligent Control Research Institute
来源
Journal of Zhejiang University SCIENCE C | 2014年 / 15卷
关键词
Generalized sidelobe canceller; Speech enhancement; Voice activity detection; Dictionary learning; Sparse coding; TN912.35;
D O I
暂无
中图分类号
学科分类号
摘要
Speech communication is often influenced by various types of interfering signals. To improve the quality of the desired signal, a generalized sidelobe canceller (GSC), which uses a reference signal to estimate the interfering signal, is attracting attention of researchers. However, the interference suppression of GSC is limited since a little residual desired signal leaks into the reference signal. To overcome this problem, we use sparse coding to suppress the residual desired signal while preserving the reference signal. Sparse coding with the learned dictionary is usually used to reconstruct the desired signal. As the training samples of a desired signal for dictionary learning are not observable in the real environment, the reconstructed desired signal may contain a lot of residual interfering signal. In contrast, the training samples of the interfering signal during the absence of the desired signal for interferer dictionary learning can be achieved through voice activity detection (VAD). Since the reference signal of an interfering signal is coherent to the interferer dictionary, it can be well restructured by sparse coding, while the residual desired signal will be removed. The performance of GSC will be improved since the estimate of the interfering signal with the proposed reference signal is more accurate than ever. Simulation and experiments on a real acoustic environment show that our proposed method is effective in suppressing interfering signals.
引用
收藏
页码:1154 / 1163
页数:9
相关论文
共 50 条
  • [41] Structure Preserving Sparse Coding for Data Representation
    Shu, Zhenqiu
    Wu, Xiao-jun
    Hu, Cong
    NEURAL PROCESSING LETTERS, 2018, 48 (03) : 1705 - 1719
  • [42] Speech Enhancement Based on Sparse Representation Using Universal Dictionary
    Huang, Ling
    Li, Lin
    He, Shan
    2013 IEEE INTERNATIONAL CONFERENCE ON ANTI-COUNTERFEITING, SECURITY AND IDENTIFICATION (ASID), 2013,
  • [43] Statistical method for sparse coding of speech including a linear predictive model
    Rufiner, Hugo L.
    Goddard, John
    Rocha, Luis F.
    Torres, Maria E.
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2006, 367 : 231 - 251
  • [44] Automatic segmentation and clustering of speech using sparse coding and metaheuristic search
    Agenbag, Wiehan
    Niesler, Thomas
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 3184 - 3188
  • [45] Global Soft Decision Employing Support Vector Machine For Speech Enhancement
    Chang, Joon-Hyuk
    Jo, Q-Haing
    Kim, Dong Kook
    Kim, Nam Soo
    IEEE SIGNAL PROCESSING LETTERS, 2009, 16 (1-3) : 57 - 60
  • [46] EXPLOITING THE HARMONIC STRUCTURE FOR SPEECH ENHANCEMENT
    Cho, Eunjoon
    Smith, Julius O., III
    Widrow, Bernard
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4569 - 4572
  • [47] A novel speech enhancement method by learnable sparse and low-rank decomposition and domain adaptation
    Mavaddaty, Samira
    Ahadi, Seyed Mohammad
    Seyedin, Sanaz
    SPEECH COMMUNICATION, 2016, 76 : 42 - 60
  • [48] Speech enhancement based on multi-task sparse representation for dual small microphone arrays
    Yang, L.-C., 1600, Editorial Board of Journal on Communications (35): : 87 - 94
  • [49] Speech Inventory Based Discriminative Training for Joint Speech Enhancement and Low-Rate Speech Coding
    Xiao, Xiaoqiang
    Nickel, Robert M.
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2398 - +
  • [50] LPCSE: Neural Speech Enhancement through Linear Predictive Coding
    Liu, Yang
    Tang, Na
    Chu, Xiaoli
    Yang, Yang
    Wang, Jun
    2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022, : 5335 - 5341