Speech enhancement with a GSC-like structure employing sparse coding

被引:0
作者
Li-chun Yang
Yun-tao Qian
机构
[1] Zhejiang University,College of Computer Science and Technology
[2] Zhejiang Wanli University,Intelligent Control Research Institute
来源
Journal of Zhejiang University SCIENCE C | 2014年 / 15卷
关键词
Generalized sidelobe canceller; Speech enhancement; Voice activity detection; Dictionary learning; Sparse coding; TN912.35;
D O I
暂无
中图分类号
学科分类号
摘要
Speech communication is often influenced by various types of interfering signals. To improve the quality of the desired signal, a generalized sidelobe canceller (GSC), which uses a reference signal to estimate the interfering signal, is attracting attention of researchers. However, the interference suppression of GSC is limited since a little residual desired signal leaks into the reference signal. To overcome this problem, we use sparse coding to suppress the residual desired signal while preserving the reference signal. Sparse coding with the learned dictionary is usually used to reconstruct the desired signal. As the training samples of a desired signal for dictionary learning are not observable in the real environment, the reconstructed desired signal may contain a lot of residual interfering signal. In contrast, the training samples of the interfering signal during the absence of the desired signal for interferer dictionary learning can be achieved through voice activity detection (VAD). Since the reference signal of an interfering signal is coherent to the interferer dictionary, it can be well restructured by sparse coding, while the residual desired signal will be removed. The performance of GSC will be improved since the estimate of the interfering signal with the proposed reference signal is more accurate than ever. Simulation and experiments on a real acoustic environment show that our proposed method is effective in suppressing interfering signals.
引用
收藏
页码:1154 / 1163
页数:9
相关论文
共 50 条
  • [31] Multimodal image enhancement using convolutional sparse coding
    Ahmed, Awais
    Kun, She
    Ahmed, Junaid
    Hayat, Shaukat
    Khan, Abdullah Aman
    MULTIMEDIA SYSTEMS, 2023, 29 (04) : 2099 - 2110
  • [32] Speech enhancement using group complementary joint sparse representations in modulation domain
    Xie, Zhuopeng
    Yang, Huichao
    Ye, Zhongfu
    APPLIED ACOUSTICS, 2022, 201
  • [33] Hyper-parameterization of sparse reconstruction for speech enhancement
    Shi, Yue
    Low, Siow Yong
    Yiu, Ka Fai Cedric
    APPLIED ACOUSTICS, 2018, 138 : 72 - 79
  • [34] Sparse Mixture of Local Experts for Efficient Speech Enhancement
    Sivaraman, Aswin
    Kim, Minje
    INTERSPEECH 2020, 2020, : 4526 - 4530
  • [35] Dictionary evaluation and optimization for sparse coding based speech processing
    He, Yongjun
    Chen, Deyun
    Sun, Guanglu
    Han, Jiqing
    INFORMATION SCIENCES, 2015, 310 : 77 - 96
  • [36] Parallel and Hierarchical Decision Making for Sparse Coding in Speech Recognition
    Wang, Dong
    Vipperla, Ravichander
    Evans, Nicholas
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2568 - 2571
  • [37] A Novel Single Channel Speech Enhancement Algorithm Based on Sparse Representation and Dictionary Learning
    Li, Yinan
    Wu, Haijia
    Zeng, Li
    Zhang, Xiongwei
    JibinYang
    2013 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP 2013), 2013,
  • [38] Structure Preserving Sparse Coding for Data Representation
    Zhenqiu Shu
    Xiao-jun Wu
    Cong Hu
    Neural Processing Letters, 2018, 48 : 1705 - 1719
  • [39] Structure regularized sparse coding for data representation
    Wang, Xiaoming
    Wang, Shitong
    Huang, Zengxi
    Du, Yajun
    KNOWLEDGE-BASED SYSTEMS, 2019, 174 : 87 - 102
  • [40] Temporal Auditory Coding Features for Causal Speech Enhancement
    Thoidis, Iordanis
    Vrysis, Lazaros
    Markou, Dimitrios
    Papanikolaou, George
    ELECTRONICS, 2020, 9 (10) : 1 - 17