Speech enhancement with a GSC-like structure employing sparse coding

被引：0

作者：

Li-chun Yang

Yun-tao Qian

机构：

[1] Zhejiang University,College of Computer Science and Technology

[2] Zhejiang Wanli University,Intelligent Control Research Institute

来源：

Journal of Zhejiang University SCIENCE C | 2014年 / 15卷

关键词：

Generalized sidelobe canceller; Speech enhancement; Voice activity detection; Dictionary learning; Sparse coding; TN912.35;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Speech communication is often influenced by various types of interfering signals. To improve the quality of the desired signal, a generalized sidelobe canceller (GSC), which uses a reference signal to estimate the interfering signal, is attracting attention of researchers. However, the interference suppression of GSC is limited since a little residual desired signal leaks into the reference signal. To overcome this problem, we use sparse coding to suppress the residual desired signal while preserving the reference signal. Sparse coding with the learned dictionary is usually used to reconstruct the desired signal. As the training samples of a desired signal for dictionary learning are not observable in the real environment, the reconstructed desired signal may contain a lot of residual interfering signal. In contrast, the training samples of the interfering signal during the absence of the desired signal for interferer dictionary learning can be achieved through voice activity detection (VAD). Since the reference signal of an interfering signal is coherent to the interferer dictionary, it can be well restructured by sparse coding, while the residual desired signal will be removed. The performance of GSC will be improved since the estimate of the interfering signal with the proposed reference signal is more accurate than ever. Simulation and experiments on a real acoustic environment show that our proposed method is effective in suppressing interfering signals.

引用

页码：1154 / 1163

页数：9

共 50 条

[1] Speech enhancement with a GSC-like structure employing sparse coding
Yang, Li-chun
Qian, Yun-tao
JOURNAL OF ZHEJIANG UNIVERSITY-SCIENCE C-COMPUTERS & ELECTRONICS, 2014, 15 (12): : 1154 - 1163
[2] Speech enhancement with a GSC-like structure employing sparse coding
Li-chun YANG
Yun-tao QIAN
Frontiers of Information Technology & Electronic Engineering, 2014, (12) : 1154 - 1163
[3] Speech Enhancement With a GSC-Like Structure Employing Eigenvector-Based Transfer Function Ratios Estimation
Krueger, Alexander
Warsitz, Ernst
Haeb-Umbach, Reinhold
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (01): : 206 - 219
[4] SPEECH ENHANCEMENT WITH SPARSE CODING IN LEARNED DICTIONARIES
Sigg, Christian D.
Dikk, Tomas
Buhmann, Joachim M.
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4758 - 4761
[5] A speech enhancement method employing sparse representation of power spectral density
Zhao, Yanping
Zhao, Xiaohui
Wang, Bo
Journal of Information and Computational Science, 2013, 10 (06): : 1705 - 1714
[6] Spectrum enhancement with sparse coding for robust speech recognition
He, Yongjun
Sun, Guanglu
Han, Jiqing
DIGITAL SIGNAL PROCESSING, 2015, 43 : 59 - 70
[7] Speech enhancement based on Sparse Code Shrinkage employing multiple speech models
Jancovic, Peter
Zou, Xin
Koekueer, Muenevver
SPEECH COMMUNICATION, 2012, 54 (01) : 108 - 118
[8] Speech enhancement via sparse coding with ideal binary mask
Sun, Juan
Tang, Yibin
Jiang, Aimin
Xu, Ning
Zhou, Lin
2014 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2014, : 537 - 540
[9] Speech enhancement based on the general transfer function GSC and postfiltering
Gannot, S
Cohen, I
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2004, 12 (06): : 561 - 571
[10] Threshold reduction for improving sparse coding shrinkage performance in speech enhancement
Faraji, Neda
Ahadi, S. M.
Shariati, S. Saloomeh
2007 6TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS & SIGNAL PROCESSING, VOLS 1-4, 2007, : 1220 - +

← 1 2 3 4 5 →