Speech enhancement method based on low-rank approximation in a reproducing kernel Hilbert space

被引:5
作者
Zhao, Yanping [1 ]
Qiu, Robert Caiming [2 ]
Zhao, Xiaohui [1 ]
Wang, Bo [1 ]
机构
[1] Jilin Univ, Coll Commun Engn, Changchun 130012, Peoples R China
[2] Tennessee Technol Univ, Dept Elect & Comp Engn, Cookeville, TN 38505 USA
关键词
Speech enhancement; Kernel function; Low-rank approximation; Error bound; NOISE;
D O I
10.1016/j.apacoust.2016.05.008
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Speech signal is corrupted unavoidably by noisy environment in subway, factory, and restaurant or speech from other speakers in speech communication. Speech enhancement methods have been widely studied to minimize noise influence in different linear transform domain, such as discrete Fourier transform domain, Karhunen-Loeve transform domain or discrete cosine transform domain. Kernel method as a nonlinear transform has received a lot of interest recently and is commonly used in many applications including audio signal processing. However this kind of method typically suffers from the computational complexity. In this paper, we propose a speech enhancement algorithm using low-rank approximation in a reproducing kernel Hilbert space to reduce storage space and running time with very little performance loss in the enhanced speech. We also analyze the root mean squared error bound between the enhanced vectors obtained by the approximation kernel matrix and the full kernel matrix. Simulations show that the proposed method can improve the computation speed of the algorithm with the approximate performance compared with that of the full kernel matrix. (C) 2016 Elsevier Ltd. All rights reserved.
引用
收藏
页码:79 / 83
页数:5
相关论文
共 21 条
[1]  
Bach F., JMLR WORKSHOP C P, V30, P1
[2]   Signal subspace approach for psychoacoustically motivated speech enhancement [J].
Borowicz, Adam ;
Petrovsky, Alexandr .
SPEECH COMMUNICATION, 2011, 53 (02) :210-219
[3]  
Cortes C, INT C ART INT STAT, P113
[4]   A DCT-Based Speech Enhancement System With Pitch Synchronous Analysis [J].
Ding, Huijun ;
Soon, Ing Yann ;
Yeo, Chai Kiat .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (08) :2614-2623
[5]  
Drineas P, 2005, J MACH LEARN RES, V6, P2153
[6]  
Fazel A, P IEEE INT S CIRC SY, P113
[7]   Sparse Auditory Reproducing Kernel (SPARK) Features for Noise-Robust Speech Recognition [J].
Fazel, Amin ;
Chakrabartty, Shantanu .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (04) :1362-1371
[8]  
Garofolo J. S., 1988, Getting started with the darpa timit cd-rom: an acoustic phonetic continuous speech database, V107, P16
[9]   Kernel Feature Template Matching for Spectrum Sensing [J].
Hou, Shujie ;
Qiu, Robert Caiming .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2014, 63 (05) :2258-2271
[10]  
Hsieh C., 2014, ADV NEURAL INFORM PR, P3689