A New Sparse Representation Algorithm for Speech Denoising

被引：0

作者：

Zhou, Yan ^{[1
]}

机构：

[1] Suzhou Vocat Univ, Sch Elect Informat Engn, Suzhou, Jiangsu, Peoples R China

来源：

PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND SERVICE SYSTEM (CSSS) | 2014年 / 109卷

关键词：

speech denoising; spectrogram; K-SVD algorithm; redundant dictionary; sparse representation; ENHANCEMENT;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper proposes a new speech denoising method that uses K-SVD sparse representation algorithm. This approach is based on sparse and redundant representation over dictionary. Here, spectrogram patches are used as training samples for the initial redundant dictionary. However, since the K-SVD algorithm is limited in handling small size spectrogram, the training samples need to extend their deployment to arbitrary spectrogram sizes by defining a global spectrogram prior that forces sparsity over patches in every location in the spectrogram. Simulation experiments show that the performance of the proposed K-SVD denoising algorithm is stable, and the white noise can be effectively separated. In addition, K-SVD algorithm is a simple and effective algorithm which surpasses the redundant DCT method and Gabor dictionary. In a word, K-SVD algorithm leads to an alternative speech denoising method.

引用

页码：131 / 134

页数：4

共 12 条

[1] SUPPRESSION OF ACOUSTIC NOISE IN SPEECH USING SPECTRAL SUBTRACTION [J].

BOLL, SF .

IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1979, 27 (02) :113-120

[2]

Chen SSB, 2001, SIAM REV, V43, P129, DOI [10.1137/S003614450037906X, 10.1137/S1064827596304010]

[3] IDEAL SPATIAL ADAPTATION BY WAVELET SHRINKAGE [J].

DONOHO, DL ;

JOHNSTONE, IM .

BIOMETRIKA, 1994, 81 (03) :425-455

[4] Learning Sparse Representation Using Iterative Subspace Identification [J].

Gowreesunker, B. Vikrham ;

Tewfik, Ahmed H. .

IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2010, 58 (06) :3055-3065

[5] Speech enhancement by residual domain constrained optimization [J].

Jin, Wen ;

Scordilis, Michael S. .

SPEECH COMMUNICATION, 2006, 48 (10) :1349-1364

[6] Learning overcomplete representations [J].

Lewicki, MS ;

Sejnowski, TJ .

NEURAL COMPUTATION, 2000, 12 (02) :337-365

[7]

Loizou P.C., 2007, SPEECH DENOISING THE

[8] Joint Time-Frequency Segmentation Algorithm for Transient Speech Decomposition and Speech Enhancement [J].

Tantibundhit, Charturong ;

Pernkopf, Franz ;

Kubin, Gernot .

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (06) :1417-1428

[9] Speech enhancement using linear prediction residual [J].

Yegnanarayana, B ;

Avendano, C ;

Hermansky, H ;

Murthy, PS .

SPEECH COMMUNICATION, 1999, 28 (01) :25-42

[10]

Yu Xiaosheng, 2011, INT J DIGIT CONTENT, V5, P170

← 1 2 →