Optimization of learned dictionary for sparse coding in speech processing

被引：11

作者：

He, Yongjun ^{[1
]}

Sun, Guanglu ^{[1
]}

Han, Jiqing ^{[2
]}

机构：

[1] Harbin Univ Sci & Technol, Sch Comp Sci & Technol, Harbin 150080, Peoples R China

[2] Harbin Inst Technol, Harbin 150001, Peoples R China

来源：

NEUROCOMPUTING | 2016年 / 173卷

基金：

中国国家自然科学基金; 中国博士后科学基金;

关键词：

Sparse coding; Speech denoising; Speech recognition; Dictionary optimization; K-SVD; OVERCOMPLETE DICTIONARIES; REPRESENTATION; ALGORITHM; CLASSIFICATION; REGRESSION; SEPARATION; EQUATIONS; SIGNALS; SYSTEMS;

D O I：

10.1016/j.neucom.2015.03.061

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

As a promising technique, sparse coding has been widely used for the analysis, representation, compression, denoising and separation of speech. This technique needs a good dictionary which contains atoms to represent speech signals. Although many methods have been proposed to learn such a dictionary, there are still two problems. First, unimportant atoms bring a heavy computational load to sparse decomposition and reconstruction, which prevents sparse coding from real-time application. Second, in speech denoising and separation, harmful atoms have no or ignorable contributions to reducing the sparsity degree but increase the source confusion, resulting in severe distortions. To solve these two problems, we first analyze the inherent assumptions of sparse coding and show that distortion can be caused if the assumptions do not hold true. Next, we propose two methods to optimize a given dictionary by removing unimportant atoms and harmful atoms, respectively. Experiments show that the proposed methods can further improve the performance of dictionaries. (C) 2015 Elsevier B.V. All rights reserved.

引用

页码：471 / 482

页数：12

共 50 条

[1] Dictionary evaluation and optimization for sparse coding based speech processing
He, Yongjun
Chen, Deyun
Sun, Guanglu
Han, Jiqing
INFORMATION SCIENCES, 2015, 310 : 77 - 96
[2] Spectrum enhancement with sparse coding for robust speech recognition
He, Yongjun
Sun, Guanglu
Han, Jiqing
DIGITAL SIGNAL PROCESSING, 2015, 43 : 59 - 70
[3] An MDL Framework for Sparse Coding and Dictionary Learning
Ramirez, Ignacio
Sapiro, Guillermo
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2012, 60 (06) : 2913 - 2927
[4] SPEECH ENHANCEMENT WITH SPARSE CODING IN LEARNED DICTIONARIES
Sigg, Christian D.
Dikk, Tomas
Buhmann, Joachim M.
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4758 - 4761
[5] SPARSE CODING FOR SPEECH RECOGNITION
Sivaram, G. S. V. S.
Nemala, Sridhar Krishna
Elhilali, Mounya
Trac D. Tran
Hermansky, Hynek
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4346 - 4349
[6] Sparse coding with adaptive dictionary learning for underdetermined blind speech separation
Xu, Tao
Wang, Wenwu
Dai, Wei
SPEECH COMMUNICATION, 2013, 55 (03) : 432 - 450
[7] Dictionary Optimization for Block-Sparse Representations
Zelnik-Manor, Lihi
Rosenblum, Kevin
Eldar, Yonina C.
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2012, 60 (05) : 2386 - 2395
[8] Cochannel Speech Segregation with Sparse Coding
Ingale, Pallavi P.
Nalbalwar, S. L.
2016 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, AND OPTIMIZATION TECHNIQUES (ICEEOT), 2016, : 4589 - 4592
[9] Continuous speech recognition with sparse coding
Smit, W. J.
Barnard, E.
COMPUTER SPEECH AND LANGUAGE, 2009, 23 (02) : 200 - 219
[10] Sparse coding and dictionary learning with class-specific group sparsity
Sun, Yuping
Quan, Yuhui
Fu, Jia
NEURAL COMPUTING & APPLICATIONS, 2018, 30 (04) : 1265 - 1275

← 1 2 3 4 5 →