Optimization of learned dictionary for sparse coding in speech processing

Cited by: 11

Authors:

He, Yongjun [1]

Sun, Guanglu [1]

Han, Jiqing [2]

Affiliations:

[1] Harbin Univ Sci & Technol, Sch Comp Sci & Technol, Harbin 150080, Peoples R China

[2] Harbin Inst Technol, Harbin 150001, Peoples R China

Source:

NEUROCOMPUTING | 2016 / Vol. 173

Funding:

National Natural Science Foundation of China; China Postdoctoral Science Foundation;

Keywords:

Sparse coding; Speech denoising; Speech recognition; Dictionary optimization; K-SVD; OVERCOMPLETE DICTIONARIES; REPRESENTATION; ALGORITHM; CLASSIFICATION; REGRESSION; SEPARATION; EQUATIONS; SIGNALS; SYSTEMS;

DOI:

10.1016/j.neucom.2015.03.061

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

As a promising technique, sparse coding has been widely used for the analysis, representation, compression, denoising and separation of speech. This technique needs a good dictionary which contains atoms to represent speech signals. Although many methods have been proposed to learn such a dictionary, there are still two problems. First, unimportant atoms bring a heavy computational load to sparse decomposition and reconstruction, which prevents sparse coding from real-time application. Second, in speech denoising and separation, harmful atoms have no or ignorable contributions to reducing the sparsity degree but increase the source confusion, resulting in severe distortions. To solve these two problems, we first analyze the inherent assumptions of sparse coding and show that distortion can be caused if the assumptions do not hold true. Next, we propose two methods to optimize a given dictionary by removing unimportant atoms and harmful atoms, respectively. Experiments show that the proposed methods can further improve the performance of dictionaries. (C) 2015 Elsevier B.V. All rights reserved.
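
The idea of removing unimportant atoms can be illustrated with a minimal sketch. This is not the authors' method: the abstract does not state their selection criterion, so the usage-based rule below is an assumption chosen only for illustration. The helper names (`matching_pursuit`, `prune_by_usage`) and the parameters `n_nonzero` and `min_share` are hypothetical. The sketch sparse-codes a few toy signals with a greedy matching pursuit over unit-norm atoms, accumulates how much each atom is used, and drops atoms whose share of total usage falls below a threshold.

```python
import math


def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def matching_pursuit(signal, atoms, n_nonzero):
    """Greedy sparse code over unit-norm atoms: returns {atom_index: coefficient}."""
    residual = list(signal)
    coeffs = {}
    for _ in range(n_nonzero):
        # Pick the atom most correlated with the current residual.
        scores = [dot(residual, a) for a in atoms]
        best = max(range(len(atoms)), key=lambda i: abs(scores[i]))
        c = scores[best]
        if abs(c) < 1e-12:
            break  # residual is (numerically) orthogonal to every atom
        coeffs[best] = coeffs.get(best, 0.0) + c
        residual = [r - c * a for r, a in zip(residual, atoms[best])]
    return coeffs


def prune_by_usage(atoms, signals, n_nonzero=1, min_share=0.05):
    """Drop atoms whose share of total absolute coefficient mass is below min_share.

    Assumption: 'unimportant' is approximated by low usage on the given signals.
    """
    usage = [0.0] * len(atoms)
    for s in signals:
        for i, c in matching_pursuit(s, atoms, n_nonzero).items():
            usage[i] += abs(c)
    total = sum(usage) or 1.0
    keep = [i for i, u in enumerate(usage) if u / total >= min_share]
    return [atoms[i] for i in keep], keep


# Toy dictionary: two axis-aligned atoms plus a diagonal atom that is never selected.
h = 1.0 / math.sqrt(2.0)
atoms = [[1.0, 0.0], [0.0, 1.0], [h, h]]
signals = [[2.0, 0.0], [0.0, 3.0], [1.0, 0.0], [0.0, 1.0]]

pruned, kept = prune_by_usage(atoms, signals)
print(kept)  # indices of the atoms that survive pruning
```

A smaller dictionary reduces the number of correlations computed per pursuit step, which is the computational saving the abstract refers to. A real speech system would use learned atoms (for example from K-SVD) and a criterion validated on held-out data.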

Pages: 471-482

Page count: 12
