Optimization of learned dictionary for sparse coding in speech processing

被引:11
|
作者
He, Yongjun [1 ]
Sun, Guanglu [1 ]
Han, Jiqing [2 ]
机构
[1] Harbin Univ Sci & Technol, Sch Comp Sci & Technol, Harbin 150080, Peoples R China
[2] Harbin Inst Technol, Harbin 150001, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Sparse coding; Speech denoising; Speech recognition; Dictionary optimization; K-SVD; OVERCOMPLETE DICTIONARIES; REPRESENTATION; ALGORITHM; CLASSIFICATION; REGRESSION; SEPARATION; EQUATIONS; SIGNALS; SYSTEMS;
D O I
10.1016/j.neucom.2015.03.061
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As a promising technique, sparse coding has been widely used for the analysis, representation, compression, denoising and separation of speech. This technique needs a good dictionary which contains atoms to represent speech signals. Although many methods have been proposed to learn such a dictionary, there are still two problems. First, unimportant atoms bring a heavy computational load to sparse decomposition and reconstruction, which prevents sparse coding from real-time application. Second, in speech denoising and separation, harmful atoms have no or ignorable contributions to reducing the sparsity degree but increase the source confusion, resulting in severe distortions. To solve these two problems, we first analyze the inherent assumptions of sparse coding and show that distortion can be caused if the assumptions do not hold true. Next, we propose two methods to optimize a given dictionary by removing unimportant atoms and harmful atoms, respectively. Experiments show that the proposed methods can further improve the performance of dictionaries. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:471 / 482
页数:12
相关论文
共 50 条
  • [21] Image Copy Detection via Dictionary Learning and Sparse Coding
    Lin, Chih-Yang
    Kang, Li-Wei
    Muchtar, Kahlil
    Wei, Jyh-Da
    Yeh, Chia-Hung
    THIRD INTERNATIONAL CONFERENCE ON INFORMATION SECURITY AND INTELLIGENT CONTROL (ISIC 2012), 2012, : 242 - 245
  • [22] LEARNING AN ADAPTIVE DICTIONARY STRUCTURE FOR EFFICIENT IMAGE SPARSE CODING
    Mazaheri, Jeremy Aghaei
    Guillemot, Christine
    labit, ClauDe
    2013 PICTURE CODING SYMPOSIUM (PCS), 2013, : 1 - 4
  • [23] MEMORY-ASSISTED SEISMIC SIGNAL COMPRESSION BASED ON DICTIONARY LEARNING AND SPARSE CODING
    Tian, Xin
    Abdi, Afshin
    Liu, Entao
    Fekri, Faramarz
    2017 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2017), 2017, : 358 - 362
  • [24] Sparse Representation with Optimized Learned Dictionary for Robust Voice Activity Detection
    You, Datao
    Han, Jiqing
    Zheng, Guibin
    Zheng, Tieran
    Li, Jie
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2014, 33 (07) : 2267 - 2291
  • [25] Hierarchical sparse coding framework for speech emotion recognition
    Torres-Boza, Diana
    Oveneke, Meshia Cedric
    Wang, Fengna
    Jiang, Dongmei
    Verhelst, Werner
    Sahli, Hichem
    SPEECH COMMUNICATION, 2018, 99 : 80 - 89
  • [26] Sparse coding based features for speech units classification
    Sharma, Pulkit
    Abrol, Vinayak
    Dileep, A. D.
    Sao, Anil Kumar
    COMPUTER SPEECH AND LANGUAGE, 2018, 47 : 333 - 350
  • [27] Polynomial dictionary learning algorithms in sparse representations
    Guan, Jian
    Wang, Xuan
    Feng, Pengming
    Dong, Jing
    Chambers, Jonathon
    Jiang, Zoe L.
    Wang, Wenwu
    SIGNAL PROCESSING, 2018, 142 : 492 - 503
  • [28] Image classification based on sparse-coded features using sparse coding technique for aerial imagery: a hybrid dictionary approach
    Qayyum, Abdul
    Malik, Aamir Saeed
    Saad, Naufal M.
    Iqbal, Mahboob
    Abdullah, Mohd Faris
    Rasheed, Waqas
    Abdullah, Tuan A. B. Rashid
    Bin Jafaar, Mohd Yaqoob
    NEURAL COMPUTING & APPLICATIONS, 2019, 31 (08) : 3587 - 3607
  • [29] Sparse representation with learned multiscale dictionary for image fusion
    Yin, Haitao
    NEUROCOMPUTING, 2015, 148 : 600 - 610
  • [30] Sparse representation over learned dictionary for symbol recognition
    Thanh Ha Do
    Tabbone, Salvatore
    Terrades, Oriol Ramos
    SIGNAL PROCESSING, 2016, 125 : 36 - 47