Optimization of learned dictionary for sparse coding in speech processing

Cited by: 11
Authors
He, Yongjun [1]
Sun, Guanglu [1]
Han, Jiqing [2]
Affiliations
[1] Harbin Univ Sci & Technol, Sch Comp Sci & Technol, Harbin 150080, Peoples R China
[2] Harbin Inst Technol, Harbin 150001, Peoples R China
Funding
National Natural Science Foundation of China; China Postdoctoral Science Foundation;
Keywords
Sparse coding; Speech denoising; Speech recognition; Dictionary optimization; K-SVD; OVERCOMPLETE DICTIONARIES; REPRESENTATION; ALGORITHM; CLASSIFICATION; REGRESSION; SEPARATION; EQUATIONS; SIGNALS; SYSTEMS;
DOI
10.1016/j.neucom.2015.03.061
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
As a promising technique, sparse coding has been widely used for the analysis, representation, compression, denoising and separation of speech. This technique needs a good dictionary which contains atoms to represent speech signals. Although many methods have been proposed to learn such a dictionary, there are still two problems. First, unimportant atoms bring a heavy computational load to sparse decomposition and reconstruction, which prevents sparse coding from real-time application. Second, in speech denoising and separation, harmful atoms have no or ignorable contributions to reducing the sparsity degree but increase the source confusion, resulting in severe distortions. To solve these two problems, we first analyze the inherent assumptions of sparse coding and show that distortion can be caused if the assumptions do not hold true. Next, we propose two methods to optimize a given dictionary by removing unimportant atoms and harmful atoms, respectively. Experiments show that the proposed methods can further improve the performance of dictionaries. (C) 2015 Elsevier B.V. All rights reserved.
Pages: 471-482
Page count: 12
Related papers
50 in total
  • [31] Brain tumor classification and segmentation using sparse coding and dictionary learning
    Al-Shaikhli, Saif Dawood Salman
    Yang, Michael Ying
    Rosenhahn, Bodo
    BIOMEDICAL ENGINEERING-BIOMEDIZINISCHE TECHNIK, 2016, 61 (04): : 413 - 429
  • [32] Sparse Coding based Robust Image Denoising via Coupled Dictionary
    Singh, Kuldeep
    Viswakarma, D. K.
    Walia, Gurjit S.
    Kapoor, Rajiv
    2016 1ST INDIA INTERNATIONAL CONFERENCE ON INFORMATION PROCESSING (IICIP), 2016,
  • [33] Analysis of WD Face Dictionary for Sparse Coding Based Face Recognition
    Thavalengal, Shejin
    Sao, Anil Kumar
    IMAGE ANALYSIS AND PROCESSING (ICIAP 2013), PT 1, 2013, 8156 : 221 - 230
  • [34] Improved Speaker Verification using Block Sparse Coding over Joint Speaker-Channel Learned Dictionary
    Sreeram, Ganji
    Haris, B. C.
    Sinha, Rohit
    TENCON 2015 - 2015 IEEE REGION 10 CONFERENCE, 2015,
  • [35] Initial Fault Feature Extraction Via Sparse Representation Over Learned Dictionary
    Yu Fa-jun
    Zhou Feng-xing
    Yan Bao-kang
    2015 27TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2015, : 1693 - 1696
  • [36] Hierarchical Sparse Dictionary Learning
    Bian, Xiao
    Ning, Xia
    Jiang, Geoff
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2015, PT II, 2015, 9285 : 687 - 700
  • [37] Parallel and Hierarchical Decision Making for Sparse Coding in Speech Recognition
    Wang, Dong
    Vipperla, Ravichander
    Evans, Nicholas
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2568 - 2571
  • [38] Robust Speaker Verification With Joint Sparse Coding Over Learned Dictionaries
    Haris, B. C.
    Sinha, Rohit
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2015, 10 (10) : 2143 - 2157
  • [39] Supervised dictionary learning for blind image quality assessment using quality-constraint sparse coding
    Jiang, Qiuping
    Shao, Feng
    Jiang, Gangyi
    Yu, Mei
    Peng, Zongju
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2015, 33 : 123 - 133
  • [40] Time warp invariant kSVD: Sparse coding and dictionary learning for time series under time warp
    Yazdi, Saeed Varasteh
    Douzal-Chouakria, Ahlame
    PATTERN RECOGNITION LETTERS, 2018, 112 : 1 - 8