Dictionary Learning for Sparse Audio Inpainting

被引:8
|
作者
Taubock, Georg [1 ]
Rajbamshi, Shristi [1 ]
Balazs, Peter [1 ]
机构
[1] Austrian Acad Sci, Acoust Res Inst, A-1040 Vienna, Austria
关键词
Reliability; Dictionaries; Signal processing algorithms; Machine learning; Time-frequency analysis; Time-domain analysis; Frequency modulation; Audio inpainting; convex; dictionary; frame; Gabor; learning; optimization; sparsity; time-frequency;
D O I
10.1109/JSTSP.2020.3046422
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The objective of audio inpainting is to fill a gap in an audio signal. This is ideally done by reconstructing the original signal or, at least, by inferring a meaningful surrogate signal. We propose a novel approach applying sparse modeling in the time-frequency (TF) domain. In particular, we devise a dictionary learning technique which learns the dictionary from reliable parts around the gap with the goal to obtain a signal representation with increased TF sparsity. This is based on a basis optimization technique to deform a given Gabor frame such that the sparsity of the analysis coefficients of the resulting frame is maximized. Furthermore, we modify the SParse Audio INpainter (SPAIN) for both the analysis and the synthesis model such that it is able to exploit the increased TF sparsity and-in turn-benefits from dictionary learning. Our experiments demonstrate that the developed methods achieve significant gains in terms of signal-to-distortion ratio (SDR) and objective difference grade (ODG) compared with several state-of-the-art audio inpainting techniques.
引用
收藏
页码:104 / 119
页数:16
相关论文
共 50 条
  • [41] Sparse Dictionary Learning for Blind Hyperspectral Unmixing
    Liu, Yang
    Guo, Yi
    Li, Feng
    Xin, Lei
    Huang, Puming
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2019, 16 (04) : 578 - 582
  • [42] Secure Overcomplete Dictionary Learning for Sparse Representation
    Nakachi, Takayuki
    Bandoh, Yukihiro
    Kiya, Hitoshi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2020, E103D (01) : 50 - 58
  • [43] An MDL Framework for Sparse Coding and Dictionary Learning
    Ramirez, Ignacio
    Sapiro, Guillermo
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2012, 60 (06) : 2913 - 2927
  • [44] MULTILEVEL DICTIONARY LEARNING FOR SPARSE REPRESENTATION OF IMAGES
    Thiagarajan, Jayaraman J.
    Ramamurthy, Karthikeyan N.
    Spanias, Andreas
    2011 IEEE DIGITAL SIGNAL PROCESSING WORKSHOP AND IEEE SIGNAL PROCESSING EDUCATION WORKSHOP (DSP/SPE), 2011, : 271 - 276
  • [45] Learning Discriminative Dictionary for Group Sparse Representation
    Sun, Yubao
    Liu, Qingshan
    Tang, Jinhui
    Tao, Dacheng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2014, 23 (09) : 3816 - 3828
  • [46] COMPRESSIBLE DICTIONARY LEARNING FOR FAST SPARSE APPROXIMATIONS
    Yaghoobi, Mehrdad
    Davies, Mike E.
    2009 IEEE/SP 15TH WORKSHOP ON STATISTICAL SIGNAL PROCESSING, VOLS 1 AND 2, 2009, : 661 - 664
  • [47] Bayesian sparse reconstruction based on dictionary learning
    Wang, Yan
    Ke, Jun
    ADVANCED OPTICAL IMAGING TECHNOLOGIES III, 2020, 11549
  • [48] MULTISCALE DICTIONARY LEARNING FOR HIERARCHICAL SPARSE REPRESENTATION
    Shen, Yangmei
    Xiong, Hongkai
    Dai, Wenrui
    2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2017, : 1332 - 1337
  • [49] ADL: Active Dictionary Learning for Sparse Representation
    Tang, Bo
    Xu, Jin
    He, Haibo
    Man, Hong
    2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 2723 - 2729
  • [50] Sparse ISAR Imaging Exploiting Dictionary Learning
    Hu Changyu
    Wang Ling
    Zhu Dongqiang
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2019, 41 (07) : 1735 - 1742