Dictionary Learning for Sparse Audio Inpainting

Cited by: 8
Authors
Taubock, Georg [1]
Rajbamshi, Shristi [1]
Balazs, Peter [1]
Affiliations
[1] Austrian Acad Sci, Acoust Res Inst, A-1040 Vienna, Austria
Keywords
Reliability; Dictionaries; Signal processing algorithms; Machine learning; Time-frequency analysis; Time-domain analysis; Frequency modulation; Audio inpainting; convex; dictionary; frame; Gabor; learning; optimization; sparsity; time-frequency
DOI
10.1109/JSTSP.2020.3046422
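Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract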
The objective of audio inpainting is to fill a gap in an audio signal. This is ideally done by reconstructing the original signal or, at least, by inferring a meaningful surrogate signal. We propose a novel approach applying sparse modeling in the time-frequency (TF) domain. In particular, we devise a dictionary learning technique which learns the dictionary from reliable parts around the gap with the goal of obtaining a signal representation with increased TF sparsity. This is based on a basis optimization technique to deform a given Gabor frame such that the sparsity of the analysis coefficients of the resulting frame is maximized. Furthermore, we modify the SParse Audio INpainter (SPAIN) for both the analysis and the synthesis model such that it is able to exploit the increased TF sparsity and, in turn, benefits from dictionary learning. Our experiments demonstrate that the developed methods achieve significant gains in terms of signal-to-distortion ratio (SDR) and objective difference grade (ODG) compared with several state-of-the-art audio inpainting techniques.
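The abstract reports gains in SDR without stating how it is computed. As a rough illustration only, the sketch below uses the commonly cited definition SDR = 10 log10(||x||^2 / ||x - x_hat||^2), evaluated on the samples inside the gap. The function name `sdr_db`, the 440 Hz test tone, and the gap position are hypothetical choices made for this example; whether the paper evaluates SDR over the gap only or over the whole signal is an assumption here.

```python
import numpy as np


def sdr_db(reference, estimate):
    """Signal-to-distortion ratio in dB (hypothetical helper).

    Assumes the common definition 10*log10(||x||^2 / ||x - x_hat||^2).
    """
    reference = np.asarray(reference, dtype=float)
    estimate = np.asarray(estimate, dtype=float)
    error_energy = np.sum((reference - estimate) ** 2)
    if error_energy == 0.0:
        # Perfect reconstruction: the ratio is unbounded.
        return np.inf
    return 10.0 * np.log10(np.sum(reference ** 2) / error_energy)


# Toy signal: one second of a 440 Hz tone at 16 kHz (illustrative values).
fs = 16000
t = np.arange(fs) / fs
x = np.sin(2 * np.pi * 440 * t)

# A 20 ms gap in the middle of the signal (assumed position and length).
gap = slice(8000, 8320)

# Baseline "reconstruction": leave the gap filled with zeros.
x_zero = x.copy()
x_zero[gap] = 0.0

# SDR restricted to the gap samples; zero filling gives 0 dB by construction,
# because the error equals the reference signal there.
sdr_zero_fill = sdr_db(x[gap], x_zero[gap])
```

Any inpainting method worth using should score above the 0 dB zero-fill baseline on the gap samples, which makes that baseline a convenient sanity check before comparing methods.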
Pages: 104-119
Page count: 16