Dictionary Learning for Sparse Audio Inpainting

Cited by: 8

Authors:

Taubock, Georg [1]

Rajbamshi, Shristi [1]

Balazs, Peter [1]

Affiliations:

[1] Austrian Acad Sci, Acoust Res Inst, A-1040 Vienna, Austria

Source:

IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING | 2021 / Vol. 15 / No. 01

Keywords:

Reliability; Dictionaries; Signal processing algorithms; Machine learning; Time-frequency analysis; Time-domain analysis; Frequency modulation; Audio inpainting; convex; dictionary; frame; Gabor; learning; optimization; sparsity; time-frequency;

DOI:

10.1109/JSTSP.2020.3046422

Chinese Library Classification (CLC):

TM [Electrical Engineering]; TN [Electronics and Communication Technology];

Subject classification codes:

0808 ; 0809 ;

Abstract:

The objective of audio inpainting is to fill a gap in an audio signal. This is ideally done by reconstructing the original signal or, at least, by inferring a meaningful surrogate signal. We propose a novel approach applying sparse modeling in the time-frequency (TF) domain. In particular, we devise a dictionary learning technique which learns the dictionary from reliable parts around the gap with the goal of obtaining a signal representation with increased TF sparsity. This is based on a basis optimization technique to deform a given Gabor frame such that the sparsity of the analysis coefficients of the resulting frame is maximized. Furthermore, we modify the SParse Audio INpainter (SPAIN) for both the analysis and the synthesis model such that it is able to exploit the increased TF sparsity and, in turn, benefits from dictionary learning. Our experiments demonstrate that the developed methods achieve significant gains in terms of signal-to-distortion ratio (SDR) and objective difference grade (ODG) compared with several state-of-the-art audio inpainting techniques.
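As a rough illustration of the SDR figure of merit mentioned in the abstract, the sketch below computes a signal-to-distortion ratio in decibels using the common definition 10 log10(||x||^2 / ||x - x_hat||^2). The function name `sdr_db`, the toy sine signal, the gap position, and the choice to evaluate only inside the gap are all assumptions made for this example; the record does not state the exact evaluation protocol used in the paper.

```python
import numpy as np


def sdr_db(reference, estimate):
    """Signal-to-distortion ratio in dB (hypothetical helper, not from the paper).

    Uses the common definition 10*log10(||ref||^2 / ||ref - est||^2).
    """
    reference = np.asarray(reference, dtype=float)
    estimate = np.asarray(estimate, dtype=float)
    error_energy = np.sum((reference - estimate) ** 2)
    if error_energy == 0.0:
        # Perfect reconstruction: the ratio is unbounded.
        return float("inf")
    return 10.0 * np.log10(np.sum(reference ** 2) / error_energy)


# Toy setup (assumed values): a 440 Hz sine with a 200-sample gap.
fs = 16000
t = np.arange(1024) / fs
clean = np.sin(2 * np.pi * 440 * t)
gap = slice(400, 600)

# Baseline "inpainting": leave the gap filled with zeros.
zero_filled = clean.copy()
zero_filled[gap] = 0.0

# Evaluate only on the gap samples (an assumption for this sketch).
baseline_sdr = sdr_db(clean[gap], zero_filled[gap])
print(f"SDR of zero-filled gap: {baseline_sdr:.2f} dB")
```

A zero-filled gap gives 0 dB under this definition, since the error equals the reference inside the gap; any inpainting method that recovers part of the signal should score above that baseline.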


Pages: 104-119

Page count: 16
