Dictionary Learning for Sparse Audio Inpainting

被引:8
|
作者
Taubock, Georg [1 ]
Rajbamshi, Shristi [1 ]
Balazs, Peter [1 ]
机构
[1] Austrian Acad Sci, Acoust Res Inst, A-1040 Vienna, Austria
关键词
Reliability; Dictionaries; Signal processing algorithms; Machine learning; Time-frequency analysis; Time-domain analysis; Frequency modulation; Audio inpainting; convex; dictionary; frame; Gabor; learning; optimization; sparsity; time-frequency;
D O I
10.1109/JSTSP.2020.3046422
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The objective of audio inpainting is to fill a gap in an audio signal. This is ideally done by reconstructing the original signal or, at least, by inferring a meaningful surrogate signal. We propose a novel approach applying sparse modeling in the time-frequency (TF) domain. In particular, we devise a dictionary learning technique which learns the dictionary from reliable parts around the gap with the goal to obtain a signal representation with increased TF sparsity. This is based on a basis optimization technique to deform a given Gabor frame such that the sparsity of the analysis coefficients of the resulting frame is maximized. Furthermore, we modify the SParse Audio INpainter (SPAIN) for both the analysis and the synthesis model such that it is able to exploit the increased TF sparsity and-in turn-benefits from dictionary learning. Our experiments demonstrate that the developed methods achieve significant gains in terms of signal-to-distortion ratio (SDR) and objective difference grade (ODG) compared with several state-of-the-art audio inpainting techniques.
引用
收藏
页码:104 / 119
页数:16
相关论文
共 50 条
  • [1] Dictionary learning based sinogram inpainting for CT sparse reconstruction
    Li, Si
    Cao, Qing
    Chen, Yang
    Hu, Yining
    Luo, Limin
    Toumoulin, Christine
    OPTIK, 2014, 125 (12): : 2862 - 2867
  • [2] Audio Inpainting via l1-Minimization and Dictionary Learning
    Rajbamshi, Shristi
    Tauboeck, Georg
    Holighaus, Nicki
    Balazs, Peter
    29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 2149 - 2153
  • [3] Image Inpainting Method Based Sparse Analysis Model Of Synchronous Dictionary Learning
    Li, Bin
    Sun, Baohua
    Li, Dekun
    Jiang, Tong
    Li, Gang
    Li, Hao
    2021 INTERNATIONAL CONFERENCE ON IMAGE, VIDEO PROCESSING, AND ARTIFICIAL INTELLIGENCE, 2021, 12076
  • [4] Sparse Audio Inpainting with Variational Bayesian Inference
    Chantas, Giannis
    Nikolopoulos, Spiros
    Kompatsiaris, Ioannis
    2018 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2018,
  • [5] A Method for Single Frame Super Resolution with Inpainting Based on Sparse Dictionary Learning
    Umehara, Takuya
    Mimura, Kazushi
    Hirabayashi, Akira
    2014 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2014,
  • [6] Image Inpainting with Group Based Sparse Representation using Self Adaptive Dictionary Learning
    Rao, T. J. V. Subrahmanyeswara
    Rao, M. Venu Gopala
    Aswini, T. V. N. L.
    2015 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION ENGINEERING SYSTEMS (SPACES), 2015, : 301 - 305
  • [7] Dictionary learning based sparse coefficients for audio classification with max and average pooling
    Zubair, Syed
    Yan, Fei
    Wang, Wenwu
    DIGITAL SIGNAL PROCESSING, 2013, 23 (03) : 960 - 970
  • [8] Removing Dust Artifacts in Retinal Images via Dictionary Learning and Sparse-Based Inpainting
    Barrios, Erik M.
    Marrugo, Andres G.
    Millan, Maria S.
    2019 XXII SYMPOSIUM ON IMAGE, SIGNAL PROCESSING AND ARTIFICIAL VISION (STSIVA), 2019,
  • [9] SPARSE NON-LOCAL SIMILARITY MODELING FOR AUDIO INPAINTING
    Toumi, Ichrak
    Emiya, Valentin
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 576 - 580
  • [10] Dictionary learning based on M-PCA-N for audio signal sparse representation
    Yang, Jichen
    He, Qianhua
    Li, Yanxiong
    Liu, Leian
    Li, Jianhong
    Feng, Xiaohui
    IET SIGNAL PROCESSING, 2018, 12 (02) : 198 - 206