Sparse time-frequency representations for polyphonic audio based on combined efficient fan-chirp transforms

被引:0
|
作者
Costa M.V.M. [1 ]
Apolinário I.F. [1 ]
Biscainho L.W.P. [1 ]
机构
[1] Signals, Multimedia, and Telecommunications Lab (SMT), DEL/Poli and PEE/COPPE, Federal University of Rio de Janeiro, Rio de Janeiro, RJ
来源
关键词
D O I
10.17743/JAES.2019.0039
中图分类号
学科分类号
摘要
This work presents a new strategy for obtaining a sparse time-frequency representation of polyphonic audio signals that exhibit continuous pitch changes by combining different instances of the fan-chirp transform, showing for the first time that it is applicable to effectively describe simultaneous audio sources with such characteristics. The method described blends two recent proposals: A fast implementation of the fan-chirp transform based on the structure tensor and a smart combination of time-frequency representations that explores local sparsity. Both methods are further improved in this work: now the former provides better estimates of the transforms' chirp rates by including a filtering stage, and the latter yields smoother and more continuous combinations of the representations. A set of experiments with synthetic and real audio signals illustrates the performance of the method. © 2019 Audio Engineering Society. All rights reserved.
引用
收藏
页码:894 / 905
页数:11
相关论文
共 50 条
  • [1] Sparse Time-Frequency Representations for Polyphonic Audio Based on Combined Efficient Fan-Chirp Transforms
    Costa, Mauricio V. M.
    Apolinario, Isabela F.
    Biscainho, Luiz W. P.
    JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2019, 67 (11): : 894 - 905
  • [2] Sparse time-frequency representations
    Gardner, TJ
    Magnasco, MO
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2006, 103 (16) : 6094 - 6099
  • [3] Time-frequency Signature Sparse Reconstruction using Chirp Dictionary
    Nguyen, Yen T. H.
    Amin, Moeness G.
    Ghogho, Mounir
    McLernon, Des
    COMPRESSIVE SENSING IV, 2015, 9484
  • [4] SPARSE DENOISING OF AUDIO BY GREEDY TIME-FREQUENCY SHRINKAGE
    Bhattacharya, Gautam
    Depalle, Philippe
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [5] Variations on Hough-wavelet transforms for time-frequency chirp detection
    Morvidone, M
    Torrésani, B
    WAVELETS: APPLICATIONS IN SIGNAL AND IMAGE PROCESSING X, PTS 1 AND 2, 2003, 5207 : 181 - 195
  • [6] Time-frequency audio feature extraction based on tensor representation of sparse coding
    Zhang, Xue-Yuan
    He, Qian-Hua
    ELECTRONICS LETTERS, 2015, 51 (02) : 131 - U20
  • [7] Application of radon transforms and time-frequency representations to ISAR imagery
    Steeghs, P
    Gelsema, SJ
    INDEPENDENT COMPONENT ANALYSES, WAVELETS, AND NEURAL NETWORKS, 2003, 5102 : 189 - 199
  • [8] Using edge information in time-frequency representations for chirp parameter estimation
    Bennett, NN
    Saito, N
    APPLIED AND COMPUTATIONAL HARMONIC ANALYSIS, 2005, 18 (02) : 186 - 197
  • [9] Missing Data Imputation for Time-Frequency Representations of Audio Signals
    Smaragdis, Paris
    Raj, Bhiksha
    Shashanka, Madhusudana
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2011, 65 (03): : 361 - 370
  • [10] Histogram of Gradients of Time-Frequency Representations for Audio Scene Classification
    Rakotomamonjy, Alain
    Gasso, Gilles
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (01) : 142 - 153