Sparse time-frequency representations for polyphonic audio based on combined efficient fan-chirp transforms

被引:0
|
作者
Costa M.V.M. [1 ]
Apolinário I.F. [1 ]
Biscainho L.W.P. [1 ]
机构
[1] Signals, Multimedia, and Telecommunications Lab (SMT), DEL/Poli and PEE/COPPE, Federal University of Rio de Janeiro, Rio de Janeiro, RJ
来源
关键词
D O I
10.17743/JAES.2019.0039
中图分类号
学科分类号
摘要
This work presents a new strategy for obtaining a sparse time-frequency representation of polyphonic audio signals that exhibit continuous pitch changes by combining different instances of the fan-chirp transform, showing for the first time that it is applicable to effectively describe simultaneous audio sources with such characteristics. The method described blends two recent proposals: A fast implementation of the fan-chirp transform based on the structure tensor and a smart combination of time-frequency representations that explores local sparsity. Both methods are further improved in this work: now the former provides better estimates of the transforms' chirp rates by including a filtering stage, and the latter yields smoother and more continuous combinations of the representations. A set of experiments with synthetic and real audio signals illustrates the performance of the method. © 2019 Audio Engineering Society. All rights reserved.
引用
收藏
页码:894 / 905
页数:11
相关论文
共 50 条
  • [21] BENCHMARKING FLEXIBLE ADAPTIVE TIME-FREQUENCY TRANSFORMS FOR UNDERDETERMINED AUDIO SOURCE SEPARATION
    Nesbit, Andrew
    Vincent, Emmanuel
    Plumbley, Mark D.
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 37 - +
  • [22] Time-frequency analysis based on curvelet transforms with time skewing
    Wang, Jiao
    Li, Zhenchun
    Gross, Lutz
    Tyson, Stephen
    GEOPHYSICAL PROSPECTING, 2019, 67 (07) : 1838 - 1851
  • [23] Sparse Recovery of Time-Frequency Representations via Recurrent Neural Networks
    Khalifa, Yassin
    Zhang, Zhenwei
    Sejdie, Ervin
    2017 22ND INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2017,
  • [24] REDUCED INTERFERENCE TIME-FREQUENCY REPRESENTATIONS AND SPARSE RECONSTRUCTION OF UNDERSAMPLED DATA
    Zhang, Yimin D.
    Amin, Moeness G.
    Himed, Braham
    2013 PROCEEDINGS OF THE 21ST EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2013,
  • [25] RANDOM TIME-FREQUENCY SUBDICTIONARY DESIGN FOR SPARSE REPRESENTATIONS WITH GREEDY ALGORITHMS
    Moussallam, Manuel
    Daudet, Laurent
    Richard, Gael
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 3577 - 3580
  • [26] TIME-FREQUENCY REPRESENTATIONS BASED ON COMPRESSIVE SAMPLES
    Sejdic, Ervin
    Chaparro, Luis E.
    2013 PROCEEDINGS OF THE 21ST EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2013,
  • [27] Audio Fingerprint Extraction Based on Time-Frequency Domain
    Liu, Zhengzheng
    Li, Cong
    Cao, Sanxing
    2016 2ND IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2016, : 1975 - 1979
  • [28] Investigating Time-Frequency Representations for Audio Feature Extraction in Singing Technique Classification
    Yamamoto, Yuya
    Nam, Juhan
    Terasawa, Hiroko
    Hiraga, Yuzuru
    2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 890 - 896
  • [29] Time-Frequency MUSIC: An array signal processing method based on time-frequency signal representations
    Amin, MG
    Belouchrani, A
    RADAR PROCESSING, TECHNOLOGY, AND APPLICATIONS III, 1998, 3462 : 186 - 194
  • [30] Sparse Component Analysis Using Time-Frequency Representations for Operational Modal Analysis
    Qin, Shaoqian
    Guo, Jie
    Zhu, Changan
    SENSORS, 2015, 15 (03) : 6497 - 6519