EVALUATION OF SPECTRAL TRANSFORMS FOR MUSIC SIGNAL ANALYSIS

被引:0
作者
Nagathil, Anil [1 ]
Martin, Rainer [1 ]
机构
[1] Ruhr Univ Bochum, Inst Commun Acoust, D-44780 Bochum, Germany
来源
2013 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA) | 2013年
关键词
Spectral analysis; music signal analysis; constant-Q transform; source separation;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper we present a study on the spectral analysis of music signals comparing the time domain representation, the short-time Fourier transform (STFT) and the constant-Q transform (CQT) which are additionally combined with different signal-dependent transforms. The comparison is carried out with respect to the spectral compactness, the data compression ability and the temporal continuity of transform coefficients for which we propose measures in this paper. In addition, we investigate the performance of these transforms in a source separation task in which we strive for recovering the main melody line from a mixed instrument recording. Our experiments reveal that performing a rank-reduced principal component analysis based on a CQT representation exhibits the best results in terms of instrumental source separation measures and listening impression which points towards the potential of the CQT for improving existing source separation methods which are currently often based on the STFT.
引用
收藏
页数:4
相关论文
共 13 条
  • [1] K-SVD: An algorithm for designing overcomplete dictionaries for sparse representation
    Aharon, Michal
    Elad, Michael
    Bruckstein, Alfred
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2006, 54 (11) : 4311 - 4322
  • [2] CALCULATION OF A CONSTANT-Q SPECTRAL TRANSFORM
    BROWN, JC
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1991, 89 (01) : 425 - 434
  • [3] INDEPENDENT COMPONENT ANALYSIS, A NEW CONCEPT
    COMON, P
    [J]. SIGNAL PROCESSING, 1994, 36 (03) : 287 - 314
  • [4] A Framework for Invertible, Real-Time Constant-Q Transforms
    Holighaus, Nicki
    Doerfler, Monika
    Angelo Velasco, Gino
    Grill, Thomas
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (04): : 775 - 785
  • [5] Comparing Measures of Sparsity
    Hurley, Niall
    Rickard, Scott
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 2009, 55 (10) : 4723 - 4741
  • [6] Hyvärinen A, 2001, INDEPENDENT COMPONENT ANALYSIS: PRINCIPLES AND PRACTICE, P71
  • [7] Learning the parts of objects by non-negative matrix factorization
    Lee, DD
    Seung, HS
    [J]. NATURE, 1999, 401 (6755) : 788 - 791
  • [8] Lee DD, 2001, ADV NEUR IN, V13, P556
  • [9] Nagathil A, 2012, INT CONF ACOUST SPEE, P349, DOI 10.1109/ICASSP.2012.6287888
  • [10] Rickard S., 2006, P EUR SIGN PROC C EU