EVALUATION OF SPECTRAL TRANSFORMS FOR MUSIC SIGNAL ANALYSIS

Cited by: 0
Authors
Nagathil, Anil [1]
Martin, Rainer [1]
Affiliations
[1] Ruhr Univ Bochum, Inst Commun Acoust, D-44780 Bochum, Germany
Source
2013 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA) | 2013
Keywords
Spectral analysis; music signal analysis; constant-Q transform; source separation
DOI
Not available
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
In this paper we present a study on the spectral analysis of music signals, comparing the time-domain representation, the short-time Fourier transform (STFT), and the constant-Q transform (CQT), which are additionally combined with different signal-dependent transforms. The comparison is carried out with respect to the spectral compactness, the data compression ability, and the temporal continuity of the transform coefficients, for which we propose measures in this paper. In addition, we investigate the performance of these transforms in a source separation task in which we aim to recover the main melody line from a mixed-instrument recording. Our experiments reveal that a rank-reduced principal component analysis based on a CQT representation yields the best results in terms of instrumental source separation measures and listening impression. This points to the potential of the CQT for improving existing source separation methods, which are currently often based on the STFT.
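The abstract does not state which compactness measures the authors propose, so the following is only a rough illustration of what such a measure could look like, not the paper's method. It computes the Gini index of coefficient magnitudes, a generic sparsity measure that is 0 when energy is spread evenly across coefficients and approaches 1 when it is concentrated in a few. The function name `gini_index` and the toy coefficient vectors are assumptions made for this sketch.

```python
def gini_index(coeffs):
    """Gini index of coefficient magnitudes (hypothetical helper, not from the paper).

    Returns 0.0 for evenly spread magnitudes and 1 - 1/N when a single
    coefficient out of N carries all the magnitude.
    """
    # Magnitudes sorted in ascending order; abs() also handles complex values.
    mags = sorted(abs(c) for c in coeffs)
    n = len(mags)
    total = sum(mags)
    if n == 0 or total == 0:
        return 0.0  # Convention chosen here for empty or all-zero input.
    # Larger magnitudes (higher rank k) receive smaller weights.
    weighted = sum(m * (n - k + 0.5) / n for k, m in enumerate(mags, start=1))
    return 1.0 - 2.0 * weighted / total


# Toy comparison: a spread-out frame versus a compact one.
spread_frame = [1.0, 1.0, 1.0, 1.0]
compact_frame = [0.0, 0.0, 0.0, 1.0]
print(gini_index(spread_frame))   # 0.0
print(gini_index(compact_frame))  # 0.75
```

Under this assumed measure, a transform that packs a music frame into fewer significant coefficients would score closer to 1 than one that spreads energy across many coefficients.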
Pages: 4