A COMPARATIVE ANALYSIS OF TIME-FREQUENCY DECOMPOSITIONS IN POLYPHONIC PITCH ESTIMATION

Cited by: 0
Authors
Canadas-Quesada, F. J. [1]
Vera-Candeas, P. [1]
Ruiz-Reyes, N. [1]
Carabias, J. [1]
Cabanas, P. [1]
Rodriguez, F. [1]
Affiliations
[1] Univ Jaen, Polytech Sch, Telecommun Engn Dept, Jaen, Spain
Source
SIGMAP 2010: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND MULTIMEDIA APPLICATION | 2010
Keywords
Polyphonic signal; Time-frequency decomposition; Fundamental frequency; Constant Q Transform; STFT; Note-event; Candidate; Overlapped partial; Spectral modeling; MUSIC; TRANSCRIPTION; SEPARATION; SIGNALS
DOI
Not available
Chinese Library Classification number
TP301 [Theory, methods]
Discipline classification code
081202
Abstract
In a monaural polyphonic music context, the time-frequency information used by most multiple fundamental frequency estimation systems is extracted from the time-domain polyphonic signal, mainly using fixed-resolution or variable-resolution time-frequency decompositions. This time-frequency information is crucial to the polyphonic estimation process because it must clearly represent all the information needed to find the set of active pitches. In this paper, we present a preliminary study of two decompositions, the Constant Q Transform and the Short Time Fourier Transform, integrated into the same multiple fundamental frequency estimation system. The aim is to determine which decomposition is more suitable for polyphonic musical signal analysis and how each one influences the accuracy of the polyphonic estimation when evaluated over low, middle, and high frequency ranges.
Pages: 145-150
Number of pages: 6