Joint detection and tracking of time-varying harmonic components: A flexible Bayesian approach

被引:36
作者
Dubois, Corentin [1 ]
Davy, Manuel
机构
[1] IRCCyN, F-44321 Nantes 03, France
[2] CNRS, LAGIS, F-59651 Villeneuve, France
[3] INRIA, FUTURS SequeL Team, F-59651 Villeneuve, France
来源
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2007年 / 15卷 / 04期
关键词
audio signal analysis; Bayesian filtering; harmonic structure; multipitch estimation; particle filtering; Rao-Blackwellization; time-frequency representation; time-varying amplitude/frequency tracking;
D O I
10.1109/TASL.2007.894522
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper addresses the joint estimation and detection of time-varying harmonic components in audio signals. We follow a flexible viewpoint, where several frequency/amplitude trajectories are tracked in spectrogram using particle filtering. The core idea is that each harmonic component (composed of a fundamental partial together with several overtone partials) is considered a target. Tracking requires to define a state-space model with state transition and measurement equations. Particle filtering algorithms rely on a so-called sequential importance distribution, and we show that it can be built on previous multipitch estimation algorithms, so as to yield an even more efficient estimation procedure with established convergence properties. Moreover, as our model captures all the harmonic model information, it actually separates the harmonic sources. Simulations on synthetic and real music data show the interest of our approach.
引用
收藏
页码:1283 / 1295
页数:13
相关论文
共 46 条
[1]  
Albert S. Bregman, 1990, AUDITORY SCENE ANAL, P411, DOI [DOI 10.7551/MITPRESS/1486.001.0001, 10.1121/1.408434, DOI 10.1121/1.408434]
[2]   Efficient particle filtering for jump Markov systems. Application to time-varying autoregressions [J].
Andrieu, C ;
Davy, M ;
Doucet, A .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2003, 51 (07) :1762-1770
[3]  
[Anonymous], 1998, COMPUTATIONAL AUDITO
[4]  
[Anonymous], THESIS TAMPERE U TEC
[5]  
BELLO JP, 2000, P COST G6 C DIG AUD, P135
[6]   PERCEPTUAL GROUPING OF MUSICAL SOUNDS - A COMPUTATIONAL MODEL [J].
BROWN, GJ ;
COOKE, M .
JOURNAL OF NEW MUSIC RESEARCH, 1994, 23 (02) :107-132
[7]  
Casella G., 2005, MONTE CARLO STAT MET
[8]   A generative model for music transcription [J].
Cemgil, AT ;
Kappen, HJ ;
Barber, D .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (02) :679-694
[9]  
CERNGIL AT, 2003, IEEE WASPAA, P181
[10]   Bayesian analysis of polyphonic western tonal music [J].
Davy, M ;
Godsill, S ;
Idier, J .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2006, 119 (04) :2498-2517