Time-Frequency Analysis as Probabilistic Inference

被引：24

作者：

Turner, Richard E. ^{[1
]}

Sahani, Maneesh ^{[2
]}

机构：

[1] Univ Cambridge, Dept Engn, Cambridge CB2 1TN, England

[2] UCL, Gatsby Computat Neurosci Unit, London WC1N 3AR, England

来源：

IEEE TRANSACTIONS ON SIGNAL PROCESSING | 2014年 / 62卷 / 23期

基金：

英国工程与自然科学研究理事会;

关键词：

Audio signal processing; inference; machine-learning; time-frequency analysis; NONNEGATIVE MATRIX FACTORIZATION; REPRESENTATION; NOISE; EM;

D O I：

10.1109/TSP.2014.2362100

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

This paper proposes a new view of time-frequency analysis framed in terms of probabilistic inference. Natural signals are assumed to be formed by the superposition of distinct time-frequency components, with the analytic goal being to infer these components by application of Bayes' rule. The framework serves to unify various existing models for natural time-series; it relates to both the Wiener and Kalman filters, and with suitable assumptions yields inferential interpretations of the short-time Fourier transform, spectrogram, filter bank, and wavelet representations. Value is gained by placing time-frequency analysis on the same probabilistic basis as is often employed in applications such as denoising, source separation, or recognition. Uncertainty in the time-frequency representation can be propagated correctly to application-specific stages, improving the handing of noise and missing data. Probabilistic learning allows modules to be co-adapted; thus, the time-frequency representation can be adapted to both the demands of the application and the time-varying statistics of the signal at hand. Similarly, the application module can be adapted to fine properties of the signal propagated by the initial time-frequency processing. We demonstrate these benefits by combining probabilistic time-frequency representations with non-negative matrix factorization, finding benefits in audio denoising and inpainting tasks, albeit with higher computational cost than incurred by the standard approach.

引用

页码：6171 / 6183

页数：13

共 54 条

[11] SUPPRESSION OF ACOUSTIC NOISE IN SPEECH USING SPECTRAL SUBTRACTION [J].

BOLL, SF .

IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1979, 27 (02) :113-120

[12]

Bretthorst G. L, 1988, Bayesian Spectrum Analysis and Parameter Estimation

[13]

Cemgil A. T., 2005, P EUR SIGN PROC C AN

[14]

Cemgil AT, 2007, LECT NOTES COMPUT SC, V4666, P697

[15] Modulation decompositions for the interpolation of long gaps in acoustic signals [J].

Clark, Pascal ;

Atlas, Les .

2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, :3741-3744

[16]

Cohen L., 1994, TIME FREQUENCY ANAL

[17]

Coughlan J. M., 2001, ADV NEURAL INFORM PR, V14, P1231

[18] Online Bayesian Inference in Some Time-Frequency Representations of Non-Stationary Processes [J].

Everitt, Richard Geoffrey ;

Andrieu, Christophe ;

Davy, Manuel .

IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2013, 61 (22) :5755-5766

[19] Nonnegative Matrix Factorization with the Itakura-Saito Divergence: With Application to Music Analysis [J].

Fevotte, Cedric ;

Bertin, Nancy ;

Durrieu, Jean-Louis .

NEURAL COMPUTATION, 2009, 21 (03) :793-830

[20]

Fisher W., 1986, PROC DARPA WORKSHOP, P93

← 1 2 3 4 5 6 →