Time-Frequency Analysis as Probabilistic Inference

被引:24
作者
Turner, Richard E. [1 ]
Sahani, Maneesh [2 ]
机构
[1] Univ Cambridge, Dept Engn, Cambridge CB2 1TN, England
[2] UCL, Gatsby Computat Neurosci Unit, London WC1N 3AR, England
基金
英国工程与自然科学研究理事会;
关键词
Audio signal processing; inference; machine-learning; time-frequency analysis; NONNEGATIVE MATRIX FACTORIZATION; REPRESENTATION; NOISE; EM;
D O I
10.1109/TSP.2014.2362100
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper proposes a new view of time-frequency analysis framed in terms of probabilistic inference. Natural signals are assumed to be formed by the superposition of distinct time-frequency components, with the analytic goal being to infer these components by application of Bayes' rule. The framework serves to unify various existing models for natural time-series; it relates to both the Wiener and Kalman filters, and with suitable assumptions yields inferential interpretations of the short-time Fourier transform, spectrogram, filter bank, and wavelet representations. Value is gained by placing time-frequency analysis on the same probabilistic basis as is often employed in applications such as denoising, source separation, or recognition. Uncertainty in the time-frequency representation can be propagated correctly to application-specific stages, improving the handing of noise and missing data. Probabilistic learning allows modules to be co-adapted; thus, the time-frequency representation can be adapted to both the demands of the application and the time-varying statistics of the signal at hand. Similarly, the application module can be adapted to fine properties of the signal propagated by the initial time-frequency processing. We demonstrate these benefits by combining probabilistic time-frequency representations with non-negative matrix factorization, finding benefits in audio denoising and inpainting tasks, albeit with higher computational cost than incurred by the standard approach.
引用
收藏
页码:6171 / 6183
页数:13
相关论文
共 54 条
[11]   SUPPRESSION OF ACOUSTIC NOISE IN SPEECH USING SPECTRAL SUBTRACTION [J].
BOLL, SF .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1979, 27 (02) :113-120
[12]  
Bretthorst G. L, 1988, Bayesian Spectrum Analysis and Parameter Estimation
[13]  
Cemgil A. T., 2005, P EUR SIGN PROC C AN
[14]  
Cemgil AT, 2007, LECT NOTES COMPUT SC, V4666, P697
[15]   Modulation decompositions for the interpolation of long gaps in acoustic signals [J].
Clark, Pascal ;
Atlas, Les .
2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, :3741-3744
[16]  
Cohen L., 1994, TIME FREQUENCY ANAL
[17]  
Coughlan J. M., 2001, ADV NEURAL INFORM PR, V14, P1231
[18]   Online Bayesian Inference in Some Time-Frequency Representations of Non-Stationary Processes [J].
Everitt, Richard Geoffrey ;
Andrieu, Christophe ;
Davy, Manuel .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2013, 61 (22) :5755-5766
[19]   Nonnegative Matrix Factorization with the Itakura-Saito Divergence: With Application to Music Analysis [J].
Fevotte, Cedric ;
Bertin, Nancy ;
Durrieu, Jean-Louis .
NEURAL COMPUTATION, 2009, 21 (03) :793-830
[20]  
Fisher W., 1986, PROC DARPA WORKSHOP, P93