Non-negative tensor factorization models for Bayesian audio processing

被引:9
作者
Simsekli, Umut [1 ]
Virtanen, Tuomas [2 ]
Cemgil, Ali Taylan [1 ]
机构
[1] Bogazici Univ, Dept Comp Engn, TR-34342 Istanbul, Turkey
[2] Tampere Univ Technol, Dept Signal Proc, Tampere 33720, Finland
基金
芬兰科学院;
关键词
Nonnegative matrix and tensor factorization; Coupled factorization; Bayesian audio modeling; Bayesian inference; MATRIX FACTORIZATION;
D O I
10.1016/j.dsp.2015.03.011
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We provide an overview of matrix and tensor factorization methods from a Bayesian perspective, giving emphasis on both the inference methods and modeling techniques. Factorization based models and their many extensions such as tensor factorizations have proved useful in a broad range of applications, supporting a practical and computationally tractable framework for modeling. Especially in audio processing, tensor models help in a unified manner the use of prior knowledge about signals, the data generation processes as well as available data from different modalities. After a general review of tensor models, we describe the general statistical framework, give examples of several audio applications and describe modeling strategies for key problems such as deconvolution, source separation, and transcription. (C) 2015 Elsevier Inc. All rights reserved.
引用
收藏
页码:178 / 191
页数:14
相关论文
共 78 条
[1]  
Acar E, 2012, INT J KNOWL DISCOV B, V3, P22
[2]  
[Anonymous], P ISCA TUT RES WORKS
[3]  
[Anonymous], 2006, Pattern Recognition and Machine Learning. Information Science and Statistics, DOI DOI 10.1007/978-0-387-45528-0
[4]  
[Anonymous], 2012 IEEE INT WORKSH
[5]  
Badeau R., 2013, IEEE WORKSH APPL SIG, P1
[6]  
Badeau R, 2011, 2011 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), P253, DOI 10.1109/ASPAA.2011.6082264
[7]   Email surveillance using non-negative matrix factorization [J].
Berry M.W. ;
Browne M. .
Computational & Mathematical Organization Theory, 2005, 11 (3) :249-264
[8]   Enforcing Harmonicity and Smoothness in Bayesian Non-Negative Matrix Factorization Applied to Polyphonic Music Transcription [J].
Bertin, Nancy ;
Badeau, Roland ;
Vincent, Emmanuel .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (03) :538-549
[9]  
Beutel A., 2014, SIAM INT C DAT MIN
[10]   PARAFAC. Tutorial and applications [J].
Bro, R .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1997, 38 (02) :149-171