Non-negative tensor factorization models for Bayesian audio processing

被引:9
作者
Simsekli, Umut [1 ]
Virtanen, Tuomas [2 ]
Cemgil, Ali Taylan [1 ]
机构
[1] Bogazici Univ, Dept Comp Engn, TR-34342 Istanbul, Turkey
[2] Tampere Univ Technol, Dept Signal Proc, Tampere 33720, Finland
基金
芬兰科学院;
关键词
Nonnegative matrix and tensor factorization; Coupled factorization; Bayesian audio modeling; Bayesian inference; MATRIX FACTORIZATION;
D O I
10.1016/j.dsp.2015.03.011
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We provide an overview of matrix and tensor factorization methods from a Bayesian perspective, giving emphasis on both the inference methods and modeling techniques. Factorization based models and their many extensions such as tensor factorizations have proved useful in a broad range of applications, supporting a practical and computationally tractable framework for modeling. Especially in audio processing, tensor models help in a unified manner the use of prior knowledge about signals, the data generation processes as well as available data from different modalities. After a general review of tensor models, we describe the general statistical framework, give examples of several audio applications and describe modeling strategies for key problems such as deconvolution, source separation, and transcription. (C) 2015 Elsevier Inc. All rights reserved.
引用
收藏
页码:178 / 191
页数:14
相关论文
共 78 条
[11]   ANALYSIS OF INDIVIDUAL DIFFERENCES IN MULTIDIMENSIONAL SCALING VIA AN N-WAY GENERALIZATION OF ECKART-YOUNG DECOMPOSITION [J].
CARROLL, JD ;
CHANG, JJ .
PSYCHOMETRIKA, 1970, 35 (03) :283-&
[12]  
Cemgil A. T., 2011, P IEEE WORKSH APPL S
[13]  
Cemgil A.T., 2009, COMPUT INTELL NEUROS
[14]   Marginal likelihood from the Gibbs output [J].
Chib, S .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1995, 90 (432) :1313-1321
[15]   Nonnegative matrix and tensor factorization [J].
Cichocki, Andrzej ;
Zdunek, Rafal ;
Amari, Shun-Ichi .
IEEE SIGNAL PROCESSING MAGAZINE, 2008, 25 (01) :142-145
[16]   Nonnegative Matrix Factorization: An Analytical and Interpretive Tool in Computational Biology [J].
Devarajan, Karthik .
PLOS COMPUTATIONAL BIOLOGY, 2008, 4 (07)
[17]  
Dikmen O, 2013, IEEE WORK APPL SIG
[18]   A Musically Motivated Mid-Level Representation for Pitch Estimation and Musical Audio Source Separation [J].
Durrieu, Jean-Louis ;
David, Bertrand ;
Richard, Gael .
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2011, 5 (06) :1180-1191
[19]   Source/Filter Model for Unsupervised Main Melody Extraction From Polyphonic Audio Signals [J].
Durrieu, Jean-Louis ;
Richard, Gael ;
David, Bertrand ;
Fevotte, Cedric .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (03) :564-575
[20]  
Fevotte C., 2011, IEEE STAT SIGN PROC