Choice and adaptation of statistical models for single channel singing voice separation

被引:0
作者
Ozerov, Alexey [1 ,2 ]
Philippe, Pierrick [1 ]
Gribonval, Remi [2 ]
Bimbot, Frederic [2 ]
机构
[1] Orange Labs, F-35512 Cesson Sevigne, France
[2] IRISA, CNRS & INRIA, Projet METISS, F-35042 Rennes, France
关键词
single channel source separation; singing voice; statistical models; Gaussian mixture models; adaptive Wiener filtering; models adaptation;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The problem of singing voice extraction from mono audio recordings, i.e., one microphone separation of voice and music, is studied. The approach is based on a priori probabilistic models for two sources, more precisely on Gaussian Mixture Models (GMM). A method for model adaptation to the characteristics of the mixed sources is developed and a comparative study of different models and estimators is performed. We show that the adaptation of the model of music from the non-vocal parts of songs yields good results in realistic conditions.
引用
收藏
页码:211 / 224
页数:14
相关论文
共 26 条
[1]   Audio source separation with a single sensor [J].
Benaroya, L ;
Bimbot, F ;
Gribonval, R .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (01) :191-199
[2]  
BENAROYA L, 2003, THESIS U RENNES 1
[3]  
BENAROYA L, 2003, INT C IND COMP AN BL, P957
[4]  
BURSHTEIN D, 1999, EUR C SPEECH COMM TE, V6, P2591
[5]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[6]   SPEECH ENHANCEMENT USING A MINIMUM MEAN-SQUARE ERROR LOG-SPECTRAL AMPLITUDE ESTIMATOR [J].
EPHRAIM, Y ;
MALAH, D .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1985, 33 (02) :443-445
[7]   A BAYESIAN-ESTIMATION APPROACH FOR SPEECH ENHANCEMENT USING HIDDEN MARKOV-MODELS [J].
EPHRAIM, Y .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1992, 40 (04) :725-735
[8]  
GRIBONVAL R, 2003, ICA, P763
[9]  
Kay S. M., 1993, Fundamentals of Statistical Signal Processing: Estimation Theory
[10]  
Kristjansson T, 2004, INT CONF ACOUST SPEE, P817