Subjective and Objective Quality Assessment of Audio Source Separation

被引:203
作者
Emiya, Valentin [1 ]
Vincent, Emmanuel [1 ]
Harlander, Niklas [2 ]
Hohmann, Volker [2 ]
机构
[1] INRIA, Ctr Inria Rennes Bretagne Atlantique, F-35042 Rennes, France
[2] Carl von Ossietzky Univ Oldenburg, D-26111 Oldenburg, Germany
来源
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2011年 / 19卷 / 07期
关键词
Audio; objective measure; quality assessment; source separation; subjective test protocol; MODEL;
D O I
10.1109/TASL.2011.2109381
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We aim to assess the perceived quality of estimated source signals in the context of audio source separation. These signals may involve one or more kinds of distortions, including distortion of the target source, interference from the other sources or musical noise artifacts. We propose a subjective test protocol to assess the perceived quality with respect to each kind of distortion and collect the scores of 20 subjects over 80 sounds. We then propose a family of objective measures aiming to predict these subjective scores based on the decomposition of the estimation error into several distortion components and on the use of the PEMO-Q perceptual salience measure to provide multiple features that are then combined. These measures increase correlation with subjective scores up to 0.5 compared to nonlinear mapping of individual state-of-the-art source separation measures. Finally, we released the data and code presented in this paper in a freely available toolkit called PEASS.
引用
收藏
页码:2046 / 2057
页数:12
相关论文
共 39 条
[11]  
Fox B, 2007, LECT NOTES COMPUT SC, V4666, P454
[12]  
Glasberg BR, 2002, J AUDIO ENG SOC, V50, P331
[13]   Postprocessing method for suppressing musical noise generated by spectral subtraction [J].
Goh, Z ;
Tan, KC ;
Tan, BTG .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1998, 6 (03) :287-292
[14]  
Herzke T, 2007, ACTA ACUST UNITED AC, V93, P498
[15]  
Hohmann V, 2002, ACTA ACUST UNITED AC, V88, P433
[16]   Evaluation of objective quality measures for speech enhancement [J].
Hu, Yi ;
Loizou, Philipos C. .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (01) :229-238
[17]   PEMO-Q - A new method for objective: Audio quality assessment using a model of auditory perception [J].
Huber, Rainer ;
Kollmeier, Birger .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (06) :1902-1911
[18]  
ITU, 2003, BS15341 ITUR
[19]   An adaptive stereo basis method for convolutive blind audio source separation [J].
Jafari, Maria G. ;
Vincent, Emmanuel ;
Abdallah, Samer A. ;
Plumbley, Mark D. ;
Davies, Mike E. .
NEUROCOMPUTING, 2008, 71 (10-12) :2087-2097
[20]  
Joby J., 2004, THESIS INDIAN I SCI