Subjective and Objective Quality Assessment of Audio Source Separation

Cited by: 203

Authors:

Emiya, Valentin [1]

Vincent, Emmanuel [1]

Harlander, Niklas [2]

Hohmann, Volker [2]

Affiliations:

[1] INRIA, Ctr Inria Rennes Bretagne Atlantique, F-35042 Rennes, France

[2] Carl von Ossietzky Univ Oldenburg, D-26111 Oldenburg, Germany

Source:

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2011 / Vol. 19 / No. 07

Keywords:

Audio; objective measure; quality assessment; source separation; subjective test protocol; MODEL;

DOI:

10.1109/TASL.2011.2109381

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification codes:

070206 ; 082403 ;

Abstract:

We aim to assess the perceived quality of estimated source signals in the context of audio source separation. These signals may involve one or more kinds of distortions, including distortion of the target source, interference from the other sources or musical noise artifacts. We propose a subjective test protocol to assess the perceived quality with respect to each kind of distortion and collect the scores of 20 subjects over 80 sounds. We then propose a family of objective measures aiming to predict these subjective scores based on the decomposition of the estimation error into several distortion components and on the use of the PEMO-Q perceptual salience measure to provide multiple features that are then combined. These measures increase correlation with subjective scores up to 0.5 compared to nonlinear mapping of individual state-of-the-art source separation measures. Finally, we released the data and code presented in this paper in a freely available toolkit called PEASS.


Pages: 2046-2057

Page count: 12

References: 39 in total

