Psychophysical Evaluation of Audio Source Separation Methods

被引:3
作者
Simpson, Andrew J. R. [1 ]
Roma, Gerard [1 ]
Grais, Emad M. [1 ]
Mason, Russell D. [2 ]
Hummersone, Christopher [2 ]
Plumbley, Mark D. [1 ]
机构
[1] Ctr Vis Speech & Signal Proc, Guildford, Surrey, England
[2] Univ Surrey, Inst Sound Recording, Guildford, Surrey, England
来源
LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION (LVA/ICA 2017) | 2017年 / 10169卷
基金
英国工程与自然科学研究理事会;
关键词
Deep learning; Source separation; Perceptual evaluation;
D O I
10.1007/978-3-319-53547-0_21
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Source separation evaluation is typically a top-down process, starting with perceptual measures which capture fitness-for-purpose and followed by attempts to find physical (objective) measures that are predictive of the perceptual measures. In this paper, we take a contrasting bottom-up approach. We begin with the physical measures provided by the Blind Source Separation Evaluation Toolkit (BSS Eval) and we then look for corresponding perceptual correlates. This approach is known as psychophysics and has the distinct advantage of leading to interpretable, psychophysical models. We obtained perceptual similarity judgments from listeners in two experiments featuring vocal sources within musical mixtures. In the first experiment, listeners compared the overall quality of vocal signals estimated from musical mixtures using a range of competing source separation methods. In a loudness experiment, listeners compared the loudness balance of the competing musical accompaniment and vocal. Our preliminary results provide provisional validation of the psychophysical approach.
引用
收藏
页码:211 / 221
页数:11
相关论文
共 19 条
[1]  
[Anonymous], 2014, BS15343 ITUR
[2]  
Cano E, 2016, EUR SIGNAL PR CONF, P1758, DOI 10.1109/EUSIPCO.2016.7760550
[3]  
Cartwright M, 2016, INT CONF ACOUST SPEE, P619, DOI 10.1109/ICASSP.2016.7471749
[4]   MODIFIED RANDOMIZATION TESTS FOR NONPARAMETRIC HYPOTHESES [J].
DWASS, M .
ANNALS OF MATHEMATICAL STATISTICS, 1957, 28 (01) :181-187
[5]   Subjective and Objective Quality Assessment of Audio Source Separation [J].
Emiya, Valentin ;
Vincent, Emmanuel ;
Harlander, Niklas ;
Hohmann, Volker .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (07) :2046-2057
[6]  
Fechner GT, 1860, Elemente der psychophysik, V2
[7]   Loudness, its definition, measurement and calculation [J].
Fletcher, H ;
Munson, WA .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1933, 5 (02) :82-108
[8]  
Gescheider G. A., 1997, PSYCHOPHYSICS FUNDAM
[9]  
Grais E.M., 2017, 13 INT C LAT VAR AN
[10]  
Gupta U., 2015, IEEE WORK APPL SIG