An Instrumental Quality Measure for Artificially Bandwidth-Extended Speech Signals

被引:11
作者
Abel, Johannes [1 ]
Kaniewska, Magdalena [2 ]
Guillaume, Cyril [2 ]
Tirry, Wouter [2 ]
Fingscheidt, Tim [1 ]
机构
[1] Tech Univ Carolo Wilhelmina Braunschweig, Inst Commun Technol, D-38106 Braunschweig, Germany
[2] NXP Semicond, B-3001 Leuven, Belgium
关键词
Artificial speech bandwidth extension; objective speech quality assessment; perceptual model; NEURAL-NETWORK; EXTENSION; INTELLIGIBILITY;
D O I
10.1109/TASLP.2016.2635022
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Various studies have shown that the instrumental measures wideband PESQ and POLQA are not reliably predicting speech quality for artificial speech bandwidth extension (ABE) test conditions, as this has never been their scope. Based on data from a coordinated subjective listening test with 12 ABE variants developed by 6 different institutions, conducted in 4 languages, we propose in this work a novel instrumental quality measure that is specifically suited for narrowband-to-wideband ABE test conditions. In particular, our contributions are fourfold: First, we propose quality indicators particularly being able to detect ABE-related distortions. Second, we investigate the combination of perceptually and nonperceptually motivated distortion-related statistics. Third, we propose a support-vector-machine-based high-performance MOS predictor for ABE speech quality assessment, finally, we present the training process based on the subjective listening test data. A k-fold cross-validation test on 1) disjoint languages, 2) disjoint speakers, and 3) disjoint ABE solutions proves the superiority of our proposed measure in the ITU-T-recommended categories accuracy, consistency, and linearity compared to both, wideband PESQ and POLQA.
引用
收藏
页码:384 / 396
页数:13
相关论文
共 41 条
  • [1] [Anonymous], ITU T REC P 800 METH
  • [2] [Anonymous], ITU T REC P 862 2 WI
  • [3] [Anonymous], ITU T REC P 863 PERC
  • [4] [Anonymous], 2016, IEEE INT WORKSH AC S
  • [5] [Anonymous], ITU T REC P 862 PERC
  • [6] [Anonymous], P WORKSH AUD SPRACHV
  • [7] [Anonymous], ITU T REC P 56 OBJ M
  • [8] [Anonymous], 2011, INTERSPEECH
  • [9] [Anonymous], P 4 INT WORKSH PERC
  • [10] [Anonymous], THESIS