Nonintrusive speech quality estimation using Gaussian mixture models

被引:25
作者
Falk, TH [1 ]
Chan, WY [1 ]
机构
[1] Queens Univ, Dept Elect & Comp Engn, Kingston, ON K7L 3N6, Canada
关键词
Gaussian mixtures; quality assurance; quality measurement; quality of service; speech coding; speech quality; speech transmission; telephony;
D O I
10.1109/LSP.2005.861598
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
An algorithm for nonintrusive speech quality estimation based on Gaussian mixture models (GMMs) is presented. GMMs are used to form an artificial reference model of the behavior of features of undegraded speech. Consistency measures between the degraded speech signal and the reference model serve as indicators of speech quality. Consistency values are mapped to an objective speech quality score using a multivariate adaptive regression splines function. When tested on unseen data, the proposed algorithm generally outperforms ITU-T standard P.563, which is the current "state-of-the-art" algorithm. The algorithm computes objective quality scores roughly twice as fast as P.563.
引用
收藏
页码:108 / 111
页数:4
相关论文
共 16 条
  • [1] [Anonymous], G729 ITUT
  • [2] [Anonymous], 2004, ITU T RECOMMENDATION
  • [3] Nonintrusive speech quality evaluation using an adaptive neurofuzzy inference system
    Chen, G
    Parsa, V
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2005, 12 (05) : 403 - 406
  • [4] MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM
    DEMPSTER, AP
    LAIRD, NM
    RUBIN, DB
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01): : 1 - 38
  • [5] FALK TH, 2005, P IEEE INT C AC SPEE, V1, P125
  • [6] Advanced glycation end-products in diabetic nephropathy
    Friedman, EA
    [J]. NEPHROLOGY DIALYSIS TRANSPLANTATION, 1999, 14 : 1 - 9
  • [7] Gersho A., 1992, VECTOR QUANTIZATION
  • [8] Non-intrusive speech-quality assessment using vocal-tract models
    Gray, P
    Hollier, MP
    Massara, RE
    [J]. IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 2000, 147 (06): : 493 - 501
  • [9] PERCEPTUAL LINEAR PREDICTIVE (PLP) ANALYSIS OF SPEECH
    HERMANSKY, H
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1990, 87 (04) : 1738 - 1752
  • [10] *ITU T, 1998, 23 ITUT P S