A DATA-DRIVEN COGNITIVE SALIENCE MODEL FOR OBJECTIVE PERCEPTUAL AUDIO QUALITY ASSESSMENT

被引:2
|
作者
Delgado, Pablo M. [1 ]
Herre, Juergen [1 ,2 ]
机构
[1] Int Audio Labs Erlangen, Wolfsmantel 33, D-91058 Erlangen, Germany
[2] Fraunhofer IIS, Wolfsmantel 33, D-91058 Erlangen, Germany
来源
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2022年
关键词
Psychoacoustics; Cognitive Modeling; Objective Audio Quality Assessment; PEAQ; ViSQOL;
D O I
10.1109/ICASSP43922.2022.9747064
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Objective audio quality measurement systems often use perceptual models to predict the subjective quality scores of processed signals, as reported in listening tests. Most systems map different metrics of perceived degradation into a single quality score predicting subjective quality. This requires a quality mapping stage that is informed by real listening test data using statistical learning (i. e., a data-driven approach) with distortion metrics as input features. However, the amount of reliable training data is limited in practice, and usually not sufficient for a comprehensive training of large learning models. Models of cognitive effects in objective systems can, however, improve the learning model. Specifically, considering the salience of certain distortion types, they provide additional features to the mapping stage that improve the learning process, especially for limited amounts of training data. We propose a novel data-driven salience model that informs the quality mapping stage by explicitly estimating the cognitive/degradation metric interactions using a salience measure. Systems incorporating the novel salience model are shown to outperform equivalent systems that only use statistical learning to combine cognitive and degradation metrics, as well as other well-known measurement systems, for a representative validation dataset.
引用
收藏
页码:986 / 990
页数:5
相关论文
共 12 条
  • [11] The Quality of Response Time Data Inference: A Blinded, Collaborative Assessment of the Validity of Cognitive Models
    Dutilh, Gilles
    Annis, Jeffrey
    Brown, Scott D.
    Cassey, Peter
    Evans, Nathan J.
    Grasman, Raoul P. P. P.
    Hawkins, Guy E.
    Heathcote, Andrew
    Holmes, William R.
    Krypotos, Angelos-Miltiadis
    Kupitz, Colin N.
    Leite, Fabio P.
    Lerche, Veronika
    Lin, Yi-Shin
    Logan, Gordon D.
    Palmeri, Thomas J.
    Starns, Jeffrey J.
    Trueblood, Jennifer S.
    van Maanen, Leendert
    van Ravenzwaaij, Don
    Vandekerckhove, Joachim
    Visser, Ingmar
    Voss, Andreas
    White, Corey N.
    Wiecki, Thomas V.
    Rieskamp, Joerg
    Donkin, Chris
    PSYCHONOMIC BULLETIN & REVIEW, 2019, 26 (04) : 1051 - 1069
  • [12] Development of psychoacoustic model based on the correlation of the subjective and objective sound quality assessment of automatic washing machines
    Moravec, Marek
    Izarikova, Gabriela
    Liptai, Pavol
    Badida, Miroslav
    Badidova, Anna
    APPLIED ACOUSTICS, 2018, 140 : 178 - 182