A DATA-DRIVEN COGNITIVE SALIENCE MODEL FOR OBJECTIVE PERCEPTUAL AUDIO QUALITY ASSESSMENT

被引:2
|
作者
Delgado, Pablo M. [1 ]
Herre, Juergen [1 ,2 ]
机构
[1] Int Audio Labs Erlangen, Wolfsmantel 33, D-91058 Erlangen, Germany
[2] Fraunhofer IIS, Wolfsmantel 33, D-91058 Erlangen, Germany
来源
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2022年
关键词
Psychoacoustics; Cognitive Modeling; Objective Audio Quality Assessment; PEAQ; ViSQOL;
D O I
10.1109/ICASSP43922.2022.9747064
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Objective audio quality measurement systems often use perceptual models to predict the subjective quality scores of processed signals, as reported in listening tests. Most systems map different metrics of perceived degradation into a single quality score predicting subjective quality. This requires a quality mapping stage that is informed by real listening test data using statistical learning (i. e., a data-driven approach) with distortion metrics as input features. However, the amount of reliable training data is limited in practice, and usually not sufficient for a comprehensive training of large learning models. Models of cognitive effects in objective systems can, however, improve the learning model. Specifically, considering the salience of certain distortion types, they provide additional features to the mapping stage that improve the learning process, especially for limited amounts of training data. We propose a novel data-driven salience model that informs the quality mapping stage by explicitly estimating the cognitive/degradation metric interactions using a salience measure. Systems incorporating the novel salience model are shown to outperform equivalent systems that only use statistical learning to combine cognitive and degradation metrics, as well as other well-known measurement systems, for a representative validation dataset.
引用
收藏
页码:986 / 990
页数:5
相关论文
共 12 条
  • [1] Towards Improved Objective Perceptual Audio Quality Assessment - Part 1: A Novel Data-Driven Cognitive Model
    Delgado, Pablo M.
    Herre, Juergen
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 4661 - 4675
  • [2] Objective Assessment of Perceptual Audio Quality Using ViSQOLAudio
    Sloan, Colm
    Harte, Naomi
    Kelly, Damien
    Kokaram, Anil C.
    Hines, Andrew
    IEEE TRANSACTIONS ON BROADCASTING, 2017, 63 (04) : 693 - 705
  • [3] Perceptual-based quality assessment for audio visual services: A survey
    You, Junyong
    Reiter, Ulrich
    Hannuksela, Miska M.
    Gabbouj, Moncef
    Perkis, Andrew
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2010, 25 (07) : 482 - 501
  • [4] OBJECTIVE ASSESSMENT OF SPATIAL AUDIO QUALITY USING DIRECTIONAL LOUDNESS MAPS
    Delgado, Pablo M.
    Herre, Juergen
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 621 - 625
  • [5] Comparison of Two Objective Methods of Quality Assessment for Digital Audio Broadcasting
    Rund, Frantisek
    Ulovec, Karel
    2018 28TH INTERNATIONAL CONFERENCE RADIOELEKTRONIKA (RADIOELEKTRONIKA), 2018,
  • [6] Applying objective perceptual quality assessment methods in network performance modeling
    Conway, AE
    Zhu, YL
    ELEVENTH INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATIONS AND NETWORKS, PROCEEDINGS, 2002, : 116 - 123
  • [7] Can we still use PEAQ? A Performance Analysis of the ITU Standard for the Objective Assessment of Perceived Audio Quality
    Delgado, Pablo M.
    Herre, Juergen
    2020 TWELFTH INTERNATIONAL CONFERENCE ON QUALITY OF MULTIMEDIA EXPERIENCE (QOMEX), 2020,
  • [8] Inferring a Cognitive Architecture from Multitask Neuroimaging Data: A Data-Driven Test of the Common Model of Cognition Using Granger Causality
    Hake, Holly Sue
    Sibert, Catherine
    Stocco, Andrea
    TOPICS IN COGNITIVE SCIENCE, 2022, 14 (04) : 845 - 859
  • [9] Subjective and Objective Assessment of Perceived Audio Quality of Current Digital Audio Broadcasting Systems and Web-Casting Applications
    Pocta, Peter
    Beerends, John G.
    IEEE TRANSACTIONS ON BROADCASTING, 2015, 61 (03) : 407 - 415
  • [10] The Quality of Response Time Data Inference: A Blinded, Collaborative Assessment of the Validity of Cognitive Models
    Gilles Dutilh
    Jeffrey Annis
    Scott D. Brown
    Peter Cassey
    Nathan J. Evans
    Raoul P. P. P. Grasman
    Guy E. Hawkins
    Andrew Heathcote
    William R. Holmes
    Angelos-Miltiadis Krypotos
    Colin N. Kupitz
    Fábio P. Leite
    Veronika Lerche
    Yi-Shin Lin
    Gordon D. Logan
    Thomas J. Palmeri
    Jeffrey J. Starns
    Jennifer S. Trueblood
    Leendert van Maanen
    Don van Ravenzwaaij
    Joachim Vandekerckhove
    Ingmar Visser
    Andreas Voss
    Corey N. White
    Thomas V. Wiecki
    Jörg Rieskamp
    Chris Donkin
    Psychonomic Bulletin & Review, 2019, 26 : 1051 - 1069