Combining audio and video metrics to assess audio-visual quality

被引:14
作者
Becerra Martinez, Helard A. [1 ]
Farias, Mylene C. Q. [2 ]
机构
[1] Univ Brasilia UnB, Dept Comp Sci, Campus Univ Darcy Ribeiro, BR-70919970 Brasilia, DF, Brazil
[2] Univ Brasilia UnB, Dept Elect Engn, Campus Univ Darcy Ribeiro, BR-70919970 Brasilia, DF, Brazil
关键词
Video quality metrics; Audio quality metrics; Audio-visual quality metrics; Qoe; Multimedia quality assessment; STANDARD;
D O I
10.1007/s11042-018-5656-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this work, we studied the use of combination models to integrate audio and video quality estimates to predict the overall audio-visual quality. More specifically, an overall quality prediction for an audio-visual signal is obtained by combining the outputs of individual audio and video quality metrics with either a linear, a Minkowski, or a power function. A total of 7 different video quality metrics are considered, from which 3 are Full-Reference and 4 are No-Reference. Similarly, a total of 4 audio quality metrics are tested, 2 of which are Full-Reference and 2 are No-Reference. In total, we tested 18 Full-Reference audio-visual combination metrics and 24 No-Reference audio-visual combination metrics. The performance of all combination metrics are tested on two different audio-visual databases. Therefore, besides analysing the performance of a set of individual audio and video quality metrics, we analyzed the performance of the models that combine these audio and video quality metrics. This work gives an important contribution to the area of audio-visual quality assessment, since previous works either tested combination models only on subjective quality scores or used linear models to combine the outputs of a limited number of audio and video quality metrics.
引用
收藏
页码:23993 / 24012
页数:20
相关论文
共 50 条
[21]   Video Transmission and Presentation Methods for Multi-View Video and Audio IP Transmission [J].
Nunome, Toshiro ;
Ishida, Takuya .
2014 8TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ICSPCS), 2014,
[22]   Perceptual Coding of High-Quality Digital Audio [J].
Brandenburg, Karlheinz ;
Faller, Christof ;
Herre, Juergen ;
Johnston, James D. ;
Kleijn, W. Bastiaan .
PROCEEDINGS OF THE IEEE, 2013, 101 (09) :1905-1919
[23]   Effect of Packet Loss and Reorder on Quality of Audio Streaming [J].
Laghari, Asif Ali ;
Laghari, Rashid Ali ;
Wagan, Asif Ali ;
Umrani, Aamir Iqbal .
EAI ENDORSED TRANSACTIONS ON SCALABLE INFORMATION SYSTEMS, 2020, 7 (24) :1-7
[24]   A robust method for estimating synchronization and delay of audio and video for communication services [J].
Andreas Rossholm ;
Benny Lövström .
Multimedia Tools and Applications, 2016, 75 :527-545
[25]   QoE Assessment of Multi-View Video and Audio IP Transmission [J].
Rodriguez, Erick Jimenez ;
Nunome, Toshiro ;
Tasaka, Shuji .
IEICE TRANSACTIONS ON COMMUNICATIONS, 2010, E93B (06) :1373-1383
[26]   A robust method for estimating synchronization and delay of audio and video for communication services [J].
Rossholm, Andreas ;
Lovstrom, Benny .
MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (01) :527-545
[27]   A QoE and Visual Attention Evaluation on the Influence of Spatial Audio in 360 Videos [J].
Hirway, Amit ;
Qiao, Yuansong ;
Murray, Niall .
2020 IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND VIRTUAL REALITY (AIVR 2020), 2020, :345-350
[28]   Demo: A QoE and Visual Attention Evaluation on the Influence of Audio in 360° Videos [J].
Hirway, Amit ;
Qiao, Yuansong ;
Murray, Niall .
2020 21ST IEEE INTERNATIONAL SYMPOSIUM ON A WORLD OF WIRELESS, MOBILE AND MULTIMEDIA NETWORKS (IEEE WOWMOM 2020), 2020, :191-193
[29]   USING OVERLAPPING SUBJECTIVE DATASETS TO ASSESS THE PERFORMANCE OF OBJECTIVE QUALITY METRICS ON SCALABLE VIDEO CODING AND ERROR CONCEALMENT [J].
Pitrey, Y. ;
Pepion, R. ;
Le Callet, P. ;
Barkowsky, M. .
2012 Fourth International Workshop on Quality of Multimedia Experience (QoMEX), 2012, :103-108
[30]   The Impact of ICN/CCN Cache Decision Policies on Video and Audio Transmission QoE [J].
Kobayashi, Keisuke ;
Nunome, Toshiro .
2022 32ND INTERNATIONAL TELECOMMUNICATION NETWORKS AND APPLICATIONS CONFERENCE (ITNAC), 2022, :207-212