Combining audio and video metrics to assess audio-visual quality

被引:14
作者
Becerra Martinez, Helard A. [1 ]
Farias, Mylene C. Q. [2 ]
机构
[1] Univ Brasilia UnB, Dept Comp Sci, Campus Univ Darcy Ribeiro, BR-70919970 Brasilia, DF, Brazil
[2] Univ Brasilia UnB, Dept Elect Engn, Campus Univ Darcy Ribeiro, BR-70919970 Brasilia, DF, Brazil
关键词
Video quality metrics; Audio quality metrics; Audio-visual quality metrics; Qoe; Multimedia quality assessment; STANDARD;
D O I
10.1007/s11042-018-5656-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this work, we studied the use of combination models to integrate audio and video quality estimates to predict the overall audio-visual quality. More specifically, an overall quality prediction for an audio-visual signal is obtained by combining the outputs of individual audio and video quality metrics with either a linear, a Minkowski, or a power function. A total of 7 different video quality metrics are considered, from which 3 are Full-Reference and 4 are No-Reference. Similarly, a total of 4 audio quality metrics are tested, 2 of which are Full-Reference and 2 are No-Reference. In total, we tested 18 Full-Reference audio-visual combination metrics and 24 No-Reference audio-visual combination metrics. The performance of all combination metrics are tested on two different audio-visual databases. Therefore, besides analysing the performance of a set of individual audio and video quality metrics, we analyzed the performance of the models that combine these audio and video quality metrics. This work gives an important contribution to the area of audio-visual quality assessment, since previous works either tested combination models only on subjective quality scores or used linear models to combine the outputs of a limited number of audio and video quality metrics.
引用
收藏
页码:23993 / 24012
页数:20
相关论文
共 50 条
[41]   Guaranteeing QoE in audio-video transmission by IEEE 802.11e HCCA [J].
Noh, Zul Azri Bin Muhamad ;
Suzuki, Takahiro ;
Tasaka, Shuji .
IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2008, E91A (07) :1551-1561
[42]   A Cache Decision Policy for QoE Enhancement of Video and Audio Transmission over ICN/CCN [J].
Nunome, Toshiro ;
Kobayashi, Keisuke .
ITE TRANSACTIONS ON MEDIA TECHNOLOGY AND APPLICATIONS, 2023, 12 (01) :143-152
[43]   Evaluating Visual Attention and QoE for 360° videos with non-spatial and spatial audio [J].
Hirway, Amit ;
Qiao, Yuansong ;
Murray, Niall .
PROCEEDINGS OF THE 2024 15TH ACM MULTIMEDIA SYSTEMS CONFERENCE 2024, MMSYS 2024, 2024, :532-535
[44]   AsQM: Audio Streaming Quality Metric Based on Network Impairments and User Preferences [J].
dos Santos, Marcelo Rodrigo ;
Batista, Andreza Patricia ;
Rosa, Renata Lopes ;
Saadi, Muhammad ;
Melgarejo, Dick Carrillo ;
Rodriguez, Demostenes Zegarra .
IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2023, 69 (03) :408-420
[45]   QoE Assessment of Multi-view Video and Audio IP Transmission Methods from Multipoint [J].
Nunome, Toshiro ;
Nakagaito, Masaya .
PROCEEDINGS OF 2019 2ND INTERNATIONAL CONFERENCE ON COMMUNICATION ENGINEERING AND TECHNOLOGY (ICCET 2019), 2019, :88-92
[46]   AMBIQUAL: Towards a Quality Metric for Headphone Rendered Compressed Ambisonic Spatial Audio [J].
Narbutt, Miroslaw ;
Skoglund, Jan ;
Allen, Andrew ;
Chinen, Michael ;
Barry, Dan ;
Hines, Andrew .
APPLIED SCIENCES-BASEL, 2020, 10 (09)
[47]   A Dual Rig Approach for Multi-View Video and Spatialized Audio Capture in Medical Training [J].
Maraval, Joshua ;
Wei, Bangning ;
Pesce, David ;
Gayral, Yann ;
Outtas, Meriem ;
Ramin, Nicolas ;
Zhang, Lu .
2024 16TH INTERNATIONAL CONFERENCE ON QUALITY OF MULTIMEDIA EXPERIENCE, QOMEX 2024, 2024, :274-277
[48]   The Effect of Bandwidth Allocation Methods on QoE of Multi-View Video and Audio IP Transmission [J].
Nunome, Toshiro ;
Furukawa, Keita .
2017 IEEE 22ND INTERNATIONAL WORKSHOP ON COMPUTER AIDED MODELING AND DESIGN OF COMMUNICATION LINKS AND NETWORKS (CAMAD), 2017,
[49]   The Effects of Camera Arrangements and Contents on QoE in Multi-View Video and Audio IP Transmission [J].
Yamamoto, Makoto ;
Nunome, Toshiro ;
Tasaka, Shuji .
TENCON 2010: 2010 IEEE REGION 10 CONFERENCE, 2010, :1450-1455
[50]   The Effect of Cache Decision Policies on QoE in Multiple Video and Audio Streaming over ICN/CCN [J].
Kobayashi, Keisuke ;
Nunome, Toshiro .
2024 IEEE INTERNATIONAL MEDITERRANEAN CONFERENCE ON COMMUNICATIONS AND NETWORKING, MEDITCOM 2024, 2024, :523-528