Combining audio and video metrics to assess audio-visual quality

被引:14
作者
Becerra Martinez, Helard A. [1 ]
Farias, Mylene C. Q. [2 ]
机构
[1] Univ Brasilia UnB, Dept Comp Sci, Campus Univ Darcy Ribeiro, BR-70919970 Brasilia, DF, Brazil
[2] Univ Brasilia UnB, Dept Elect Engn, Campus Univ Darcy Ribeiro, BR-70919970 Brasilia, DF, Brazil
关键词
Video quality metrics; Audio quality metrics; Audio-visual quality metrics; Qoe; Multimedia quality assessment; STANDARD;
D O I
10.1007/s11042-018-5656-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this work, we studied the use of combination models to integrate audio and video quality estimates to predict the overall audio-visual quality. More specifically, an overall quality prediction for an audio-visual signal is obtained by combining the outputs of individual audio and video quality metrics with either a linear, a Minkowski, or a power function. A total of 7 different video quality metrics are considered, from which 3 are Full-Reference and 4 are No-Reference. Similarly, a total of 4 audio quality metrics are tested, 2 of which are Full-Reference and 2 are No-Reference. In total, we tested 18 Full-Reference audio-visual combination metrics and 24 No-Reference audio-visual combination metrics. The performance of all combination metrics are tested on two different audio-visual databases. Therefore, besides analysing the performance of a set of individual audio and video quality metrics, we analyzed the performance of the models that combine these audio and video quality metrics. This work gives an important contribution to the area of audio-visual quality assessment, since previous works either tested combination models only on subjective quality scores or used linear models to combine the outputs of a limited number of audio and video quality metrics.
引用
收藏
页码:23993 / 24012
页数:20
相关论文
共 50 条
  • [21] Video Transmission and Presentation Methods for Multi-View Video and Audio IP Transmission
    Nunome, Toshiro
    Ishida, Takuya
    2014 8TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ICSPCS), 2014,
  • [22] Perceptual Coding of High-Quality Digital Audio
    Brandenburg, Karlheinz
    Faller, Christof
    Herre, Juergen
    Johnston, James D.
    Kleijn, W. Bastiaan
    PROCEEDINGS OF THE IEEE, 2013, 101 (09) : 1905 - 1919
  • [23] Effect of Packet Loss and Reorder on Quality of Audio Streaming
    Laghari, Asif Ali
    Laghari, Rashid Ali
    Wagan, Asif Ali
    Umrani, Aamir Iqbal
    EAI ENDORSED TRANSACTIONS ON SCALABLE INFORMATION SYSTEMS, 2020, 7 (24): : 1 - 7
  • [24] QoE Assessment of Multi-View Video and Audio IP Transmission
    Rodriguez, Erick Jimenez
    Nunome, Toshiro
    Tasaka, Shuji
    IEICE TRANSACTIONS ON COMMUNICATIONS, 2010, E93B (06) : 1373 - 1383
  • [25] A robust method for estimating synchronization and delay of audio and video for communication services
    Andreas Rossholm
    Benny Lövström
    Multimedia Tools and Applications, 2016, 75 : 527 - 545
  • [26] A robust method for estimating synchronization and delay of audio and video for communication services
    Rossholm, Andreas
    Lovstrom, Benny
    MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (01) : 527 - 545
  • [27] Demo: A QoE and Visual Attention Evaluation on the Influence of Audio in 360° Videos
    Hirway, Amit
    Qiao, Yuansong
    Murray, Niall
    2020 21ST IEEE INTERNATIONAL SYMPOSIUM ON A WORLD OF WIRELESS, MOBILE AND MULTIMEDIA NETWORKS (IEEE WOWMOM 2020), 2020, : 191 - 193
  • [28] USING OVERLAPPING SUBJECTIVE DATASETS TO ASSESS THE PERFORMANCE OF OBJECTIVE QUALITY METRICS ON SCALABLE VIDEO CODING AND ERROR CONCEALMENT
    Pitrey, Y.
    Pepion, R.
    Le Callet, P.
    Barkowsky, M.
    2012 Fourth International Workshop on Quality of Multimedia Experience (QoMEX), 2012, : 103 - 108
  • [29] A QoE and Visual Attention Evaluation on the Influence of Spatial Audio in 360 Videos
    Hirway, Amit
    Qiao, Yuansong
    Murray, Niall
    2020 IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND VIRTUAL REALITY (AIVR 2020), 2020, : 345 - 350
  • [30] The Impact of ICN/CCN Cache Decision Policies on Video and Audio Transmission QoE
    Kobayashi, Keisuke
    Nunome, Toshiro
    2022 32ND INTERNATIONAL TELECOMMUNICATION NETWORKS AND APPLICATIONS CONFERENCE (ITNAC), 2022, : 207 - 212