Study of Spatio-Temporal Modeling in Video Quality Assessment

被引:7
|
作者
Fang, Yuming [1 ]
Li, Zhaoqian [1 ]
Yan, Jiebin [1 ]
Sui, Xiangjie [1 ]
Liu, Hantao [2 ]
机构
[1] Jiangxi Univ Finance & Econ, Sch Informat Technol, Nanchang 330032, Jiangxi, Peoples R China
[2] Cardiff Univ, Sch Comp Sci & Informat, Cardiff CF24 3AA, Wales
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Video quality assessment; spatio-temporal modeling; recurrent neural network; PREDICTION; DATABASE; FLOW;
D O I
10.1109/TIP.2023.3272480
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video quality assessment (VQA) has received remarkable attention recently. Most of the popular VQA models employ recurrent neural networks (RNNs) to capture the temporal quality variation of videos. However, each long-term video sequence is commonly labeled with a single quality score, with which RNNs might not be able to learn long-term quality variation well: What's the real role of RNNs in learning the visual quality of videos? Does it learn spatio-temporal representation as expected or just aggregating spatial features redundantly? In this study, we conduct a comprehensive study by training a family of VQA models with carefully designed frame sampling strategies and spatio-temporal fusion methods. Our extensive experiments on four publicly available in- the-wild video quality datasets lead to two main findings. First, the plausible spatio-temporal modeling module (i. e., RNNs) does not facilitate quality-aware spatio-temporal feature learning. Second, sparsely sampled video frames are capable of obtaining the competitive performance against using all video frames as the input. In other words, spatial features play a vital role in capturing video quality variation for VQA. To our best knowledge, this is the first work to explore the issue of spatio-temporal modeling in VQA.
引用
收藏
页码:2693 / 2702
页数:10
相关论文
共 50 条
  • [31] Annoyance of spatio-temporal artifacts in segmentation quality assessment
    Gelasca, EDG
    Ebrahimi, T
    Farias, MCQ
    Carli, MC
    Mitra, SK
    ICIP: 2004 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1- 5, 2004, : 345 - 348
  • [32] Objective No-Reference Video Quality Assessment Method Based on Spatio-Temporal Pixel Analysis
    da Silva, Wyllian B.
    Fonseca, Keiko V. O.
    Pohl, Alexandre de A. P.
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2015, E98D (07): : 1325 - 1332
  • [33] VIDEO AND SPATIO-TEMPORAL PARAMETER ASSESSMENT OF GAIT AFTER ABOBOTULINUMTOXINA TREATMENT: A PILOT STUDY
    Esquenazi, Alberto
    Le, Stella
    TOXICON, 2021, 190 : S22 - S23
  • [34] The Impact of Temporal Pooling and Spatio-Temporal Quality Interaction on the Prediction Accuracy of Video Quality Metrics
    Wulf, Steffen
    Zoelzer, Udo
    2013 SIGNAL PROCESSING: ALGORITHMS, ARCHITECTURES, ARRANGEMENTS, AND APPLICATIONS (SPA), 2013, : 26 - 31
  • [35] Video Segmentation with Spatio-Temporal Tubes
    Trichet, Remi
    Nevatia, Ramakant
    2013 10TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS 2013), 2013, : 330 - 335
  • [36] Spatio-temporal segmentation for video surveillance
    Sun, HZ
    Tan, TN
    ELECTRONICS LETTERS, 2001, 37 (01) : 20 - 21
  • [37] Spatio-temporal segmentation for video surveillance
    Sun, HZ
    Feng, T
    Tan, TN
    15TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 1, PROCEEDINGS: COMPUTER VISION AND IMAGE ANALYSIS, 2000, : 843 - 846
  • [38] VideoZoom Spatio-Temporal Video Browser
    Smith, John R.
    IEEE TRANSACTIONS ON MULTIMEDIA, 1999, 1 (02) : 157 - 171
  • [39] Spatio-temporal video contrast enhancement
    Celik, Turgay
    IET IMAGE PROCESSING, 2013, 7 (06) : 543 - 555
  • [40] Spatio-Temporal Perturbations for Video Attribution
    Li, Zhenqiang
    Wang, Weimin
    Li, Zuoyue
    Huang, Yifei
    Sato, Yoichi
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (04) : 2043 - 2056