Study of Spatio-Temporal Modeling in Video Quality Assessment

Cited by: 7
Authors
Fang, Yuming [1 ]
Li, Zhaoqian [1 ]
Yan, Jiebin [1 ]
Sui, Xiangjie [1 ]
Liu, Hantao [2 ]
Affiliations
[1] Jiangxi Univ Finance & Econ, Sch Informat Technol, Nanchang 330032, Jiangxi, Peoples R China
[2] Cardiff Univ, Sch Comp Sci & Informat, Cardiff CF24 3AA, Wales
Funding
China Postdoctoral Science Foundation; National Natural Science Foundation of China;
Keywords
Video quality assessment; spatio-temporal modeling; recurrent neural network; PREDICTION; DATABASE; FLOW;
DOI
10.1109/TIP.2023.3272480
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Video quality assessment (VQA) has received remarkable attention recently. Most popular VQA models employ recurrent neural networks (RNNs) to capture the temporal quality variation of videos. However, each long-term video sequence is commonly labeled with a single quality score, from which RNNs might not be able to learn long-term quality variation well. What is the real role of RNNs in learning the visual quality of videos? Do they learn spatio-temporal representations as expected, or do they just aggregate spatial features redundantly? In this study, we train a family of VQA models with carefully designed frame sampling strategies and spatio-temporal fusion methods. Our extensive experiments on four publicly available in-the-wild video quality datasets lead to two main findings. First, the plausible spatio-temporal modeling module (i.e., RNNs) does not facilitate quality-aware spatio-temporal feature learning. Second, sparsely sampled video frames can achieve performance competitive with using all video frames as the input. In other words, spatial features play a vital role in capturing video quality variation for VQA. To the best of our knowledge, this is the first work to explore the issue of spatio-temporal modeling in VQA.
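As an illustration only, and not the authors' code, the idea of scoring a video from sparsely sampled frames can be sketched in Python. The function names sample_frame_indices and mean_pool, the choice of evenly spaced segment centres, and the use of a plain average as the temporal fusion step are all assumptions made for this sketch; the abstract does not specify the sampling strategies or fusion methods that were actually tested.

```python
from typing import List, Sequence


def sample_frame_indices(num_frames: int, num_samples: int) -> List[int]:
    """Return indices of frames spread evenly over a video.

    Hypothetical helper: the video is split into num_samples equal-length
    segments and the frame nearest the centre of each segment is chosen.
    """
    if num_frames <= 0 or num_samples <= 0:
        raise ValueError("num_frames and num_samples must be positive")
    if num_samples >= num_frames:
        # Nothing to thin out: use every frame.
        return list(range(num_frames))
    return [int((i + 0.5) * num_frames / num_samples) for i in range(num_samples)]


def mean_pool(frame_scores: Sequence[float]) -> float:
    """Fuse per-frame quality scores with a simple average (assumed fusion)."""
    if not frame_scores:
        raise ValueError("frame_scores must not be empty")
    return sum(frame_scores) / len(frame_scores)


if __name__ == "__main__":
    # A 300-frame video reduced to 5 frames.
    indices = sample_frame_indices(300, 5)
    print(indices)
    # Per-frame scores would come from a spatial feature extractor;
    # these values are placeholders for the sketch.
    print(mean_pool([3.0, 3.5, 4.0, 3.5, 3.0]))
```

Comparing a model fed only the sampled frames against one fed every frame, and an averaging step against a recurrent one, is the kind of controlled comparison the abstract describes.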
Pages: 2693 - 2702
Number of pages: 10
Related papers
50 items in total
  • [21] No-reference Video Quality Assessment Based on Spatio-temporal Perception Feature Fusion
    Yaya Tan
    Guangqian Kong
    Xun Duan
    Huiyun Long
    Yun Wu
    Neural Processing Letters, 2023, 55 : 1317 - 1335
  • [22] Saliency-Aware Spatio-Temporal Artifact Detection for Compressed Video Quality Assessment
    Lin, Liqun
    Zheng, Yang
    Chen, Weiling
    Lan, Chengdong
    Zhao, Tiesong
    IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 693 - 697
  • [23] No-reference Video Quality Assessment Based on Spatio-temporal Perception Feature Fusion
    Tan, Yaya
    Kong, Guangqian
    Duan, Xun
    Long, Huiyun
    Wu, Yun
    NEURAL PROCESSING LETTERS, 2023, 55 (02) : 1317 - 1335
  • [24] Spatio-temporal Sampling for Video
    Shankar, Mohan
    Pitsianis, Nikos P.
    Brady, David
    IMAGE RECONSTRUCTION FROM INCOMPLETE DATA V, 2008, 7076
  • [25] Spatio-temporal modeling in video and multimedia geographic information systems
    Pissinou, N
    Radev, I
    Makki, K
    GEOINFORMATICA, 2001, 5 (04) : 375 - 409
  • [26] Spatio-Temporal Modeling in Video and Multimedia Geographic Information Systems
    Niki Pissinou
    Ivan Radev
    Kia Makki
    GeoInformatica, 2001, 5 : 375 - 409
  • [27] An Empirical Investigation of Efficient Spatio-Temporal Modeling in Video Restoration
    Fan, Yuchen
    Yu, Jiahui
    Liu, Ding
    Huang, Thomas S.
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 2159 - 2168
  • [28] ANALYSIS OF VIDEO QUALITY INDUCED SPATIO-TEMPORAL SALIENCY SHIFTS
    Wu, Xinbo
    Dong, Zhengyan
    Zhang, Fan
    Rosin, Paul L.
    Liu, Hantao
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 1581 - 1585
  • [29] Spatio-Temporal Deformable Convolution for Compressed Video Quality Enhancement
    Deng, Jianing
    Wang, Li
    Pu, Shiliang
    Zhuo, Cheng
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 10696 - 10703
  • [30] Using multiple spatio-temporal features to estimate video quality
    Freitas, Pedro Garcia
    Akamine, Welington Y. L.
    Farias, Mylene C. Q.
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2018, 64 : 1 - 10