No-Reference Video Quality Assessment Metric Using Spatiotemporal Features Through LSTM

Cited by: 0
Authors
Kwong, Ngai-Wing [1]
Tsang, Sik-Ho [2]
Chan, Yui-Lam [1]
Lun, Daniel Pak-Kong [1,2]
Lee, Tsz-Kwan [3]
Affiliations
[1] Hong Kong Polytech Univ, Dept Elect & Informat Engn, Hong Kong, Peoples R China
[2] Ctr Adv Reliabil & Safety Ltd CAiRS, Hong Kong Sci Pk, Hong Kong, Peoples R China
[3] Deakin Univ, Sch Informat Technol, Deakin, Australia
Keywords
video quality assessment; no reference; long short-term memory; spatiotemporal; pre-padding; masking layer
DOI
10.1117/12.2590406
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
A precise video quality assessment (VQA) model is essential for maintaining quality of service (QoS). However, most existing VQA metrics are designed for specific purposes and ignore the spatiotemporal features of natural video. This paper proposes a novel general-purpose no-reference (NR) VQA metric, VQA-LSTM, which adopts Long Short-Term Memory (LSTM) modules with a masking layer and a pre-padding strategy to address these issues. First, we divide the distorted video into frames and extract significant yet universal spatial and temporal features that effectively reflect frame quality. Second, a data preprocessing stage and the pre-padding strategy are applied to ease the training of VQA-LSTM. Finally, a three-layer LSTM model incorporating a masking layer is designed to learn the sequence of spatial features as spatiotemporal features and the sequence of temporal features as the gradient of temporal features, in order to evaluate video quality. Two widely used VQA databases, MCL-V and LIVE, are used to test the robustness of VQA-LSTM, and the experimental results show that VQA-LSTM correlates better with human perception than some state-of-the-art approaches.
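The pre-padding and masking steps named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not state the padding value, the feature dimension, or the framework, so the function name `pre_pad`, the constant `PAD_VALUE = 0.0`, and the toy feature vectors below are all assumptions made for illustration. The sketch left-pads variable-length per-frame feature sequences to a common length and builds a boolean mask that a masking layer could use to skip the padded time steps.

```python
from typing import List, Sequence, Tuple

PAD_VALUE = 0.0  # assumed padding value; the abstract does not specify one


def pre_pad(
    videos: Sequence[Sequence[Sequence[float]]],
    pad_value: float = PAD_VALUE,
) -> Tuple[List[List[List[float]]], List[List[bool]]]:
    """Left-pad per-frame feature sequences to the longest sequence length.

    Returns the padded batch and a mask where True marks a real frame
    and False marks a padded time step.
    """
    max_len = max(len(frames) for frames in videos)
    feat_dim = len(videos[0][0])  # assumes every frame has the same feature size
    padded, mask = [], []
    for frames in videos:
        n_pad = max_len - len(frames)
        pad_rows = [[pad_value] * feat_dim for _ in range(n_pad)]
        # Padding goes BEFORE the real frames, so the last time steps
        # seen by a recurrent model are always real data.
        padded.append(pad_rows + [list(f) for f in frames])
        mask.append([False] * n_pad + [True] * len(frames))
    return padded, mask


# Two hypothetical videos with 3 frames and 1 frame, 2 features per frame.
batch = [
    [[0.1, 0.2], [0.3, 0.4], [0.5, 0.6]],
    [[0.7, 0.8]],
]
padded, mask = pre_pad(batch)
```

In a Keras-based setup, comparable behavior is available through `pad_sequences(..., padding='pre')` followed by a `Masking(mask_value=...)` layer ahead of the LSTM layers; whether the paper uses these particular utilities is not stated in the abstract.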
Pages: 6