Video quality assessment with dense features and ranking pooling

被引:6
作者
Zhang, Yu [1 ]
He, Lihuo [1 ]
Lu, Wen [1 ]
Li, Jie [1 ]
Gao, Xinbo [1 ]
机构
[1] Xidian Univ, Sch Elect Engn, Video & Image Proc Syst Lab, Xian 710071, Peoples R China
关键词
Video quality assessment; DenseNet; Spatial pyramid pooling; Learning to rank; SIMILARITY;
D O I
10.1016/j.neucom.2021.06.026
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Benefiting with the rapid development of communication networks, effective video quality assessment (VQA) models which provide guidance for video transmission and compression technologies are highly demanded. This paper proposes a general-purpose full-reference VQA method combining DenseNet with spatial pyramid pooling and RankNet to not only extract high-level distortion representation and global spatial information of samples but also characterize the temporal correlation among frames. Firstly, the pretrained DenseNet is modified and finetuned to extract high-level features of distorBenefiting with the rapid development of communication networks, effective video quality assessment (VQA) models which provide guidance for video transmission and compression technologies are highly demanded. This paper proposes a general-purpose full-reference VQA method combining DenseNet with spatial pyramid pooling and RankNet to not only extract high-level distortion representation and global spatial information of samples but also characterize the temporal correlation among frames. Firstly, the pretrained DenseNet is modified and finetuned to extract high-level features of distorted videos. Then, spatial pyramid pooling is equipped in the DenseNet module to process flexible inputs with arbitrary spatial resolution. Thus, this kind of input which has the same spatial resolution as the original distorted video is processed by the well-trained DenseNet to generate frame-level quality, which considers the global spatial information of videos directly. Finally, learning to rank is introduced to explore the high-level temporal correlation of distorted videos by taking the RankNet as the temporal pooling function. The experimental results on two public VQA databases show that the proposed algorithm performs consistently with human visual perception. (c) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页码:242 / 253
页数:12
相关论文
共 47 条
  • [1] [Anonymous], 2018, P EUR C COMP VIS ECC
  • [2] Burges C.J.C., 2005, P INT C MACHINE LEAR, V119, P89
  • [3] Cao Z., 2007, P 24 INT C MACH LEAR, V227, P129, DOI DOI 10.1145/1273496.1273513
  • [4] Video quality assessment accounting for temporal visual masking of local flicker
    Choi, Lark Kwon
    Bovik, Alan Conrad
    [J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2018, 67 : 182 - 198
  • [5] Image quality assessment based on multi-feature extraction and synthesis with support vector regression
    Ding, Yong
    Zhao, Yang
    Zhao, Xinyu
    [J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2017, 54 : 81 - 92
  • [6] Blind image quality assessment via learnable attention-based pooling
    Gu, Jie
    Meng, Gaofeng
    Xiang, Shiming
    Pan, Chunhong
    [J]. PATTERN RECOGNITION, 2019, 91 : 332 - 344
  • [7] Video quality assessment by compact representation of energy in 3D-DCT domain
    He, Lihuo
    Lu, Wen
    Jia, Changcheng
    Hao, Lei
    [J]. NEUROCOMPUTING, 2017, 269 : 108 - 116
  • [8] No-reference video quality evaluation by a deep transfer CNN architecture
    Hou, Rui
    Zhao, YunHao
    Hu, Yang
    Liu, Huan
    [J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2020, 83
  • [9] Objective Video Quality Assessment Based on Perceptually Weighted Mean Squared Error
    Hu, Sudeng
    Jin, Lina
    Wang, Hanli
    Zhang, Yun
    Kwong, Sam
    Kuo, C. -C. Jay
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2017, 27 (09) : 1844 - 1855
  • [10] Densely Connected Convolutional Networks
    Huang, Gao
    Liu, Zhuang
    van der Maaten, Laurens
    Weinberger, Kilian Q.
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 2261 - 2269