DEEP NEURAL NETWORKS FOR NO-REFERENCE VIDEO QUALITY ASSESSMENT

被引:0
作者
You, Junyong [1 ]
Korhonen, Jari [2 ]
机构
[1] Norwegian Res Ctr NORCE, Bergen, Norway
[2] Shenzhen Univ, Shenzhen, Peoples R China
来源
2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) | 2019年
关键词
3D-CNN; deep learning; LSTM; video quality assessment; PREDICTION;
D O I
10.1109/icip.2019.8803395
中图分类号
TB8 [摄影技术];
学科分类号
0804 ;
摘要
Video quality assessment (VQA) is a challenging task due to the complexity of modeling perceived quality characteristics in both spatial and temporal domains. A novel no-reference (NR) video quality metric (VQM) is proposed in this paper based on two deep neural networks (NN), namely 3D convolution network (3D-CNN) and a recurrent NN composed of long short-term memory (LSTM) units. 3D-CNNs are utilized to extract local spatiotemporal features from small cubic clips in video, and the features are then fed into the LSTM networks to predict the perceived video quality. Such design can elaborately tackle the issue of insufficient training data whilst also efficiently capture perceptive quality features in both spatial and temporal domains. Experimental results with respect to two publicly available video quality datasets have demonstrate that the proposed quality metric outperforms the other compared NR quality metrics.
引用
收藏
页码:2349 / 2353
页数:5
相关论文
共 28 条
[1]   On the use of deep learning for blind image quality assessment [J].
Bianco, Simone ;
Celona, Luigi ;
Napoletano, Paolo ;
Schettini, Raimondo .
SIGNAL IMAGE AND VIDEO PROCESSING, 2018, 12 (02) :355-362
[2]   Deep Neural Networks for No-Reference and Full-Reference Image Quality Assessment [J].
Bosse, Sebastian ;
Maniry, Dominique ;
Mueller, Klaus-Robert ;
Wiegand, Thomas ;
Samek, Wojciech .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (01) :206-219
[3]   Prediction of Transmission Distortion for Wireless Video Communication: Analysis [J].
Chen, Zhifeng ;
Wu, Dapeng .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2012, 21 (03) :1123-1137
[4]   Learning Spatiotemporal Features with 3D Convolutional Networks [J].
Du Tran ;
Bourdev, Lubomir ;
Fergus, Rob ;
Torresani, Lorenzo ;
Paluri, Manohar .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :4489-4497
[5]   In-Capture Mobile Video Distortions: A Study of Subjective Behavior and Objective Algorithms [J].
Ghadiyaram, Deepti ;
Pan, Janice ;
Bovik, Alan C. ;
Moorthy, Anush Krishna ;
Panda, Prasanjit ;
Yang, Kai-Chieh .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2018, 28 (09) :2061-2077
[6]   Perceptual quality prediction on authentically distorted images using a bag of features approach [J].
Ghadiyaram, Deepti ;
Bovik, Alan C. .
JOURNAL OF VISION, 2017, 17 (01)
[7]  
Giannopoulos M., 2018, SIGNAL PROCESS UNPUB
[8]  
Hara K., 2018, P IEEE INT C COMP VI
[9]  
Hosu V, 2017, INT WORK QUAL MULTIM
[10]  
Karpathy A., 2014, P IEEE INT C COMP VI