DEEP NEURAL NETWORKS FOR NO-REFERENCE VIDEO QUALITY ASSESSMENT

被引：0

作者：

You, Junyong ^{[1
]}

Korhonen, Jari ^{[2
]}

机构：

[1] Norwegian Res Ctr NORCE, Bergen, Norway

[2] Shenzhen Univ, Shenzhen, Peoples R China

来源：

2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) | 2019年

关键词：

3D-CNN; deep learning; LSTM; video quality assessment; PREDICTION;

D O I：

10.1109/icip.2019.8803395

中图分类号：

TB8 [摄影技术];

学科分类号：

0804 ;

摘要：

Video quality assessment (VQA) is a challenging task due to the complexity of modeling perceived quality characteristics in both spatial and temporal domains. A novel no-reference (NR) video quality metric (VQM) is proposed in this paper based on two deep neural networks (NN), namely 3D convolution network (3D-CNN) and a recurrent NN composed of long short-term memory (LSTM) units. 3D-CNNs are utilized to extract local spatiotemporal features from small cubic clips in video, and the features are then fed into the LSTM networks to predict the perceived video quality. Such design can elaborately tackle the issue of insufficient training data whilst also efficiently capture perceptive quality features in both spatial and temporal domains. Experimental results with respect to two publicly available video quality datasets have demonstrate that the proposed quality metric outperforms the other compared NR quality metrics.

引用

页码：2349 / 2353

页数：5

共 28 条

[1] On the use of deep learning for blind image quality assessment [J].