MFQE 2.0: A New Approach for Multi-Frame Quality Enhancement on Compressed Video

被引:186
作者
Guan, Zhenyu [1 ]
Xing, Qunliang [1 ]
Xu, Mai [1 ,2 ]
Yang, Ren [1 ]
Liu, Tie [1 ]
Wang, Zulin [1 ]
机构
[1] Beihang Univ, Beijing 100191, Peoples R China
[2] Beihang Univ, Hangzhou Innovat Inst, Beijing, Peoples R China
关键词
Transform coding; Image coding; Databases; MPEG; 1; Standard; Task analysis; Video recording; Quality enhancement; compressed video; deep learning; MOTION COMPENSATION; SUPERRESOLUTION; ARTIFACTS; DCT;
D O I
10.1109/TPAMI.2019.2944806
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The past few years have witnessed great success in applying deep learning to enhance the quality of compressed image/video. The existing approaches mainly focus on enhancing the quality of a single frame, not considering the similarity between consecutive frames. Since heavy fluctuation exists across compressed video frames as investigated in this paper, frame similarity can be utilized for quality enhancement of low-quality frames given their neighboring high-quality frames. This task is Multi-Frame Quality Enhancement (MFQE). Accordingly, this paper proposes an MFQE approach for compressed video, as the first attempt in this direction. In our approach, we first develop a Bidirectional Long Short-Term Memory (BiLSTM) based detector to locate Peak Quality Frames (PQFs) in compressed video. Then, a novel Multi-Frame Convolutional Neural Network (MF-CNN) is designed to enhance the quality of compressed video, in which the non-PQF and its nearest two PQFs are the input. In MF-CNN, motion between the non-PQF and PQFs is compensated by a motion compensation subnet. Subsequently, a quality enhancement subnet fuses the non-PQF and compensated PQFs, and then reduces the compression artifacts of the non-PQF. Also, PQF quality is enhanced in the same way. Finally, experiments validate the effectiveness and generalization ability of our MFQE approach in advancing the state-of-the-art quality enhancement of compressed video.
引用
收藏
页码:949 / 963
页数:15
相关论文
共 57 条
[21]   FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks [J].
Ilg, Eddy ;
Mayer, Nikolaus ;
Saikia, Tonmoy ;
Keuper, Margret ;
Dosovitskiy, Alexey ;
Brox, Thomas .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1647-1655
[22]  
Ioffe S., 2015, PMLR, V37, P448
[23]  
Jancsary J, 2012, LECT NOTES COMPUT SC, V7578, P112, DOI 10.1007/978-3-642-33786-4_9
[24]   Image deblocking via sparse representation [J].
Jung, Cheolkon ;
Jiao, Licheng ;
Qi, Hongtao ;
Sun, Tian .
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2012, 27 (06) :663-677
[25]   Video Super-Resolution With Convolutional Neural Networks [J].
Kappeler, Armin ;
Yoo, Seunghwan ;
Dai, Qiqin ;
Katsaggelos, Aggelos K. .
IEEE TRANSACTIONS ON COMPUTATIONAL IMAGING, 2016, 2 (02) :109-122
[26]  
Kingma DP, 2015, C TRACK P
[27]  
Le Gall D. J., 1992, Signal Processing: Image Communication, V4, P129, DOI 10.1016/0923-5965(92)90019-C
[28]   Gradient-based learning applied to document recognition [J].
Lecun, Y ;
Bottou, L ;
Bengio, Y ;
Haffner, P .
PROCEEDINGS OF THE IEEE, 1998, 86 (11) :2278-2324
[29]   Video Superresolution via Motion Compensation and Deep Residual Learning [J].
Li, Dingyi ;
Wang, Zengfu .
IEEE TRANSACTIONS ON COMPUTATIONAL IMAGING, 2017, 3 (04) :749-762
[30]  
Li K, 2017, IEEE INT CON MULTI, P1320, DOI 10.1109/ICME.2017.8019416