MFQE 2.0: A New Approach for Multi-Frame Quality Enhancement on Compressed Video

被引:189
作者
Guan, Zhenyu [1 ]
Xing, Qunliang [1 ]
Xu, Mai [1 ,2 ]
Yang, Ren [1 ]
Liu, Tie [1 ]
Wang, Zulin [1 ]
机构
[1] Beihang Univ, Beijing 100191, Peoples R China
[2] Beihang Univ, Hangzhou Innovat Inst, Beijing, Peoples R China
关键词
Transform coding; Image coding; Databases; MPEG; 1; Standard; Task analysis; Video recording; Quality enhancement; compressed video; deep learning; MOTION COMPENSATION; SUPERRESOLUTION; ARTIFACTS; DCT;
D O I
10.1109/TPAMI.2019.2944806
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The past few years have witnessed great success in applying deep learning to enhance the quality of compressed image/video. The existing approaches mainly focus on enhancing the quality of a single frame, not considering the similarity between consecutive frames. Since heavy fluctuation exists across compressed video frames as investigated in this paper, frame similarity can be utilized for quality enhancement of low-quality frames given their neighboring high-quality frames. This task is Multi-Frame Quality Enhancement (MFQE). Accordingly, this paper proposes an MFQE approach for compressed video, as the first attempt in this direction. In our approach, we first develop a Bidirectional Long Short-Term Memory (BiLSTM) based detector to locate Peak Quality Frames (PQFs) in compressed video. Then, a novel Multi-Frame Convolutional Neural Network (MF-CNN) is designed to enhance the quality of compressed video, in which the non-PQF and its nearest two PQFs are the input. In MF-CNN, motion between the non-PQF and PQFs is compensated by a motion compensation subnet. Subsequently, a quality enhancement subnet fuses the non-PQF and compensated PQFs, and then reduces the compression artifacts of the non-PQF. Also, PQF quality is enhanced in the same way. Finally, experiments validate the effectiveness and generalization ability of our MFQE approach in advancing the state-of-the-art quality enhancement of compressed video.
引用
收藏
页码:949 / 963
页数:15
相关论文
共 56 条
[1]  
[Anonymous], 2019, CISC VIS NETW IND GL, P1
[2]   Study of Temporal Effects on Subjective Video Quality of Experience [J].
Bampis, Christos George ;
Li, Zhi ;
Moorthy, Anush Krishna ;
Katsavounidis, Ioannis ;
Aaron, Anne ;
Bovik, Alan Conrad .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (11) :5217-5231
[3]  
Bossen F., 2013, JCTVC-L1100, V12
[4]   Super resolution of video using key frames [J].
Brandi, Fernanda ;
de Queiroz, Ricardo ;
Mukherjee, Debargha .
PROCEEDINGS OF 2008 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-10, 2008, :1608-+
[5]   Real-Time Video Super-Resolution with Spatio-Temporal Networks and Motion Compensation [J].
Caballero, Jose ;
Ledig, Christian ;
Aitken, Andrew ;
Acosta, Alejandro ;
Totz, Johannes ;
Wang, Zehan ;
Shi, Wenzhe .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :2848-2857
[6]  
Cavigelli L, 2017, IEEE IJCNN, P752, DOI 10.1109/IJCNN.2017.7965927
[7]   Reducing Artifacts in JPEG Decompression Via a Learned Dictionary [J].
Chang, Huibin ;
Ng, Michael K. ;
Zeng, Tieyong .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2014, 62 (03) :718-728
[8]   A Convolutional Neural Network Approach for Post-Processing in HEVC Intra Coding [J].
Dai, Yuanying ;
Liu, Dong ;
Wu, Feng .
MULTIMEDIA MODELING (MMM 2017), PT I, 2017, 10132 :28-39
[9]  
Das Gupta M, 2005, PROC CVPR IEEE, P638
[10]  
De Vito F, 2005, 2005 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT), VOLS 1 AND 2, P612