High-quality Frame Recurrent Video De-raining with Multi-contextual Adversarial Network

被引:1
作者
Sharma, Prasen Kumar [1 ]
Ghosh, Sujoy [1 ]
Sur, Arijit [1 ]
机构
[1] Indian Inst Technol Guwahati, Dept Comp Sci & Eng, Multimedia Lab, Bongara 781039, Assam, India
关键词
Video de-raining; deep learning; generative adversarial network; STRUCTURAL SIMILARITY; IMAGE; REMOVAL;
D O I
10.1145/3444974
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this article, we address the problem of rain-streak removal in the videos. Unlike the image, challenges in video restoration comprise temporal consistency besides spatial enhancement. The researchers across the world have proposed several effective methods for estimating the de-noised videos with outstanding temporal consistency. However, such methods also amplify the computational cost due to their larger size. By way of analysis, incorporating separate modules for spatial and temporal enhancement may require more computational resources. It motivates us to propose a unified architecture that directly estimates the de-rained frame with maximal visual quality and minimal computational cost. To this end, we present a deep learning-based Frame-recurrent Multi-contextual Adversarial Network for rain-streak removal in videos. The proposed model is built upon a Conditional Generative Adversarial Network (CGAN)-based framework where the generator model directly estimates the de-rained frame from the previously estimated one with the help of its multicontextual adversary. To optimize the proposed model, we have incorporated the Perceptual loss function in addition to the conventional Euclidean distance. Also, instead of traditional entropy loss from the adversary, we propose to use the Euclidean distance between the features of de-rained and clean frames, extracted from the discriminator model as a cost function for video de-raining. Various experimental observations across 11 test sets, with over 10 state-of-the-art methods, using 14 image-quality metrics, prove the efficacy of the proposed work, both visually and computationally.
引用
收藏
页数:24
相关论文
共 47 条
[1]  
[Anonymous], 2015, P INT C LEARN REPR
[2]  
[Anonymous], ACM T MULTIMEDIA COM, V17
[3]  
[Anonymous], 2017, NIPS 2017 WORKSH AUT
[4]   Analysis of Rain and Snow in Frequency Space [J].
Barnum, Peter C. ;
Narasimhan, Srinivasa ;
Kanade, Takeo .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 86 (2-3) :256-274
[5]   Robust Video Content Alignment and Compensation for Rain Removal in a CNN Framework [J].
Chen, Jie ;
Tan, Cheen-Hau ;
Hou, Junhui ;
Chau, Lap-Pui ;
Li, He .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :6286-6295
[6]   A Rain Pixel Recovery Algorithm for Videos With Highly Dynamic Scenes [J].
Chen, Jie ;
Chau, Lap-Pui .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2014, 23 (03) :1097-1104
[7]   Image Denoising via CNNs: An Adversarial Approach [J].
Divakar, Nithish ;
Babu, R. Venkatesh .
2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, :1076-1083
[8]   Removing rain from single images via a deep detail network [J].
Fu, Xueyang ;
Huang, Jiabin ;
Zeng, Delu ;
Huang, Yue ;
Ding, Xinghao ;
Paisley, John .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1715-1723
[9]  
Garg K, 2005, IEEE I CONF COMP VIS, P1067
[10]  
Garg K, 2004, PROC CVPR IEEE, P528