Deep Video Dehazing With Semantic Segmentation

被引:168
作者
Ren, Wenqi [1 ]
Zhang, Jingang [2 ]
Xu, Xiangyu [3 ]
Ma, Lin [5 ]
Cao, Xiaochun [1 ]
Meng, Gaofeng [4 ]
Liu, Wei [5 ]
机构
[1] Chinese Acad Sci, Inst Informat Engn, State Key Lab Informat Secur, Beijing 100093, Peoples R China
[2] Univ Chinese Acad Sci, Med Sch, Beijing 100080, Peoples R China
[3] SenseTime Res, Beijing 100084, Peoples R China
[4] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
[5] Tencent AI Lab, Shenzhen 518057, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Video dehazing; defogging; transmission map; convolutional neural network; IMAGE; FUSION;
D O I
10.1109/TIP.2018.2876178
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent research have shown the potential of using convolutional neural networks (CNNs) to accomplish single image dehazing. In this paper, we take one step further to explore the possibility of exploiting a network to perform haze removal for videos. Unlike single image dehazing, video-based approaches can take advantage of the abundant information that exists across neighboring frames. In this paper, assuming that a scene point yields highly correlated transmission values between adjacent video frames, we develop a deep learning solution for video dehazing, where a CNN is trained end-to-end to learn how to accumulate information across frames for transmission estimation. The estimated transmission map is subsequently used to recover a haze-free frame via atmospheric scattering model. In addition, as the semantic information of a scene provides a strong prior for image restoration, we propose to incorporate global semantic priors as input to regularize the transmission maps so that the estimated maps can be smooth in the regions of the same object and only discontinuous across the boundaries of different objects. To train this network, we generate a dataset consisted of synthetic hazy and haze-free videos for supervision based on the NYU depth dataset. We show that the features learned from this dataset are capable of removing haze that arises in outdoor scenes in a wide range of videos. Extensive experiments demonstrate that the proposed algorithm performs favorably against the state-of-the-art methods on both synthetic and real-world videos.
引用
收藏
页码:1895 / 1908
页数:14
相关论文
共 61 条
[11]   DehazeNet: An End-to-End System for Single Image Haze Removal [J].
Cai, Bolun ;
Xu, Xiangmin ;
Jia, Kui ;
Qing, Chunmei ;
Tao, Dacheng .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (11) :5187-5198
[12]   Scene Text Deblurring Using Text-Specific Multiscale Dictionaries [J].
Cao, Xiaochun ;
Ren, Wenqi ;
Zuo, Wangmeng ;
Guo, Xiaojie ;
Foroosh, Hassan .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 24 (04) :1302-1314
[13]  
Caraffa L, 2013, IEEE INT VEH SYM, P994, DOI 10.1109/IVS.2013.6629596
[14]   Robust Image and Video Dehazing with Visual Artifact Suppression via Gradient Residual Minimization [J].
Chen, Chen ;
Do, Minh N. ;
Wang, Jue .
COMPUTER VISION - ECCV 2016, PT II, 2016, 9906 :576-591
[15]  
Chen JY, 2018, 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), P162
[16]  
Cheng Ziang, 2018, SEMANTIC SINGLE IMAG
[17]   Referenceless Prediction of Perceptual Fog Density and Perceptual Image Defogging [J].
Choi, Lark Kwon ;
You, Jaehee ;
Bovik, Alan Conrad .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 24 (11) :3888-3901
[18]   Hand-Held Video Deblurring Via Efficient Fourier Aggregation [J].
Delbracio, Mauricio ;
Sapiro, Guillermo .
IEEE TRANSACTIONS ON COMPUTATIONAL IMAGING, 2015, 1 (04) :270-283
[19]   Two-Layer Gaussian Process Regression With Example Selection for Image Dehazing [J].
Fan, Xin ;
Wang, Yi ;
Tang, Xianxuan ;
Gao, Renjie ;
Luo, Zhongxuan .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2017, 27 (12) :2505-2517
[20]   Dehazing Using Color-Lines [J].
Fattal, Raanan .
ACM TRANSACTIONS ON GRAPHICS, 2014, 34 (01)