Deep Video Dehazing With Semantic Segmentation

被引：168

作者：

Ren, Wenqi ^{[1
]}

Zhang, Jingang ^{[2
]}

Xu, Xiangyu ^{[3
]}

Ma, Lin ^{[5
]}

Cao, Xiaochun ^{[1
]}

Meng, Gaofeng ^{[4
]}

Liu, Wei ^{[5
]}

机构：

[1] Chinese Acad Sci, Inst Informat Engn, State Key Lab Informat Secur, Beijing 100093, Peoples R China

[2] Univ Chinese Acad Sci, Med Sch, Beijing 100080, Peoples R China

[3] SenseTime Res, Beijing 100084, Peoples R China

[4] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China

[5] Tencent AI Lab, Shenzhen 518057, Peoples R China

来源：

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2019年 / 28卷 / 04期

基金：

中国国家自然科学基金; 国家重点研发计划;

关键词：

Video dehazing; defogging; transmission map; convolutional neural network; IMAGE; FUSION;

D O I：

10.1109/TIP.2018.2876178

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Recent research have shown the potential of using convolutional neural networks (CNNs) to accomplish single image dehazing. In this paper, we take one step further to explore the possibility of exploiting a network to perform haze removal for videos. Unlike single image dehazing, video-based approaches can take advantage of the abundant information that exists across neighboring frames. In this paper, assuming that a scene point yields highly correlated transmission values between adjacent video frames, we develop a deep learning solution for video dehazing, where a CNN is trained end-to-end to learn how to accumulate information across frames for transmission estimation. The estimated transmission map is subsequently used to recover a haze-free frame via atmospheric scattering model. In addition, as the semantic information of a scene provides a strong prior for image restoration, we propose to incorporate global semantic priors as input to regularize the transmission maps so that the estimated maps can be smooth in the regions of the same object and only discontinuous across the boundaries of different objects. To train this network, we generate a dataset consisted of synthetic hazy and haze-free videos for supervision based on the NYU depth dataset. We show that the features learned from this dataset are capable of removing haze that arises in outdoor scenes in a wide range of videos. Extensive experiments demonstrate that the proposed algorithm performs favorably against the state-of-the-art methods on both synthetic and real-world videos.

引用

页码：1895 / 1908

页数：14

共 61 条

[1] Single Image Dehazing by Multi-Scale Fusion [J].

Ancuti, Codruta Orniana ;

Ancuti, Cosmin .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2013, 22 (08) :3271-3282

[2] Effective Single Image Dehazing by Fusion [J].

Ancuti, Codruta Orniana ;

Ancuti, Cosmin ;

Bekaert, Philippe .

2010 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, 2010, :3541-3544

[3]

[Anonymous], 2014, ADV NEURAL INFORM PR

[4]

[Anonymous], 2017, ADV MULTIMEDIA INFOR

[5]

[Anonymous], 2018, LECT NOTES COMPUT SC, DOI DOI 10.1007/978-3-030-01261-8_42

[6]

[Anonymous], IEEE T NEUR IN PRESS

[7]

[Anonymous], 2017, Benchmarking Single Image Dehazing and beyond

[8]

[Anonymous], 2017, IMAGE DERAINING USIN

[9] Non-Local Image Dehazing [J].

Berman, Dana ;

Treibitz, Tali ;

Avidan, Shai .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :1674-1682

[10] Single Image Dehazing Using Color Ellipsoid Prior [J].

Bui, Trung Minh ;

Kim, Wonha .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (02) :999-1009

← 1 2 3 4 5 6 7 →