Density-Aware Haze Image Synthesis by Self-Supervised Content-Style Disentanglement

被引：16

作者：

Zhang, Chi ^{[1
]}

Lin, Zihang ^{[1
]}

Xu, Liheng ^{[1
]}

Li, Zongliang ^{[1
]}

Tang, Wei ^{[2
]}

Liu, Yuehu ^{[1
]}

Meng, Gaofeng ^{[3
,4
,5
]}

Wang, Le ^{[1
]}

Li, Li ^{[6
]}

机构：

[1] Xi An Jiao Tong Univ, Inst Artificial Intelligence & Robot, Xian 710049, Shaanxi, Peoples R China

[2] Univ Illinois, Dept Comp Sci, Chicago, IL 60607 USA

[3] Chinese Acad Sci, Natl Lab Pattern Recognit, Inst Automat, Beijing 100190, Peoples R China

[4] Chinese Acad Sci, Ctr Artificial Intelligence & Robot, HK Inst Sci & Innovat, Beijing 100190, Peoples R China

[5] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China

[6] Tsinghua Univ, Dept Automat, BNRist, Beijing 100084, Peoples R China

来源：

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2022年 / 32卷 / 07期

基金：

中国国家自然科学基金;

关键词：

Feature extraction; Image synthesis; Scattering; Generative adversarial networks; Atmospheric modeling; Training; Testing; Haze synthesis; unsupervised image-to-image translation; self-supervised disentanglement;

D O I：

10.1109/TCSVT.2021.3130158

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

The key procedure of haze image synthesis with adversarial training lies in the disentanglement of the feature involved only in haze synthesis, i.e., the style feature, from the feature representing the invariant semantic content, i.e., the content feature. Previous methods introduced a binary classifier to constrain the domain membership from being distinguished through the learned content feature during the training stage, thereby the style information is separated from the content feature. However, we find that these methods cannot achieve complete content-style disentanglement. The entanglement of the flawed style feature with content information inevitably leads to the inferior rendering of haze images. To address this issue, we propose a self-supervised style regression model with stochastic linear interpolation that can suppress the content information in the style feature. Ablative experiments demonstrate the disentangling completeness and its superiority in density-aware haze image synthesis. Moreover, the synthesized haze data are applied to test the generalization ability of vehicle detectors. Further study on the relation between haze density and detection performance shows that haze has an obvious impact on the generalization ability of vehicle detectors and that the degree of performance degradation is linearly correlated to the haze density, which in turn validates the effectiveness of the proposed method.

引用

页码：4552 / 4572

页数：21

共 47 条

[1]

[Anonymous], 2018, CoRR, abs/1812.02230

[2]

Chen X, 2016, ADV NEUR IN, V29

[3] StarGAN: Unified Generative Adversarial Networks for Multi-Domain Image-to-Image Translation [J].

Choi, Yunjey ;

Choi, Minje ;

Kim, Munyoung ;

Ha, Jung-Woo ;

Kim, Sunghun ;

Choo, Jaegul .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :8789-8797

[4]

Denton E. L., 2017, ADV NEURAL INFORM PR, P4414

[5] Rain Streak Removal From Light Field Images [J].

Ding, Yuyang ;

Li, Mingyue ;

Yan, Tao ;

Zhang, Fan ;

Liu, Yuan ;

Lau, Rynson W. H. .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (02) :467-482

[6] Age Factor Removal Network Based on Transfer Learning and Adversarial Learning for Cross-Age Face Recognition [J].

Du, Lingshuang ;

Hu, Haifeng ;

Wu, Yongbo .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (09) :2830-2842

[7] Cycle-Dehaze: Enhanced CycleGAN for Single Image Dehazing [J].

Engin, Deniz ;

Genc, Anil ;

Ekenel, Hazim Kemal .

PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, :938-946

[8] Occluded Face Recognition in the Wild by Identity-Diversity Inpainting [J].

Ge, Shiming ;

Li, Chenyu ;

Zhao, Shengwei ;

Zeng, Dan .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (10) :3387-3397

[9] Vision meets robotics: The KITTI dataset [J].

Geiger, A. ;

Lenz, P. ;

Stiller, C. ;

Urtasun, R. .

INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2013, 32 (11) :1231-1237

[10]

Goodfellow IJ, 2014, ADV NEUR IN, V27, P2672

← 1 2 3 4 5 →