Lookahead adversarial learning for near real-time semantic segmentation

被引:4
作者
Jamali-Rad, Hadi [1 ,2 ]
Szabo, Attila [1 ,3 ]
机构
[1] Shell Technol Ctr Amsterdam STCA, Grasweg 31, NL-1031 HW Amsterdam, Netherlands
[2] Delft Univ Technol, TU Delft, NL-2628 XE Delft, Netherlands
[3] Univ Amsterdam, NL-1012 WX Amsterdam, Netherlands
关键词
Semantic segmentation; Conditional adversarial training; Computer vision; Deep learning;
D O I
10.1016/j.cviu.2021.103271
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Semantic segmentation is one of the most fundamental problems in computer vision with significant impact on a wide variety of applications. Adversarial learning is shown to be an effective approach for improving semantic segmentation quality by enforcing higher-level pixel correlations and structural information. However, state-of-the-art semantic segmentation models cannot be easily plugged into an adversarial setting because they are not designed to accommodate convergence and stability issues in adversarial networks. We bridge this gap by building a conditional adversarial network with a state-of-the-art segmentation model (DeepLabv3+) at its core. To battle the stability issues, we introduce a novel lookahead adversarial learning (LoAd) approach with an embedded label map aggregation module. We focus on semantic segmentation models that run fast at inference for near real-time field applications. Through extensive experimentation, we demonstrate that the proposed solution can alleviate divergence issues in an adversarial semantic segmentation setting and results in considerable performance improvements (+5% in some classes) on the baseline for three standard datasets.
引用
收藏
页数:17
相关论文
共 54 条
[1]  
[Anonymous], 2019, IEEE T MED IMAGING, DOI DOI 10.1109/TMI.2018.2867261
[2]  
[Anonymous], 2018, P EUROPEAN C COMPUTE
[3]  
[Anonymous], 2014, 27THINT C NEURAL INF
[4]  
Arjovsky M, 2017, PR MACH LEARN RES, V70
[5]   Semantic-Aware Generative Adversarial Nets for Unsupervised Domain Adaptation in Chest X-Ray Segmentation [J].
Chen, Cheng ;
Dou, Qi ;
Chen, Hao ;
Heng, Pheng-Ann .
MACHINE LEARNING IN MEDICAL IMAGING: 9TH INTERNATIONAL WORKSHOP, MLMI 2018, 2018, 11046 :143-151
[6]   Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].
Chen, Liang-Chieh ;
Zhu, Yukun ;
Papandreou, George ;
Schroff, Florian ;
Adam, Hartwig .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851
[7]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[8]   SEMEDA: Enhancing segmentation precision with semantic edge aware loss [J].
Chen, Yifu ;
Dapogny, Arnaud ;
Cord, Matthieu .
PATTERN RECOGNITION, 2020, 108
[9]   Panoptic-DeepLab: A Simple, Strong, and Fast Baseline for Bottom-Up Panoptic Segmentation [J].
Cheng, Bowen ;
Collins, Maxwell D. ;
Zhu, Yukun ;
Liu, Ting ;
Huang, Thomas S. ;
Adam, Hartwig ;
Chen, Liang-Chieh .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :12472-12482
[10]   Xception: Deep Learning with Depthwise Separable Convolutions [J].
Chollet, Francois .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1800-1807