Structural inference embedded adversarial networks for scene parsing

Cited by: 0
Authors
Wang, ZeYu [1]
Wu, YanXia [1]
Bu, ShuHui [2, 3]
Hang, PengCheng [2]
Zhang, GuoYin [1]
Affiliations
[1] Harbin Engn Univ, Coll Comp Sci & Technol, Harbin, Heilongjiang, Peoples R China
[2] Northwestern Polytech Univ, Sch Aeronaut, Xian, Shaanxi, Peoples R China
[3] Shaanxi Key Lab Integrated & Intelligent Nav, Xian, Shaanxi, Peoples R China
Source
PLOS ONE | 2018 / Vol. 13 / Issue 04
Funding
National Natural Science Foundation of China;
Keywords
FEATURES;
DOI
10.1371/journal.pone.0195114
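Chinese Library Classification
O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences];
Discipline classification codes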
07 ; 0710 ; 09 ;
Abstract
Explicit structural inference is key to improving the accuracy of scene parsing, and adversarial training can reinforce spatial contiguity in output segmentations. To exploit the advantages of both structural learning and adversarial training, we propose a novel deep learning architecture called Structural Inference Embedded Adversarial Networks (SIEANs) for pixel-wise scene labeling. The generator of SIEANs, a newly designed scene parsing network, combines convolutional neural networks and long short-term memory networks to learn the global contextual information of objects in four different directions from RGB-(D) images, which allows it to describe the (three-dimensional) spatial distributions of objects more comprehensively and accurately. To further improve performance, we use adversarial training to optimize the generator together with a discriminator, which not only detects and corrects higher-order inconsistencies between the predicted segmentations and the corresponding ground truths, but also fine-tunes the generator's parameters to obtain higher consistency. Experimental results demonstrate that the proposed SIEANs achieve better performance than most state-of-the-art methods on the PASCAL VOC 2012, SIFT FLOW, PASCAL Person-Part, Cityscapes, Stanford Background, NYUDv2, and SUNRGBD datasets.
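The idea of gathering context "in four different directions" can be illustrated with a toy sketch. This is not the authors' network: the LSTM is replaced here by a plain decayed running sum, the feature map is a single-channel 2D list, and the names `directional_scan`, `four_direction_context`, and the `decay` parameter are hypothetical, introduced only for illustration. It shows how each pixel can receive information from everything to its left, right, top, and bottom.

```python
def directional_scan(grid, decay=0.5):
    """Sweep each row left to right, carrying a decayed running state.

    Stand-in for a recurrent unit (assumption: the real model uses LSTMs).
    """
    out = []
    for row in grid:
        state = 0.0
        scanned = []
        for value in row:
            state = decay * state + value
            scanned.append(state)
        out.append(scanned)
    return out


def flip_h(grid):
    """Reverse every row (mirror left/right)."""
    return [row[::-1] for row in grid]


def transpose(grid):
    """Swap rows and columns."""
    return [list(col) for col in zip(*grid)]


def four_direction_context(grid, decay=0.5):
    """Sum the four directional sweeps so every cell sees context from all sides."""
    height = len(grid)
    width = len(grid[0]) if grid else 0
    left_to_right = directional_scan(grid, decay)
    right_to_left = flip_h(directional_scan(flip_h(grid), decay))
    top_to_bottom = transpose(directional_scan(transpose(grid), decay))
    bottom_to_top = transpose(
        flip_h(directional_scan(flip_h(transpose(grid)), decay))
    )
    return [
        [
            left_to_right[i][j]
            + right_to_left[i][j]
            + top_to_bottom[i][j]
            + bottom_to_top[i][j]
            for j in range(width)
        ]
        for i in range(height)
    ]


# A single active pixel in the top-left corner spreads to its row and column.
print(four_direction_context([[1, 0], [0, 0]], decay=0.5))
# prints [[4.0, 0.5], [0.5, 0.0]]
```

In this simplification the four sweeps are summed; a learned model would more plausibly concatenate or otherwise fuse the directional features, and that choice is not specified in the abstract.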
Pages: 29
Related papers
50 in total
  • [21] Scene Classification Based on Knowledge Sharing and Latent Structural Constraints
    Fan, Yuhua
    PROCEEDINGS OF 2015 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2015), 2015, : 1356 - 1360
  • [22] Scene-Aware Deep Networks for Semantic Segmentation of Images
    Yi, Zhike
    Chang, Tao
    Li, Shuai
    Liu, Ruijun
    Zhang, Jing
    Hao, Aimin
    IEEE ACCESS, 2019, 7 : 69184 - 69193
  • [23] Scene Text Script Identification with Convolutional Recurrent Neural Networks
    Mei, Jieru
    Dai, Luo
    Shi, Baoguang
    Bai, Xiang
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 4053 - 4058
  • [24] Multitask Adversarial Networks Based on Extensive Nonlinear Spiking Neuron Models
    Fu, Jun
    Peng, Hong
    Li, Bing
    Liu, Zhicai
    Lugu, Rikong
    Wang, Jun
    Ramirez-de-Arellano, Antonio
    INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2024, 34 (06)
  • [25] Road Detection From Remote Sensing Images by Generative Adversarial Networks
    Shi, Qian
    Liu, Xiaoping
    Li, Xia
    IEEE ACCESS, 2018, 6 : 25486 - 25494
  • [26] Classification of Optical Coherence Tomography Images Using Generative Adversarial Networks
    Aghaei, S. M. H. Seyed
    Rashno, A.
    Fadaei, S.
    INTERNATIONAL JOURNAL OF ENGINEERING, 2025, 38 (02) : 389 - 399
  • [27] A Multifaceted Deep Generative Adversarial Networks Model for Mobile Malware Detection
    Alotaibi, Fahad Mazaed
    Fawad
    APPLIED SCIENCES-BASEL, 2022, 12 (19):
  • [28] GAN-GL: Generative Adversarial Networks for Glacial Lake Mapping
    Zhao, Hang
    Zhang, Meimei
    Chen, Fang
    REMOTE SENSING, 2021, 13 (22)
  • [29] Learning spatial relations and shapes for structural object description and scene recognition
    Clement, Michael
    Kurtz, Camille
    Wendling, Laurent
    PATTERN RECOGNITION, 2018, 84 : 197 - 210
  • [30] RGB-D Scene Labeling with Multimodal Recurrent Neural Networks
    Fan, Heng
    Mei, Xue
    Prokhorov, Danil
    Ling, Haibin
    2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, : 203 - 211