Structural inference embedded adversarial networks for scene parsing

被引:0
作者
Wang, ZeYu [1 ]
Wu, YanXia [1 ]
Bu, ShuHui [2 ,3 ]
Hang, PengCheng [2 ]
Zhang, GuoYin [1 ]
机构
[1] Harbin Engn Univ, Coll Comp Sci & Technol, Harbin, Heilongjiang, Peoples R China
[2] Northwestern Polytech Univ, Sch Aeronaut, Xian, Shaanxi, Peoples R China
[3] Shaanxi Key Lab Integrated & Intelligent Nav, Xian, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
FEATURES;
D O I
10.1371/journal.pone.0195114
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Explicit structural inference is one key point to improve the accuracy of scene parsing. Meanwhile, adversarial training method is able to reinforce spatial contiguity in output segmentations. To take both advantages of the structural learning and adversarial training simultaneously, we propose a novel deep learning network architecture called Structural Inference Embedded Adversarial Networks (SIEANs) for pixel-wise scene labeling. The generator of our SIEANs, a novel designed scene parsing network, makes full use of convolutional neural networks and long short-term memory networks to learn the global contextual information of objects in four different directions from RGB-(D) images, which is able to describe the (three-dimensional) spatial distributions of objects in a more comprehensive and accurate way. To further improve the performance, we explore the adversarial training method to optimize the generator along with a discriminator, which can not only detect and correct higher-order inconsistencies between the predicted segmentations and corresponding ground truths, but also exploit full advantages of the generator by fine-tuning its parameters so as to obtain higher consistencies. The experimental results demonstrate that our proposed SIEANs is able to achieve a better performance on PASCAL VOC 2012, SIFT FLOW, PASCAL Person-Part, Cityscapes, Stanford Background, NYUDv2, and SUNRGBD datasets compared to the most of state-of-the-art methods.
引用
收藏
页数:29
相关论文
共 50 条
[41]   Flexible Android Malware Detection Model based on Generative Adversarial Networks with Code Tensor [J].
Yang, Zhao ;
Deng, Fengyang ;
Han, Linxi .
2022 INTERNATIONAL CONFERENCE ON CYBER-ENABLED DISTRIBUTED COMPUTING AND KNOWLEDGE DISCOVERY, CYBERC, 2022, :19-28
[42]   DenseUnetGAN: A Hybrid Approach for Efficient Brain Tumor Classification Using Generative Adversarial Networks [J].
Kuppusamy, Radhakrishnan ;
Sundarabai, Leena Jasmine John .
TRAITEMENT DU SIGNAL, 2025, 42 (02) :751-760
[43]   Large patch convolutional neural networks for the scene classification of high spatial resolution imagery [J].
Zhong, Yanfei ;
Fe, Feng ;
Zhang, Liangpei .
JOURNAL OF APPLIED REMOTE SENSING, 2016, 10
[44]   Aerial Scene Classification via Multilevel Fusion Based on Deep Convolutional Neural Networks [J].
Yu, Yunlong ;
Liu, Fuxian .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2018, 15 (02) :287-291
[45]   Improving patch-based scene text script identification with ensembles of conjoined networks [J].
Gomez, Lluis ;
Nicolaou, Anguelos ;
Karatzas, Dimosthenis .
PATTERN RECOGNITION, 2017, 67 :85-96
[46]   Training Small Networks for Scene Classification of Remote Sensing Images via Knowledge Distillation [J].
Chen, Guanzhou ;
Zhang, Xiaodong ;
Tan, Xiaoliang ;
Cheng, Yufeng ;
Dai, Fan ;
Zhu, Kun ;
Gong, Yuanfu ;
Wang, Qing .
REMOTE SENSING, 2018, 10 (05)
[47]   Unsupervised Learning for Cell-Level Visual Representation in Histopathology Images With Generative Adversarial Networks [J].
Hu, Bo ;
Tang, Ye ;
Chang, Eric I-Chao ;
Fan, Yubo ;
Lai, Maode ;
Xu, Yan .
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2019, 23 (03) :1316-1328
[48]   Spatial up-sampling of HRTF sets using generative adversarial networks: A pilot study [J].
Siripornpitak, Pongsakorn ;
Engel, Isaac ;
Squires, Isaac ;
Cooper, Samuel J. ;
Picinali, Lorenzo .
FRONTIERS IN SIGNAL PROCESSING, 2022, 2
[49]   An Approach of Transferring Pre-trained Deep Convolutional Neural Networks for Aerial Scene Classification [J].
Devi, Nilakshi ;
Borah, Bhogeswar .
PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2019, PT I, 2019, 11941 :551-558
[50]   Binary patterns encoded convolutional neural networks for texture recognition and remote sensing scene classification [J].
Anwer, Rao Muhammad ;
Khan, Fahad Shahbaz ;
van de Weijer, Joost ;
Molinier, Matthieu ;
Laaksonen, Jorma .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2018, 138 :74-85