Structural inference embedded adversarial networks for scene parsing

被引:0
作者
Wang, ZeYu [1 ]
Wu, YanXia [1 ]
Bu, ShuHui [2 ,3 ]
Hang, PengCheng [2 ]
Zhang, GuoYin [1 ]
机构
[1] Harbin Engn Univ, Coll Comp Sci & Technol, Harbin, Heilongjiang, Peoples R China
[2] Northwestern Polytech Univ, Sch Aeronaut, Xian, Shaanxi, Peoples R China
[3] Shaanxi Key Lab Integrated & Intelligent Nav, Xian, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
FEATURES;
D O I
10.1371/journal.pone.0195114
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Explicit structural inference is one key point to improve the accuracy of scene parsing. Meanwhile, adversarial training method is able to reinforce spatial contiguity in output segmentations. To take both advantages of the structural learning and adversarial training simultaneously, we propose a novel deep learning network architecture called Structural Inference Embedded Adversarial Networks (SIEANs) for pixel-wise scene labeling. The generator of our SIEANs, a novel designed scene parsing network, makes full use of convolutional neural networks and long short-term memory networks to learn the global contextual information of objects in four different directions from RGB-(D) images, which is able to describe the (three-dimensional) spatial distributions of objects in a more comprehensive and accurate way. To further improve the performance, we explore the adversarial training method to optimize the generator along with a discriminator, which can not only detect and correct higher-order inconsistencies between the predicted segmentations and corresponding ground truths, but also exploit full advantages of the generator by fine-tuning its parameters so as to obtain higher consistencies. The experimental results demonstrate that our proposed SIEANs is able to achieve a better performance on PASCAL VOC 2012, SIFT FLOW, PASCAL Person-Part, Cityscapes, Stanford Background, NYUDv2, and SUNRGBD datasets compared to the most of state-of-the-art methods.
引用
收藏
页数:29
相关论文
共 50 条
[31]   Learning spatial relations and shapes for structural object description and scene recognition [J].
Clement, Michael ;
Kurtz, Camille ;
Wendling, Laurent .
PATTERN RECOGNITION, 2018, 84 :197-210
[32]   RGB-D Scene Labeling with Multimodal Recurrent Neural Networks [J].
Fan, Heng ;
Mei, Xue ;
Prokhorov, Danil ;
Ling, Haibin .
2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, :203-211
[33]   Scene Classification Using Unsupervised Neural Networks for Mobile Robot Vision [J].
Madokoro, Hirokazu ;
Utsumi, Yuya ;
Sato, Kazuhito .
2012 PROCEEDINGS OF SICE ANNUAL CONFERENCE (SICE), 2012, :1568-1573
[34]   GraphBind: protein structural context embedded rules learned by hierarchical graph neural networks for recognizing nucleic-acid-binding residues [J].
Xia, Ying ;
Xia, Chun-Qiu ;
Pan, Xiaoyong ;
Shen, Hong-Bin .
NUCLEIC ACIDS RESEARCH, 2021, 49 (09) :E51
[35]   Integrated inference and learning of neural factors in structural support vector machines [J].
Houthooft, Rein ;
De Turck, Filip .
PATTERN RECOGNITION, 2016, 59 :292-301
[36]   Deep Multiple Instance Convolutional Neural Networks for Learning Robust Scene Representations [J].
Li, Zhili ;
Xu, Kai ;
Xie, Jiafen ;
Bi, Qi ;
Qin, Kun .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2020, 58 (05) :3685-3702
[37]   Towards better exploiting convolutional neural networks for remote sensing scene classification [J].
Nogueira, Keiller ;
Penatti, Otavio A. B. ;
dos Santos, Jefersson A. .
PATTERN RECOGNITION, 2017, 61 :539-556
[38]   A literature review on remote sensing scene categorization based on convolutional neural networks [J].
Kaul, Ajay ;
Kumari, Monika .
INTERNATIONAL JOURNAL OF REMOTE SENSING, 2023, 44 (08) :2611-2642
[39]   FUSING TWO CONVOLUTIONAL NEURAL NETWORKS FOR HIGH-RESOLUTION SCENE CLASSIFICATION [J].
Bian, Xiaoyong ;
Chen, Chen ;
Sheng, Yuxia ;
Xu, Yan ;
Du, Qian .
2017 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2017, :3242-3245
[40]   Sample generation based on a supervised Wasserstein Generative Adversarial Network for high-resolution remote-sensing scene classification [J].
Han, Wei ;
Wang, Lizhe ;
Feng, Ruyi ;
Gao, Lang ;
Chen, Xiaodao ;
Deng, Ze ;
Chen, Jia ;
Liu, Peng .
INFORMATION SCIENCES, 2020, 539 :177-194