Multi-level graph convolutional recurrent neural network for semantic image segmentation

被引:7
|
作者
Jiang, Dingchao [1 ]
Qu, Hua [1 ]
Zhao, Jihong [2 ]
Zhao, Jianlong [3 ]
Liang, Wei [4 ]
机构
[1] Xi An Jiao Tong Univ, Sch Elect & Informat Engn, Xian, Peoples R China
[2] Xian Univ Posts & Telecommun, Sch Telecommun & Informat Engn, Xian, Peoples R China
[3] Xi An Jiao Tong Univ, Sch Software Engn, Xian, Peoples R China
[4] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha, Peoples R China
关键词
Deep learning; Semantic image segmentation; Graph convolutional recurrent neural network; Multi-level features;
D O I
10.1007/s11235-021-00769-y
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
With the advent of the Internet of Things (IoT) era, many devices have surfaced that capture and generate various visual data. To recognize and extract a meaningful pattern from these visual data, powerful methods are required for different IoT applications. Fortunately, deep convolutional neural networks (CNNs) significantly improve the performance of almost all tasks in computer vision, including semantic image segmentation. However, the feature extraction of CNNs may cause the loss of contextual and spatial information. Moreover, the standard convolutional and pooling layers adopted by most CNN architectures lead to a fixed receptive field, which makes it challenging to deal with multi-scale objects in the image. To remedy these issues of CNNs for semantic image segmentation, this paper proposes a multi-level graph convolutional recurrent neural network (MGCRNN) to combine CNNs and graph neural networks (GNNs) for fusing multi-level features. By applying graph convolutional recurrent neural network (GCRNN), the proposed model acquires a global view of the image and aggregates multi-level contextual and structural information. The experiments verify the ability of GCRNN to obtain a flexible receptive field and learn structure features without losing spatial information. Results of these experiments conducted on the Pascal VOC 2012 and Cityscapes datasets show that the proposed model outperforms baseline approaches and can be competitive with state-of-the-art methods
引用
收藏
页码:563 / 576
页数:14
相关论文
共 50 条
  • [11] Study on semantic image segmentation based on convolutional neural network
    Li, Lin-Hui
    Qian, Bo
    Lian, Jing
    Zheng, Wei-Na
    Zhou, Ya-Fu
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2017, 33 (06) : 3397 - 3404
  • [12] Automatic Retinal Blood Vessel Segmentation Based on Multi-level Convolutional Neural Network
    Guo, Jinnan
    Ren, Shiwei
    Shi, Yueting
    Wang, Haoyu
    2018 11TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2018), 2018,
  • [13] Combining Deep Semantic Segmentation Network and Graph Convolutional Neural Network for Semantic Segmentation of Remote Sensing Imagery
    Ouyang, Song
    Li, Yansheng
    REMOTE SENSING, 2021, 13 (01) : 1 - 22
  • [14] MLSE-Net: Multi-level Semantic Enriched Network for Medical Image Segmentation
    Gai, Di
    Luo, Heng
    He, Jing
    Su, Pengxiang
    Huang, Zheng
    Zhang, Song
    Tu, Zhijun
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2023, 17 (09): : 2458 - 2482
  • [15] Multi-query and multi-level enhanced network for semantic segmentation
    Xie, Bin
    Cao, Jiale
    Anwer, Rao Muhammad
    Xie, Jin
    Nie, Jing
    Yang, Aiping
    Pang, Yanwei
    PATTERN RECOGNITION, 2024, 156
  • [16] Multi-level adversarial network for domain adaptive semantic segmentation
    Huang, Jiaxing
    Guan, Dayan
    Xiao, Aoran
    Lu, Shijian
    PATTERN RECOGNITION, 2022, 123
  • [17] Hierarchical multi-level dynamic hyperparameter deformable image registration with convolutional neural network
    Zhu, Zhenyu
    Li, Qianqian
    Wei, Ying
    Song, Rui
    PHYSICS IN MEDICINE AND BIOLOGY, 2024, 69 (17):
  • [18] Multi-level region-based Convolutional Neural Network for image emotion classification
    Rao, Tianrong
    Li, Xiaoxu
    Zhang, Haimin
    Xu, Min
    NEUROCOMPUTING, 2019, 333 : 429 - 439
  • [19] Multi-level and cross-domain search engine based on graph convolutional neural network
    Li, Qiang
    Zhuang, Li
    Wang, Qiulin
    Zhang, Xiaodong
    Chen, Jianghai
    INTERNATIONAL JOURNAL OF LOW-CARBON TECHNOLOGIES, 2024, 19 : 1215 - 1221
  • [20] MCNet: Multi-level Correction Network for thermal image semantic segmentation of nighttime driving scene
    Xiong, Haitao
    Cai, Wenjie
    Liu, Qiong
    INFRARED PHYSICS & TECHNOLOGY, 2021, 113