Graph-FCN for Image Semantic Segmentation

被引:122
作者
Lu, Yi [1 ,2 ]
Chen, Yaran [1 ,2 ]
Zhao, Dongbin [1 ,2 ]
Chen, Jianxin [3 ]
机构
[1] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 101408, Peoples R China
[3] Beijing Univ Chinese Med, Beijing 100029, Peoples R China
来源
ADVANCES IN NEURAL NETWORKS - ISNN 2019, PT I | 2019年 / 11554卷
关键词
Graph neural network; Graph convolutional network; Semantic segmentation;
D O I
10.1007/978-3-030-22796-8_11
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Semantic segmentation with deep learning has achieved great progress in classifying the pixels in the image. However, the local location information is usually ignored in the high-level feature extraction by the deep learning, which is important for image semantic segmentation. To avoid this problem, we propose a graph model initialized by a fully convolutional network (FCN) named Graph-FCN for image semantic segmentation. Firstly, the image grid data is extended to graph structure data by a convolutional network, which transforms the semantic segmentation problem into a graph node classification problem. Then we apply graph convolutional network to solve this graph node classification problem. As far as we know, it is the first time that we apply the graph convolutional network in image semantic segmentation. Our method achieves competitive performance in mean intersection over union (mIOU) on the VOC dataset (about 1.34% improvement), compared to the original FCN model.
引用
收藏
页码:97 / 105
页数:9
相关论文
共 20 条
[1]  
[Anonymous], 2015, INT C LEARN REPR SAN
[2]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[3]  
Chen L.-C., 2017, C COMP VIS PATT REC
[4]   Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].
Chen, Liang-Chieh ;
Zhu, Yukun ;
Papandreou, George ;
Schroff, Florian ;
Adam, Hartwig .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851
[5]   Multi-task learning for dangerous object detection in autonomous driving [J].
Chen, Yaran ;
Zhao, Dongbin ;
Lv, Le ;
Zhang, Qichao .
INFORMATION SCIENCES, 2018, 432 :559-571
[6]   Multi-task Learning with Cartesian Product-Based Multi-objective Combination for Dangerous Object Detection [J].
Chen, Yaran ;
Zhao, Dongbin .
ADVANCES IN NEURAL NETWORKS, PT I, 2017, 10261 :28-35
[7]  
Chen YR, 2016, PROCEEDINGS OF THE 2016 12TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), P764, DOI 10.1109/WCICA.2016.7578651
[8]  
Defferrard M., 2016, P ADV NEUR INF PROC, P3844
[9]  
Gilmer J, 2017, PR MACH LEARN RES, V70
[10]  
Gori M, 2005, IEEE IJCNN, P729