Category-Guided Graph Convolution Network for Semantic Segmentation

被引:4
作者
Xu, Zeyuan [1 ]
Yang, Zhe [2 ]
Wang, Danwei [3 ]
Wu, Zhe [1 ]
机构
[1] Natl Univ Singapore, Dept Chem & Biomol Engn, Singapore 117585, Singapore
[2] Phigent Robot, Hangzhou 311121, Peoples R China
[3] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
来源
IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING | 2024年 / 11卷 / 06期
关键词
Convolution; Semantics; Semantic segmentation; Accuracy; Tensors; Context modeling; Task analysis; pixel; category; graph convolution network; strong (or weak) relationship; REPRESENTATION; IMAGE; CONTEXT;
D O I
10.1109/TNSE.2024.3448609
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Contextual information has been widely used to improve results of semantic segmentation. However, most approaches investigate contextual dependencies through self-attention and lack guidance on which pixels should have strong (or weak) relationships. In this paper, a category-guided graph convolution network (CGGCN) is proposed to reveal the relationships among pixels. First, we train a coarse segmentation map under the supervision of the ground truth and use it to construct an adjacency matrix among pixels. It turns out that the pixels belonging to the same category have strong connections, and those belonging to different categories have weak connections. Second, a GCN is exploited to enhance the representation of pixels by aggregating contextual information among pixels. The feature of each pixel is represented by node, and the relationship among pixels is denoted by edge. Subsequently, we design four different kinds of network structures by leveraging the CGGCN module and determine the most accurate segmentation result by comparing them. Finally, we reimplement the CGGCN module to refine the final prediction from coarse to fine. The results of extensive evaluations demonstrate that the proposed approach is superior to the existing semantic segmentation approaches and has better convergence.
引用
收藏
页码:6080 / 6089
页数:10
相关论文
共 60 条
[1]   Region-Based Semantic Segmentation with End-to-End Training [J].
Caesar, Holger ;
Uijlings, Jasper ;
Ferrari, Vittorio .
COMPUTER VISION - ECCV 2016, PT I, 2016, 9905 :381-397
[2]  
Chen LC, 2017, Arxiv, DOI arXiv:1706.05587
[3]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[4]  
Chen YP, 2018, ADV NEUR IN, V31
[5]  
Cheng B, 2021, ADV NEUR IN, V34
[6]   The Cityscapes Dataset for Semantic Urban Scene Understanding [J].
Cordts, Marius ;
Omran, Mohamed ;
Ramos, Sebastian ;
Rehfeld, Timo ;
Enzweiler, Markus ;
Benenson, Rodrigo ;
Franke, Uwe ;
Roth, Stefan ;
Schiele, Bernt .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3213-3223
[7]   Interaction via Bi-directional Graph of Semantic Region Affinity for Scene Parsing [J].
Ding, Henghui ;
Zhang, Hui ;
Liu, Jun ;
Li, Jiaxin ;
Feng, Zijian ;
Jiang, Xudong .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :15828-15838
[8]   Semantic Correlation Promoted Shape-Variant Context for Segmentation [J].
Ding, Henghui ;
Jiang, Xudong ;
Shuai, Bing ;
Liu, Ai Qun ;
Wang, Gang .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :8877-8886
[9]   Context Contrasted Feature and Gated Multi-scale Aggregation for Scene Segmentation [J].
Ding, Henghui ;
Jiang, Xudong ;
Shuai, Bing ;
Liu, Ai Qun ;
Wang, Gang .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :2393-2402
[10]   Learning to Refine Human Pose Estimation [J].
Fieraru, Mihai ;
Khoreva, Anna ;
Pishchulin, Leonid ;
Schiele, Bernt .
PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, :318-327