Information aggregation and fusion in deep neural networks for object interaction exploration for semantic segmentation

Cited by: 11

Authors:

Bai, Shuang [1]

Wang, Congcong [1]

Affiliations:

[1] Beijing Jiaotong Univ, Sch Elect & Informat Engn, 3 Shang Yuan Cun, Beijing, Peoples R China

Source:

KNOWLEDGE-BASED SYSTEMS | 2021 / Vol. 218

Keywords:

Semantic segmentation; Object interaction; Feature fusion; Logit aggregation; ATTENTION;

DOI:

10.1016/j.knosys.2021.106843

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Semantic segmentation is a fundamental problem in computer vision, and various approaches have been proposed to tackle it. However, how to use object interaction information to improve semantic segmentation performance has not received enough attention. In this paper, we propose an information aggregation and fusion method that explores object interaction information effectively to improve semantic segmentation performance. Specifically, we propose a logit aggregation strategy to explore object interaction information for semantic segmentation. Furthermore, so that object interaction can guide the training of the semantic segmentation model, we propose to fuse features from intermediate layers of the model to aid pixel-wise semantic label prediction. To fuse these features effectively, we present a buffered layer connection approach. The proposed method is evaluated extensively in experiments, and the results demonstrate its effectiveness. (C) 2021 Elsevier B.V. All rights reserved.


Pages: 13
