Soft-Boundary Label Relaxation with class placement constraints for semantic segmentation of the railway environment

Cited by: 8

Authors:

Furitsu, Yuki [1]

Deguchi, Daisuke [1]

Kawanishi, Yasutomo [1]

Ide, Ichiro [1]

Murase, Hiroshi [1]

Mukojima, Hiroki [2]

Nagamine, Nozomi [2]

Affiliations:

[1] Nagoya Univ, Chikusa Ku, Furo Cho, Nagoya, Aichi 4648601, Japan

[2] Railway Tech Res Inst, 2-8-38 Hikari Cho, Kokubunji, Tokyo 1858540, Japan

Source:

Pattern Recognition Letters | 2021 / Vol. 150

Keywords:

Semantic segmentation; Railway; Label relaxation;

DOI:

10.1016/j.patrec.2021.07.014

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In this paper, we focus on the challenging task of the semantic segmentation of train front-view images. Managing trackside facilities can be done by using detailed and precise information about the surrounding railway environment. Semantic segmentation enables us to understand the 2D environment, but there is no adequate large-scale dataset available for training a CNN for this purpose. Some attempts have been made to generate pseudo-data from unlabeled sequential frames to compensate for the lack of volume in training data, but the moving speed of trains makes it difficult to apply them directly. We aim to solve this problem by proposing the Soft Boundary Label Relaxation (Soft-BLR) method, which considers label boundaries extending over multiple pixels to cope with more severely distorted pseudo-data and to better train the CNN in the initial training stage. Furthermore, we modify the loss function to penalize inference results based on the distance from the label boundary to solve the misalignment problems of border pixels. Through experimental evaluation, we report that the proposed method outperforms previous methods on not only the semantic segmentation of challenging railway images, but also that of general street-view images. (c) 2021 Elsevier B.V. All rights reserved.
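The abstract does not give the exact form of the Soft-BLR loss, so the following is only a minimal sketch of the general boundary label relaxation idea it builds on: near a label boundary, a pixel is allowed to match any class found in a small window around it, and the loss is the negative log of the summed probability over those classes. The function names, the square window, and the `radius` parameter are assumptions made for illustration. This is a hard-union variant, not the authors' distance-weighted formulation.

```python
import math

def relaxed_label_sets(labels, radius):
    """For each pixel, collect the classes found within `radius` pixels (square window)."""
    h, w = len(labels), len(labels[0])
    sets = []
    for y in range(h):
        row = []
        for x in range(w):
            classes = set()
            for ny in range(max(0, y - radius), min(h, y + radius + 1)):
                for nx in range(max(0, x - radius), min(w, x + radius + 1)):
                    classes.add(labels[ny][nx])
            row.append(classes)
        sets.append(row)
    return sets

def relaxed_nll(probs, labels, radius=1, eps=1e-12):
    """Mean of -log(sum of predicted probabilities over each pixel's relaxed class set)."""
    sets = relaxed_label_sets(labels, radius)
    total, count = 0.0, 0
    for y, row in enumerate(sets):
        for x, classes in enumerate(row):
            p = sum(probs[y][x][c] for c in classes)
            total += -math.log(max(p, eps))  # eps guards against log(0)
            count += 1
    return total / count

# Toy 1x4 label row with a boundary between class 0 and class 1.
labels = [[0, 0, 1, 1]]
# A prediction that places the boundary one pixel too far to the right.
probs = [[[0.9, 0.1], [0.9, 0.1], [0.9, 0.1], [0.1, 0.9]]]

strict = relaxed_nll(probs, labels, radius=0)   # radius 0 reduces to ordinary cross-entropy
relaxed = relaxed_nll(probs, labels, radius=1)  # pixels next to the boundary accept either class
print(strict, relaxed)
```

In this toy case the relaxed loss is lower than the strict one, because the single misaligned boundary pixel is no longer penalized. A larger `radius` tolerates larger misalignment, which is the kind of tolerance the abstract motivates for heavily distorted pseudo-labels.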


Pages: 258-264

Page count: 7
