Soft-Boundary Label Relaxation with class placement constraints for semantic segmentation of the railway environment

被引:8
作者
Furitsu, Yuki [1 ]
Deguchi, Daisuke [1 ]
Kawanishi, Yasutomo [1 ]
Ide, Ichiro [1 ]
Murase, Hiroshi [1 ]
Mukojima, Hiroki [2 ]
Nagamine, Nozomi [2 ]
机构
[1] Nagoya Univ, Chikusa Ku, Furo Cho, Nagoya, Aichi 4648601, Japan
[2] Railway Tech Res Inst, 2-8-38 Hikari Cho, Kokubunji, Tokyo 1858540, Japan
关键词
Semantic segmentation; Railway; Label relaxation;
D O I
10.1016/j.patrec.2021.07.014
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we focus on the challenging task of the semantic segmentation of train front-view images. Managing trackside facilities can be done by using detailed and precise information about the surrounding railway environment. Semantic segmentation enables us to understand the 2D environment, but there is no adequate large-scale dataset available for training a CNN for this purpose. Some attempts have been made to generate pseudo-data from unlabeled sequential frames to compensate for the lack of volume in training data, but the moving speed of trains makes it difficult to apply them directly. We aim to solve this problem by proposing the Soft Boundary Label Relaxation (Soft-BLR) method, which considers label boundaries extending over multiple pixels to cope with more severely distorted pseudo-data and to better train the CNN in the initial training stage. Furthermore, we modify the loss function to penalize inference results based on the distance from the label boundary to solve the misalignment problems of border pixels. Through experimental evaluation, we report that the proposed method outperforms previous methods on not only the semantic segmentation of challenging railway images, but also that of general street-view images. (c) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页码:258 / 264
页数:7
相关论文
共 18 条
  • [1] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
    Badrinarayanan, Vijay
    Kendall, Alex
    Cipolla, Roberto
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2481 - 2495
  • [2] Label Propagation in Video Sequences
    Badrinarayanan, Vijay
    Galasso, Fabio
    Cipolla, Roberto
    [J]. 2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 3265 - 3272
  • [3] Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation
    Chen, Liang-Chieh
    Zhu, Yukun
    Papandreou, George
    Schroff, Florian
    Adam, Hartwig
    [J]. COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 : 833 - 851
  • [4] The Cityscapes Dataset for Semantic Urban Scene Understanding
    Cordts, Marius
    Omran, Mohamed
    Ramos, Sebastian
    Rehfeld, Timo
    Enzweiler, Markus
    Benenson, Rodrigo
    Franke, Uwe
    Roth, Stefan
    Schiele, Bernt
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3213 - 3223
  • [5] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
  • [6] The PASCAL Visual Object Classes Challenge: A Retrospective
    Everingham, Mark
    Eslami, S. M. Ali
    Van Gool, Luc
    Williams, Christopher K. I.
    Winn, John
    Zisserman, Andrew
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 111 (01) : 98 - 136
  • [7] Furitsu Y, 2019, PATTERN RECOGN, P639
  • [8] Semantic Video CNNs through Representation Warping
    Gadde, Raghudeep
    Jampani, Varun
    Gehler, Peter V.
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 4463 - 4472
  • [9] FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks
    Ilg, Eddy
    Mayer, Nikolaus
    Saikia, Tonmoy
    Keuper, Margret
    Dosovitskiy, Alexey
    Brox, Thomas
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1647 - 1655
  • [10] Bidirectional Learning for Domain Adaptation of Semantic Segmentation
    Li, Yunsheng
    Yuan, Lu
    Vasconcelos, Nuno
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 6929 - 6938