ScaleNet: Scale Invariant Network for Semantic Segmentation in Urban Driving Scenes

被引:1
作者
Ansari, Mohammad Dawud [1 ,2 ]
Krauss, Stephan [1 ]
Wasenmueller, Oliver [1 ]
Stricker, Didier [1 ,2 ]
机构
[1] German Res Ctr Artificial Intelligence DFKI, Kaiserslautern, Germany
[2] Univ Kaiserslautern, Kaiserslautern, Germany
来源
PROCEEDINGS OF THE 13TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISIGRAPP 2018), VOL 5: VISAPP | 2018年
关键词
Semantic Segmentation; Autonomous Driving; Labeling; Automotive; Scale;
D O I
10.5220/0006723003990404
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The scale difference in driving scenarios is one of the essential challenges in semantic scene segmentation. Close objects cover significantly more pixels than far objects. In this paper, we address this challenge with a scale invariant architecture. Within this architecture, we explicitly estimate the depth and adapt the pooling field size accordingly. Our model is compact and can be extended easily to other research domains. Finally, the accuracy of our approach is comparable to the state-of-the-art and superior for scale problems. We evaluate on the widely used automotive dataset Cityscapes as well as a self-recorded dataset.
引用
收藏
页码:399 / 404
页数:6
相关论文
共 25 条
  • [1] [Anonymous], 2016, COMPUTER VISION PATT
  • [2] [Anonymous], 1990, WAVELETS, DOI DOI 10.1007/978-3-642-75988-8_28
  • [3] [Anonymous], 2015, COMPUTER VISION PATT
  • [4] [Anonymous], 2016, AS C COMP VIS WORKSH
  • [5] [Anonymous], 2016, PROC CVPR IEEE, DOI DOI 10.1109/CVPR.2016.348
  • [6] [Anonymous], 2016, RefineNet: Multi-path refinement networks for high-resolution semantic segmentation
  • [7] [Anonymous], 2016, ARXIV160600915
  • [8] [Anonymous], 2016, ARXIV161107709
  • [9] [Anonymous], 2000, A Multigrid Tutorial
  • [10] [Anonymous], 2016, ARXIV