MVTec D2S: Densely Segmented Supermarket Dataset

被引:43
作者
Follmann, Patrick [1 ,2 ]
Boettger, Tobias [1 ,2 ]
Haertinger, Philipp [1 ]
Koenig, Rebecca [1 ]
Ulrich, Markus [1 ]
机构
[1] MVTec Software GmbH, D-80634 Munich, Germany
[2] Tech Univ Munich, D-80333 Munich, Germany
来源
COMPUTER VISION - ECCV 2018, PT X | 2018年 / 11214卷
关键词
Instance segmentation dataset; Industrial application;
D O I
10.1007/978-3-030-01249-6_35
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We introduce the Densely Segmented Supermarket (D2S) dataset, a novel benchmark for instance-aware semantic segmentation in an industrial domain. It contains 21 000 high-resolution images with pixel-wise labels of all object instances. The objects comprise groceries and everyday products from 60 categories. The benchmark is designed such that it resembles the real-world setting of an automatic checkout, inventory, or warehouse system. The training images only contain objects of a single class on a homogeneous background, while the validation and test sets are much more complex and diverse. To further benchmark the robustness of instance segmentation methods, the scenes are acquired with different lightings, rotations, and backgrounds. We ensure that there are no ambiguities in the labels and that every instance is labeled comprehensively. The annotations are pixel-precise and allow using crops of single instances for articial data augmentation. The dataset covers several challenges highly relevant in the field, such as a limited amount of training data and a high diversity in the test and validation sets. The evaluation of state-of-the-art object detection and instance segmentation methods on D2S reveals significant room for improvement.
引用
收藏
页码:581 / 597
页数:17
相关论文
共 36 条
  • [1] Abu-El-Haija S., 2016, CORR
  • [2] [Anonymous], 2002, P AS C COMP VIS
  • [3] [Anonymous], 2016, CORR
  • [4] [Anonymous], 2016, CORR
  • [5] [Anonymous], 2016, CORR
  • [6] The Cityscapes Dataset for Semantic Urban Scene Understanding
    Cordts, Marius
    Omran, Mohamed
    Ramos, Sebastian
    Rehfeld, Timo
    Enzweiler, Markus
    Benenson, Rodrigo
    Franke, Uwe
    Roth, Stefan
    Schiele, Bernt
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3213 - 3223
  • [7] The PASCAL Visual Object Classes Challenge: A Retrospective
    Everingham, Mark
    Eslami, S. M. Ali
    Van Gool, Luc
    Williams, Christopher K. I.
    Winn, John
    Zisserman, Andrew
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 111 (01) : 98 - 136
  • [8] A Rotationally-Invariant Convolution Module by Feature Map Back-Rotation
    Follmann, Patrick
    Boettger, Tobias
    [J]. 2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, : 784 - 792
  • [9] Vision meets robotics: The KITTI dataset
    Geiger, A.
    Lenz, P.
    Stiller, C.
    Urtasun, R.
    [J]. INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2013, 32 (11) : 1231 - 1237
  • [10] George M, 2014, LECT NOTES COMPUT SC, V8690, P440, DOI 10.1007/978-3-319-10605-2_29