MVTec D2S: Densely Segmented Supermarket Dataset

被引：46

作者：

Follmann, Patrick ^{[1
,2
]}

Boettger, Tobias ^{[1
,2
]}

Haertinger, Philipp ^{[1
]}

Koenig, Rebecca ^{[1
]}

Ulrich, Markus ^{[1
]}

机构：

[1] MVTec Software GmbH, D-80634 Munich, Germany

[2] Tech Univ Munich, D-80333 Munich, Germany

来源：

COMPUTER VISION - ECCV 2018, PT X | 2018年 / 11214卷

关键词：

Instance segmentation dataset; Industrial application;

D O I：

10.1007/978-3-030-01249-6_35

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We introduce the Densely Segmented Supermarket (D2S) dataset, a novel benchmark for instance-aware semantic segmentation in an industrial domain. It contains 21 000 high-resolution images with pixel-wise labels of all object instances. The objects comprise groceries and everyday products from 60 categories. The benchmark is designed such that it resembles the real-world setting of an automatic checkout, inventory, or warehouse system. The training images only contain objects of a single class on a homogeneous background, while the validation and test sets are much more complex and diverse. To further benchmark the robustness of instance segmentation methods, the scenes are acquired with different lightings, rotations, and backgrounds. We ensure that there are no ambiguities in the labels and that every instance is labeled comprehensively. The annotations are pixel-precise and allow using crops of single instances for articial data augmentation. The dataset covers several challenges highly relevant in the field, such as a limited amount of training data and a high diversity in the test and validation sets. The evaluation of state-of-the-art object detection and instance segmentation methods on D2S reveals significant room for improvement.

引用

页码：581 / 597

页数：17

共 36 条

[1]

Abu-El-Haija S., 2016, CORR

[2]

[Anonymous], 2002, P AS C COMP VIS

[3]

[Anonymous], 2016, CORR

[4]

[Anonymous], 2016, CORR

[5]

[Anonymous], 2016, CORR

[6] The Cityscapes Dataset for Semantic Urban Scene Understanding [J].

Cordts, Marius ;

Omran, Mohamed ;

Ramos, Sebastian ;

Rehfeld, Timo ;

Enzweiler, Markus ;

Benenson, Rodrigo ;

Franke, Uwe ;

Roth, Stefan ;

Schiele, Bernt .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3213-3223

[7] The PASCAL Visual Object Classes Challenge: A Retrospective [J].

Everingham, Mark ;

Eslami, S. M. Ali ;

Van Gool, Luc ;

Williams, Christopher K. I. ;

Winn, John ;

Zisserman, Andrew .

INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 111 (01) :98-136

[8] A Rotationally-Invariant Convolution Module by Feature Map Back-Rotation [J].

Follmann, Patrick ;

Boettger, Tobias .

2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, :784-792

[9] Vision meets robotics: The KITTI dataset [J].

Geiger, A. ;

Lenz, P. ;

Stiller, C. ;

Urtasun, R. .

INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2013, 32 (11) :1231-1237

[10]

George M, 2014, LECT NOTES COMPUT SC, V8690, P440, DOI 10.1007/978-3-319-10605-2_29

← 1 2 3 4 →