A comparison of shape-based matching with deep-learning-based object detection

被引：12

作者：

Ulrich, Markus ^{[1
]}

Follmann, Patrick ^{[1
,2
]}

Neudeck, Jan-Hendrik ^{[2
]}

机构：

[1] MVTec Software GmbH, Munich, Germany

[2] Tech Univ Munich, Munich, Germany

来源：

TM-TECHNISCHES MESSEN | 2019年 / 86卷 / 11期

关键词：

Matching; object detection; object recognition; weak supervision; deep learning;

D O I：

10.1515/teme-2019-0076

中图分类号：

TH7 [仪器、仪表];

学科分类号：

0804 ; 080401 ; 081102 ;

摘要：

Matching, i.e. determining the exact 2D pose (e.g., position and orientation) of objects, is still one of the key tasks in machine vision applications like robot navigation, measuring, or grasping an object. There are many classic approaches for matching, based on edges or on the pure gray values of the template. In recent years, deep learning has been utilized mainly for more difficult tasks where the objects of interest are from many different categories with high intra-class variations and classic algorithms are failing. In this work, we compare one of the latest deep-learning-based object detectors with classic shape-based matching. We evaluate the methods both on a matching dataset as well as an object detection dataset that contains rigid objects and is thus also suitable for shape-based matching. We show that for datasets of this type, where rigid objects appear with rigid transformations, shape-based matching still outperforms recent object detectors regarding runtime, robustness, and precision if only a single template image per object is used. On the other hand, we show that for the application of object detection, the deep-learning-based approach outperforms the classic approach if annotated data is used for training. Ultimately, the choice of the best suited approach depends on the conditions and requirements of the application.

引用

页码：685 / 698

页数：14

共 26 条

[1]

[Anonymous], 2002, International Archives of Photogrammetry Remote Sensing and Spatial Information Sciences

[2] The Cityscapes Dataset for Semantic Urban Scene Understanding [J].

Cordts, Marius ;

Omran, Mohamed ;

Ramos, Sebastian ;

Rehfeld, Timo ;

Enzweiler, Markus ;

Benenson, Rodrigo ;

Franke, Uwe ;

Roth, Stefan ;

Schiele, Bernt .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3213-3223

[3]

Follmann Patrick, 2019, Pattern Recognition. 40th German Conference, GCPR 2018. Proceedings: Lecture Notes in Computer Science (LNCS 11269), P363, DOI 10.1007/978-3-030-12939-2_25

[4] Detecting Animals in Infrared Images from Camera-Traps [J].

Follmann P. ;

Radig B. .

Pattern Recognition and Image Analysis, 2018, 28 (4) :605-611

[5] MVTec D2S: Densely Segmented Supermarket Dataset [J].

Follmann, Patrick ;

Boettger, Tobias ;

Haertinger, Philipp ;

Koenig, Rebecca ;

Ulrich, Markus .

COMPUTER VISION - ECCV 2018, PT X, 2018, 11214 :581-597

[6]

Girshick R., 2014, IEEE COMP SOC C COMP, DOI [10.1109/CVPR.2014.81, DOI 10.1109/CVPR.2014.81]

[7]

He K., 2016, IEEE C COMPUT VIS PA, DOI [10.1007/978-3-319-46493-0_38, DOI 10.1007/978-3-319-46493-0_38, DOI 10.1109/CVPR.2016.90]

[8]

Iandola FN, 2016, ARXIV160207360

[9]

Joseph RK, 2016, CRIT POL ECON S ASIA, P1

[10] ImageNet Classification with Deep Convolutional Neural Networks [J].

Krizhevsky, Alex ;

Sutskever, Ilya ;

Hinton, Geoffrey E. .

COMMUNICATIONS OF THE ACM, 2017, 60 (06) :84-90

← 1 2 3 →