DeepFlux for Skeletons in the Wild

被引：34

作者：

Wang, Yukang ^{[1
]}

Xu, Yongchao ^{[1
]}

Tsogkas, Stavros ^{[2
,3
,4
]}

Bai, Xiang ^{[1
]}

Dickinson, Sven ^{[2
,3
,4
]}

Siddiqi, Kaleem ^{[5
,6
]}

机构：

[1] Huazhong Univ Sci & Technol, Wuhan, Peoples R China

[2] Univ Toronto, Toronto, ON, Canada

[3] Vector Inst Artificial Intelligence, Toronto, ON, Canada

[4] Samsung Toronto AI Res Ctr, Toronto, ON, Canada

[5] McGill Univ, Sch Comp Sci, Montreal, PQ H3A 2T5, Canada

[6] McGill Univ, Ctr Intelligent Machines, Montreal, PQ H3A 2T5, Canada

来源：

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) | 2019年

关键词：

EDGE-DETECTION; RECOGNITION; SEGMENTATION; SHAPE;

D O I：

10.1109/CVPR.2019.00543

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Computing object skeletons in natural images is challenging, owing to large variations in object appearance and scale, and the complexity of handling background clutter. Many recent methods frame object skeleton detection as a binary pixel classification problem, which is similar in spirit to learning-based edge detection, as well as to semantic segmentation methods. In the present article, we depart from this strategy by training a CNN to predict a two-dimensional vector field, which maps each scene point to a candidate skeleton pixel, in the spirit of flux-based skeletonization algorithms. This "image context flux'' representation has two major advantages over previous approaches. First, it explicitly encodes the relative position of skeletal pixels to semantically meaningful entities, such as the image points in their spatial context, and hence also the implied object boundaries. Second, since the skeleton detection context is a region-based vector field, it is better able to cope with object parts of large width. We evaluate the proposed method on three benchmark datasets for skeleton detection and two for symmetry detection, achieving consistently superior performance over state-of-the-art methods.

引用

页码：5282 / 5291

页数：10

共 54 条

[1]

[Anonymous], 2011, CVPR, DOI DOI 10.1109/CVPR.2011.5995316

[2]

[Anonymous], P ICLR

[3] Deep Watershed Transform for Instance Segmentation [J].

Bai, Min ;

Urtasun, Raquel .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :2858-2866

[4] Active Skeleton for Non-rigid Object Detection [J].

Bai, Xiang ;

Wang, Xinggang ;

Latecki, Longin Jan ;

Liu, Wenyu ;

Tu, Zhuowen .

2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2009, :575-582

[5] BIOLOGICAL SHAPE AND VISUAL SCIENCE .1. [J].

BLUM, H .

JOURNAL OF THEORETICAL BIOLOGY, 1973, 38 (02) :205-287

[6]

Borenstein E, 2002, LECT NOTES COMPUT SC, V2351, P109

[7] MaskLab: Instance Segmentation by Refining Object Detection with Semantic and Direction Features [J].

Chen, Liang-Chieh ;

Hermans, Alexander ;

Papandreou, George ;

Schroff, Florian ;

Wang, Peng ;

Adam, Hartwig .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :4013-4022

[8] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].

Chen, Liang-Chieh ;

Papandreou, George ;

Kokkinos, Iasonas ;

Murphy, Kevin ;

Yuille, Alan L. .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848

[9]

Chen Xinlei, 2015, CoRR

[10] Video Object Segmentation by Learning Location-Sensitive Embeddings [J].

Ci, Hai ;

Wang, Chunyu ;

Wang, Yizhou .

COMPUTER VISION - ECCV 2018, PT XI, 2018, 11215 :524-539

← 1 2 3 4 5 6 →