Deep Interactive Thin Object Selection

Cited by: 21

Authors:

Liew, Jun Hao [1]

Cohen, Scott [2]

Price, Brian [2]

Mai, Long [2]

Feng, Jiashi [1]

Affiliations:

[1] Natl Univ Singapore, Singapore, Singapore

[2] Adobe Res, San Jose, CA USA

Source:

2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021) | 2021

Keywords:

SEGMENTATION; CUT;

DOI:

10.1109/WACV48630.2021.00035

Chinese Library Classification number:

TP18 [Artificial Intelligence Theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Existing deep learning based interactive segmentation methods have achieved remarkable performance with only a few user clicks, e.g. DEXTR [32] attains 91.5% IoU on PASCAL VOC with only four extreme clicks. However, we observe that even state-of-the-art methods often struggle when the object to be segmented has elongated thin structures (e.g. bug legs and bicycle spokes). We investigate such failures and find that the critical reasons behind them are two-fold: 1) the lack of an appropriate training dataset; and 2) an extremely imbalanced distribution w.r.t. the number of pixels belonging to thin and non-thin regions. To address these challenges, we collect a large-scale dataset specifically for segmentation of thin elongated objects, named ThinObject-5K. We also present a novel integrative thin object segmentation network consisting of three streams. Among them, the high-resolution edge stream aims at preserving fine-grained details, including elongated thin parts; the fixed-resolution context stream focuses on capturing semantic context. The two streams' outputs are then combined in the fusion stream, where they complement each other to produce a refined segmentation output with sharper predictions around thin parts. Extensive experimental results demonstrate the effectiveness of our proposed solution on segmenting thin objects, surpassing the baseline by ~30% IoU_thin despite using only four clicks. Code and dataset are available at https://github.com/liewjunhao/thin-object-selection.
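The abstract reports results as IoU (intersection over union) and as IoU_thin. As a minimal illustrative sketch, not the authors' evaluation code, the function below computes IoU between two binary masks in plain Python. The optional `region` argument, which restricts the computation to a subset of pixels, is a hypothetical stand-in for a thin-region restriction; the record does not define how IoU_thin is computed, so treat that part as an assumption. The function name `iou` and the nested-list mask format are likewise chosen here for illustration.

```python
from typing import Optional, Sequence

# A mask is a 2D grid of 0/1 values (illustrative format, not from the paper).
Mask = Sequence[Sequence[int]]


def iou(pred: Mask, gt: Mask, region: Optional[Mask] = None) -> float:
    """Intersection over union of two binary masks.

    If `region` is given, only pixels where region is non-zero are counted.
    ASSUMPTION: this is one plausible way to restrict a score to thin
    parts; the source does not specify the IoU_thin definition.
    """
    inter = 0
    union = 0
    for r, (pred_row, gt_row) in enumerate(zip(pred, gt)):
        for c, (p, g) in enumerate(zip(pred_row, gt_row)):
            if region is not None and not region[r][c]:
                continue  # pixel lies outside the evaluated region
            if p and g:
                inter += 1
            if p or g:
                union += 1
    # Convention chosen here: two empty masks agree perfectly.
    return inter / union if union else 1.0


pred = [[1, 1, 0],
        [0, 1, 0]]
gt = [[1, 0, 0],
      [0, 1, 1]]
thin = [[0, 1, 0],
        [0, 0, 1]]  # hypothetical thin-region mask

print(iou(pred, gt))        # whole-image IoU
print(iou(pred, gt, thin))  # IoU restricted to the hypothetical thin region
```

In this toy example the prediction matches the ground truth on the two "body" pixels but misses every pixel in the restricted region, so the restricted score is much lower than the whole-image score. That mirrors the abstract's point that an overall IoU can hide failures on thin parts.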


Pages: 305-314

Number of pages: 10

References (53 in total):

[1] Devil is in the Edges: Learning Semantic Boundaries from Noisy Annotations [J].

Acuna, David ;

Kar, Amlan ;

Fidler, Sanja .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :11067-11075

[2] Efficient Interactive Annotation of Segmentation Datasets with Polygon-RNN++ [J].

Acuna, David ;

Ling, Huan ;

Kar, Amlan ;

Fidler, Sanja .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :859-868

[3] NTIRE 2017 Challenge on Single Image Super-Resolution: Dataset and Study [J].

Agustsson, Eirikur ;

Timofte, Radu .

2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, :1122-1131

[4]

[Anonymous], 2018, ECCV

[5]

[Anonymous], 2010, International journal of computer vision, DOI DOI 10.1007/s11263-009-0275-4

[6]

[Anonymous], 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence

[7] Annotating Object Instances with a Polygon-RNN [J].

Castrejon, Lluis ;

Kundu, Kaustav ;

Urtasun, Raquel ;

Fidler, Sanja .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :4485-4493

[8] Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].

Chen, Liang-Chieh ;

Zhu, Yukun ;

Papandreou, George ;

Schroff, Florian ;

Adam, Hartwig .

COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851

[9] SPGNet: Semantic Prediction Guidance for Scene Parsing [J].

Cheng, Bowen ;

Chen, Liang-Chieh ;

Wei, Yunchao ;

Zhu, Yukun ;

Huang, Zilong ;

Xiong, Jinjun ;

Huang, Thomas S. ;

Hwu, Wen-Mei ;

Shi, Honghui .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :5217-5227

[10] Learning to Predict Crisp Boundaries [J].

Deng, Ruoxi ;

Shen, Chunhua ;

Liu, Shengjun ;

Wang, Huibing ;

Liu, Xinru .

COMPUTER VISION - ECCV 2018, PT VI, 2018, 11210 :570-586
