Hybrid two-stage cascade for instance segmentation of overlapping objects

Cited by: 0

Authors:

Yang, Yakun ^{[1,3]}

Luo, Wenjie ^{[1,2,3]}

Tian, Xuedong ^{[1,2,3]}

Affiliations:

[1] Hebei Univ, Sch Cyber Secur & Comp, Baoding 071002, Peoples R China

[2] Hebei Univ, Hebei Machine Vis Engn Res Ctr, Baoding 071002, Peoples R China

[3] Hebei Univ, Lab Intelligence Image & Text, Baoding 071002, Peoples R China

Source:

PATTERN ANALYSIS AND APPLICATIONS | 2023 / Vol. 26 / Issue 03

Keywords:

Computer vision; Instance segmentation; Two-stage cascade model; Hybrid tasks learning; Object occlusion;

DOI:

10.1007/s10044-023-01185-5

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Although two-stage methods of instance segmentation achieve better performance than their one-stage counterparts, their segmentation results on overlapping objects are unsatisfactory. We found that occlusion significantly impacts the localization of adjacent objects and produces coarse masks without adequate refinement. To circumvent the issue, we propose a hybrid model for instance segmentation called HTCIS, which iteratively performs detection and segmentation. The main idea is to improve overall performance by optimizing every component based on a two-stage cascade structure. Compared with existing models, our approach decreases the loss of feature information, including semantic and detailed features. The detection branch prioritizes location accuracy when ranking bounding boxes, while the segmentation branch explores more contextual information and segments pixels in a multi-view fashion guided by an attention mechanism. Experimental results demonstrate that HTCIS is capable of handling occlusion. We conclude that multi-refinement in a two-stage cascade is essential for accurate segmentation of overlapping objects, and our optimization is efficient in achieving this goal.

Pages: 957-967

Page count: 11
