Hybrid two-stage cascade for instance segmentation of overlapping objects

被引:0
作者
Yang, Yakun [1 ,3 ]
Luo, Wenjie [1 ,2 ,3 ]
Tian, Xuedong [1 ,2 ,3 ]
机构
[1] Hebei Univ, Sch Cyber Secur & Comp, Baoding 071002, Peoples R China
[2] Hebei Univ, Hebei Machine Vis Engn Res Ctr, Baoding 071002, Peoples R China
[3] Hebei Univ, Lab Intelligence Image & Text, Baoding 071002, Peoples R China
关键词
Computer vision; Instance segmentation; Two-stage cascade model; Hybrid tasks learning; Object occlusion;
D O I
10.1007/s10044-023-01185-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Although two-stage methods of instance segmentation achieve better performance than one-stage counterparts, the segmentation results on overlapping objects are unsatisfactory. We found that occlusion significantly impacts the location of adjacent objects and produces coarse masks without adequate refinements. To circumvent the issue, we propose a hybrid model for instance segmentation called HTCIS, which iteratively forms the detection and segmentation. The main idea is to improve overall performance by optimizing every component based on a two-stage cascade structure. Compared with existing models, our approach decreases the loss of feature information, including semantic and detailed features. The detection branch prioritizes location accuracy when ranking bounding boxes, while the segmentation branch explores more contextual information and segments pixels in a multi-view fashion with the guide of an attention mechanism. Experimental results demonstrate that HTCIS is capable of processing occlusion. We conclude that multi-refinement of two-stage cascade is essential for accurate segmentation of overlapping objects, and our optimization is efficient in achieving this goal.
引用
收藏
页码:957 / 967
页数:11
相关论文
共 51 条
  • [41] Vaswani A, 2017, ADV NEUR IN, V30
  • [42] Wang X., 2020, ARXIV
  • [43] Wang YQ, 2020, PROC CVPR IEEE, P9310, DOI 10.1109/CVPR42600.2020.00933
  • [44] Woo S., 2018, P EUR C COMP VIS ECC, P3, DOI [DOI 10.1007/978-3-030-01234-2_1, 10.1007/978-3-030-01234-2_1]
  • [45] Xie EZ, 2020, PROC CVPR IEEE, P12190, DOI 10.1109/CVPR42600.2020.01221
  • [46] Aggregated Residual Transformations for Deep Neural Networks
    Xie, Saining
    Girshick, Ross
    Dollar, Piotr
    Tu, Zhuowen
    He, Kaiming
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 5987 - 5995
  • [47] Xinlong Wang, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12363), P649, DOI 10.1007/978-3-030-58523-5_38
  • [48] Explicit Shape Encoding for Real-Time Instance Segmentation
    Xu, Wenqiang
    Wang, Haiyang
    Qi, Fubo
    Lu, Cewu
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 5167 - 5176
  • [49] RefineMask: Towards High-Quality Instance Segmentation with Fine-Grained Features
    Zhang, Gang
    Lu, Xin
    Tan, Jingru
    Li, Jianmin
    Zhang, Zhaoxiang
    Li, Quanquan
    Hu, Xiaolin
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 6857 - 6865
  • [50] Zhang H., 2021, ARXIV