Amodal instance segmentation with dual guidance from contextual and shape priors

被引:0
|
作者
Zhan, Jiao [1 ]
Luo, Yarong [1 ]
Guo, Chi [1 ,2 ]
Wu, Yejun [3 ]
Yang, Bohan [1 ]
Wang, Jingrong [1 ]
Liu, Jingnan [1 ]
机构
[1] Wuhan Univ, GNSS Res Ctr, Wuhan 430072, Hubei, Peoples R China
[2] Hubei Luojia Lab, Wuhan 430079, Hubei, Peoples R China
[3] Wuhan Univ, Sch Comp Sci, Wuhan 430072, Hubei, Peoples R China
基金
中国博士后科学基金;
关键词
Instance segmentation; Amodal instance segmentation; Pixel affinity; Contextual dependency;
D O I
10.1016/j.asoc.2024.112602
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human perception possesses the remarkable ability to mentally reconstruct the complete structure of occluded objects, which has inspired researchers to pursue amodal instance segmentation for a more comprehensive understanding of the scene. Previous works have shown promising results, but they often capture the contextual dependencies in an unsupervised way, which can lead to undesirable contextual dependencies and unreasonable feature representations. To tackle this problem, we propose a Pixel Affinity-Parsing (PAP) module trained with the Pixel Affinity Loss (PAL). Embedded into CNN, the PAP module can leverage learned contextual priors to guide the network to explicitly distinguish different relationships between pixels, thus capturing the intraclass and inter-class contextual dependencies in a non-local and supervised way. This process helps to yield robust feature representations to prevent the network from misjudging. To demonstrate the effectiveness of the PAP module, we design an effective Pixel Affinity-Parsing Network (PAPNet). Notably, PAPNet also introduces shape priors to guide the amodal mask refinement process, thus preventing implausible shapes in the predicted masks. Consequently, with the dual guidance of contextual and shape priors, PAPNet can reconstruct the full shape of occluded objects accurately and reasonably. Experimental results demonstrate that the proposed PAPNet outperforms existing state-of-the-art methods on multiple amodal datasets. Specifically, on the KINS dataset, PAPNet achieves 37.1% AP, 60.6% AP50 and 39.8% AP75, surpassing C2F-Seg by 0.6%, 2.4% and 2.8%. On the D2SA dataset, PAPNet achieves 71.70% AP, 85.98% AP50 and 77.10% AP75, surpassing PGExp by 0.75% and 0.33% in AP50 and AP75, and being comparable to PGExp in AP. On the COCOA-cls dataset, PAPNet achieves 41.29% AP, 60.95% AP50 and 46.17% AP75, surpassing PGExp by 3.74%, 3.21% and 4.76%. On the CWALT dataset, PAPNet achieves 72.51% AP, 85.02% AP50 and 80.47% AP75, surpassing VRSPNet by 5.38%, 0.07% and 5.35%. The code is available at https://github.com/jiaoZ7688/PAP-Net.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] Dual Data Augmentation Method for Data-Deficient and Occluded Instance Segmentation
    Yan, Bo
    Li, Yadong
    Zhao, Xingran
    Wang, Hongbin
    PROCEEDINGS OF THE 5TH ACM INTERNATIONAL WORKSHOP ON MULTIMEDIA CONTENT ANALYSIS IN SPORTS, MMSPORTS 2022, 2022, : 117 - 120
  • [22] Query-Based Instance Segmentation with Dual Attention Transformer for Autonomous Vehicles
    Taourirte, Aya
    Juang, Li-Hong
    WORLD ELECTRIC VEHICLE JOURNAL, 2025, 16 (01):
  • [23] Instance segmentation from small dataset by a dual-layer semantics-based deep learning framework
    Chen, YiMing
    Li, JianWei
    Hu, XiaoBing
    Liu, YiRui
    Ma, JianKai
    Xing, Chen
    Li, JunJie
    Wang, ZhiJun
    Wang, JinCheng
    SCIENCE CHINA-TECHNOLOGICAL SCIENCES, 2024, 67 (09) : 2817 - 2833
  • [24] DALaneNet: A Dual Attention Instance Segmentation Network for Real-Time Lane Detection
    Guo, Zhiyang
    Huang, Yingping
    Wei, Hongjian
    Zhang, Chong
    Zhao, Baigan
    Shao, Zheqin
    IEEE SENSORS JOURNAL, 2021, 21 (19) : 21730 - 21739
  • [25] Instance Segmentation in Very High Resolution Remote Sensing Imagery Based on Hard-to-Segment Instance Learning and Boundary Shape Analysis
    Gong, Yiping
    Zhang, Fan
    Jia, Xiangyang
    Mao, Zhu
    Huang, Xianfeng
    Li, Deren
    REMOTE SENSING, 2022, 14 (01)
  • [26] Mono is Enough: Instance Segmentation from Single Annotated Sample
    Yang, Longrong
    Li, Hongliang
    Wu, Qingbo
    Meng, Fanman
    Ngan, King Ngi
    2020 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2020, : 120 - 123
  • [27] Camouflaged Instance Segmentation From Global Capture to Local Refinement
    Li, Chen
    Jiao, Ge
    Wu, Yun
    Zhao, Weichen
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 661 - 665
  • [28] 3D Point Cloud Instance Segmentation Considering Global Shape Contour Constraints
    Xv, Jiabin
    Deng, Fei
    REMOTE SENSING, 2023, 15 (20)
  • [29] PROnet: Point Refinement Using Shape-Guided Offset Map for Nuclei Instance Segmentation
    Nam, Siwoo
    Jeong, Jaehoon
    Luna, Miguel
    Chikontwe, Philip
    Park, Sang Hyun
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT I, 2023, 14220 : 528 - 538
  • [30] ICNet: A Dual-Branch Instance Segmentation Network for High-Precision Pig Counting
    Liu, Shanghao
    Zhao, Chunjiang
    Zhang, Hongming
    Li, Qifeng
    Li, Shuqin
    Chen, Yini
    Gao, Ronghua
    Wang, Rong
    Li, Xuwen
    AGRICULTURE-BASEL, 2024, 14 (01):