Amodal instance segmentation with dual guidance from contextual and shape priors

Cited by: 0
Authors
Zhan, Jiao [1]
Luo, Yarong [1]
Guo, Chi [1, 2]
Wu, Yejun [3]
Yang, Bohan [1]
Wang, Jingrong [1]
Liu, Jingnan [1]
Affiliations
[1] Wuhan Univ, GNSS Res Ctr, Wuhan 430072, Hubei, Peoples R China
[2] Hubei Luojia Lab, Wuhan 430079, Hubei, Peoples R China
[3] Wuhan Univ, Sch Comp Sci, Wuhan 430072, Hubei, Peoples R China
Funding
China Postdoctoral Science Foundation
Keywords
Instance segmentation; Amodal instance segmentation; Pixel affinity; Contextual dependency
DOI
10.1016/j.asoc.2024.112602
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Human perception can mentally reconstruct the complete structure of occluded objects, which has inspired researchers to pursue amodal instance segmentation for a more comprehensive understanding of the scene. Previous works have shown promising results, but they often capture contextual dependencies in an unsupervised way, which can lead to undesirable dependencies and unreasonable feature representations. To tackle this problem, we propose a Pixel Affinity-Parsing (PAP) module trained with the Pixel Affinity Loss (PAL). Embedded into a CNN, the PAP module leverages learned contextual priors to guide the network to explicitly distinguish different relationships between pixels, thus capturing the intra-class and inter-class contextual dependencies in a non-local and supervised way. This process yields robust feature representations that help prevent the network from making misjudgments. To demonstrate the effectiveness of the PAP module, we design a Pixel Affinity-Parsing Network (PAPNet). Notably, PAPNet also introduces shape priors to guide the amodal mask refinement process, thus preventing implausible shapes in the predicted masks. Consequently, with the dual guidance of contextual and shape priors, PAPNet can reconstruct the full shape of occluded objects accurately and reasonably. Experimental results demonstrate that the proposed PAPNet outperforms existing state-of-the-art methods on multiple amodal datasets. Specifically, on the KINS dataset, PAPNet achieves 37.1% AP, 60.6% AP50 and 39.8% AP75, surpassing C2F-Seg by 0.6%, 2.4% and 2.8%, respectively. On the D2SA dataset, PAPNet achieves 71.70% AP, 85.98% AP50 and 77.10% AP75, surpassing PGExp by 0.75% and 0.33% in AP50 and AP75, respectively, while remaining comparable to PGExp in AP. On the COCOA-cls dataset, PAPNet achieves 41.29% AP, 60.95% AP50 and 46.17% AP75, surpassing PGExp by 3.74%, 3.21% and 4.76%, respectively. On the CWALT dataset, PAPNet achieves 72.51% AP, 85.02% AP50 and 80.47% AP75, surpassing VRSPNet by 5.38%, 0.07% and 5.35%, respectively. The code is available at https://github.com/jiaoZ7688/PAP-Net.
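The abstract does not spell out how the Pixel Affinity Loss is computed, so the following is only a minimal sketch of one common way to supervise pixel-to-pixel relationships. It assumes the target affinity between two pixels is 1 when they share a class label and 0 otherwise, and that the predicted affinity map is scored with binary cross-entropy. The function names `ideal_affinity` and `affinity_bce` are hypothetical and are not taken from the PAP-Net repository.

```python
import numpy as np


def ideal_affinity(labels: np.ndarray) -> np.ndarray:
    """Build an (N, N) target map for N pixels.

    Assumption: entry (i, j) is 1.0 when pixels i and j share a label
    (intra-class pair) and 0.0 otherwise (inter-class pair).
    """
    flat = labels.reshape(-1)
    return (flat[:, None] == flat[None, :]).astype(np.float64)


def affinity_bce(pred: np.ndarray, target: np.ndarray, eps: float = 1e-7) -> float:
    """Mean binary cross-entropy between predicted and target affinities."""
    p = np.clip(pred, eps, 1.0 - eps)  # avoid log(0)
    return float(-np.mean(target * np.log(p) + (1.0 - target) * np.log(1.0 - p)))


# Toy 2x2 label map: top row is class 0, bottom row is class 1.
labels = np.array([[0, 0], [1, 1]])
target = ideal_affinity(labels)

# An uninformative prediction (0.5 everywhere) versus a perfect one.
loss_uniform = affinity_bce(np.full_like(target, 0.5), target)
loss_perfect = affinity_bce(target, target)
```

In this toy case the affinity map is 4 x 4 because every pixel is compared with every other pixel, which is what makes the supervision non-local. A real network would predict the map from downsampled features to keep N manageable.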
Pages: 18