Amodal instance segmentation with dual guidance from contextual and shape priors

Citations: 0
Authors
Zhan, Jiao [1]
Luo, Yarong [1]
Guo, Chi [1, 2]
Wu, Yejun [3]
Yang, Bohan [1]
Wang, Jingrong [1]
Liu, Jingnan [1]
Affiliations
[1] Wuhan Univ, GNSS Res Ctr, Wuhan 430072, Hubei, Peoples R China
[2] Hubei Luojia Lab, Wuhan 430079, Hubei, Peoples R China
[3] Wuhan Univ, Sch Comp Sci, Wuhan 430072, Hubei, Peoples R China
Funding
China Postdoctoral Science Foundation
Keywords
Instance segmentation; Amodal instance segmentation; Pixel affinity; Contextual dependency
DOI
10.1016/j.asoc.2024.112602
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Human perception possesses the remarkable ability to mentally reconstruct the complete structure of occluded objects, which has inspired researchers to pursue amodal instance segmentation for a more comprehensive understanding of the scene. Previous works have shown promising results, but they often capture contextual dependencies in an unsupervised way, which can lead to undesirable contextual dependencies and unreasonable feature representations. To tackle this problem, we propose a Pixel Affinity-Parsing (PAP) module trained with the Pixel Affinity Loss (PAL). Embedded into a CNN, the PAP module can leverage learned contextual priors to guide the network to explicitly distinguish different relationships between pixels, thus capturing the intra-class and inter-class contextual dependencies in a non-local and supervised way. This process helps to yield robust feature representations that keep the network from making misjudgments. To demonstrate the effectiveness of the PAP module, we design an effective Pixel Affinity-Parsing Network (PAPNet). Notably, PAPNet also introduces shape priors to guide the amodal mask refinement process, thus preventing implausible shapes in the predicted masks. Consequently, with the dual guidance of contextual and shape priors, PAPNet can reconstruct the full shape of occluded objects accurately and reasonably. Experimental results demonstrate that the proposed PAPNet outperforms existing state-of-the-art methods on multiple amodal datasets. Specifically, on the KINS dataset, PAPNet achieves 37.1% AP, 60.6% AP50 and 39.8% AP75, surpassing C2F-Seg by 0.6%, 2.4% and 2.8%. On the D2SA dataset, PAPNet achieves 71.70% AP, 85.98% AP50 and 77.10% AP75, surpassing PGExp by 0.75% and 0.33% in AP50 and AP75 while remaining comparable to PGExp in AP. On the COCOA-cls dataset, PAPNet achieves 41.29% AP, 60.95% AP50 and 46.17% AP75, surpassing PGExp by 3.74%, 3.21% and 4.76%. On the CWALT dataset, PAPNet achieves 72.51% AP, 85.02% AP50 and 80.47% AP75, surpassing VRSPNet by 5.38%, 0.07% and 5.35%. The code is available at https://github.com/jiaoZ7688/PAP-Net.
Pages: 18