Rich Embedding Features for One-Shot Semantic Segmentation

Cited by: 22
Authors
Zhang, Xiaolin [1]
Wei, Yunchao [2]
Li, Zhao [3]
Yan, Chenggang [4]
Yang, Yi [5]
Affiliations
[1] Univ Technol Sydney, Australian Artificial Intelligence Inst, Sydney, NSW 2007, Australia
[2] Beijing Jiaotong Univ, Inst Informat Sci, Beijing 100044, Peoples R China
[3] Shandong Comp Sci Ctr, Shandong Artificial Intelligence Inst, Natl Supercomp Ctr Jinan, Jinan 250101, Peoples R China
[4] Hangzhou Dianzi Univ, Inst Informat & Control, Hangzhou 310018, Peoples R China
[5] Zhejiang Univ, Coll Comp Sci & Technol, CCAI, Hangzhou 310027, Peoples R China
Keywords
Image segmentation; Semantics; Feature extraction; Task analysis; Prototypes; Support vector machines; Pulse modulation; Deep learning; few shot segmentation; object segmentation; Siamese network;
DOI
10.1109/TNNLS.2021.3081693
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
One-shot semantic segmentation poses the challenging task of segmenting object regions from unseen categories with only one annotated example as guidance. Thus, how to effectively construct robust feature representations from the guidance image is crucial to the success of one-shot semantic segmentation. To this end, we propose in this article a simple yet effective approach named rich embedding features (REF). Given a reference image accompanied by its annotated mask, our REF constructs rich embedding features of the support object from three perspectives: 1) a global embedding to capture the general characteristics; 2) a peak embedding to capture the most discriminative information; and 3) an adaptive embedding to capture the internal long-range dependencies. By combining these informative features, we can easily harvest sufficient and rich guidance even from a single reference image. In addition to REF, we further propose a simple depth-priority context module to obtain useful contextual cues from the query image. This successfully raises the performance of one-shot semantic segmentation to a new level. We conduct experiments on Pattern Analysis, Statistical Modelling and Computational Learning (PASCAL) Visual Object Classes (VOC) 2012 and Common Objects in Context (COCO) to demonstrate the effectiveness of our approach.
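The three embeddings named in the abstract can be illustrated with a minimal NumPy sketch. The abstract does not specify the operators, so everything here is an assumption made for illustration: masked average pooling stands in for the global embedding, masked max pooling for the peak embedding, and dot-product self-attention over foreground positions for the adaptive embedding. The function name `rich_embeddings` and the final concatenation are likewise hypothetical and should not be read as the authors' implementation.

```python
import numpy as np


def rich_embeddings(features, mask):
    """Illustrative sketch only, not the paper's implementation.

    features: (C, H, W) support feature map.
    mask:     (H, W) binary mask of the support object.
    Returns a (3 * C,) vector: [global | peak | adaptive].
    """
    c = features.shape[0]
    flat = features.reshape(c, -1)                 # (C, H*W)
    fg = flat[:, mask.reshape(-1) > 0]             # (C, K) foreground vectors
    if fg.shape[1] == 0:
        raise ValueError("mask contains no foreground pixels")

    # Assumed global embedding: masked average pooling.
    global_emb = fg.mean(axis=1)

    # Assumed peak embedding: masked max pooling per channel.
    peak_emb = fg.max(axis=1)

    # Assumed adaptive embedding: self-attention among foreground
    # positions (softmax over scaled dot products), then averaging.
    scores = fg.T @ fg / np.sqrt(c)                # (K, K)
    scores = scores - scores.max(axis=1, keepdims=True)
    weights = np.exp(scores)
    weights = weights / weights.sum(axis=1, keepdims=True)
    attended = weights @ fg.T                      # (K, C)
    adaptive_emb = attended.mean(axis=0)

    return np.concatenate([global_emb, peak_emb, adaptive_emb])


# Usage with random stand-in data (shapes are arbitrary).
rng = np.random.default_rng(0)
feats = rng.normal(size=(4, 3, 3))
obj_mask = np.zeros((3, 3), dtype=int)
obj_mask[0, 0] = obj_mask[1, 1] = obj_mask[2, 2] = 1
guidance = rich_embeddings(feats, obj_mask)
```

In this sketch the resulting vector would act as the guidance signal matched against query-image features; how the paper actually fuses the three embeddings is not stated in the abstract.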
Pages: 6484-6493
Page count: 10