Rich Embedding Features for One-Shot Semantic Segmentation

Cited by: 22
Authors
Zhang, Xiaolin [1]
Wei, Yunchao [2]
Li, Zhao [3]
Yan, Chenggang [4]
Yang, Yi [5]
Affiliations
[1] Univ Technol Sydney, Australian Artificial Intelligence Inst, Sydney, NSW 2007, Australia
[2] Beijing Jiaotong Univ, Inst Informat Sci, Beijing 100044, Peoples R China
[3] Shandong Comp Sci Ctr, Shandong Artificial Intelligence Inst, Natl Supercomp Ctr Jinan, Jinan 250101, Peoples R China
[4] Hangzhou Dianzi Univ, Inst Informat & Control, Hangzhou 310018, Peoples R China
[5] Zhejiang Univ, Coll Comp Sci & Technol, CCAI, Hangzhou 310027, Peoples R China
Keywords
Image segmentation; Semantics; Feature extraction; Task analysis; Prototypes; Support vector machines; Pulse modulation; Deep learning; few shot segmentation; object segmentation; Siamese network
DOI
10.1109/TNNLS.2021.3081693
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
One-shot semantic segmentation poses the challenging task of segmenting object regions from unseen categories with only one annotated example as guidance. Constructing robust feature representations from the guidance image is therefore crucial to the success of one-shot semantic segmentation. To this end, we propose in this article a simple yet effective approach named rich embedding features (REFs). Given a reference image accompanied by its annotated mask, REF constructs rich embedding features of the support object from three perspectives: 1) a global embedding to capture the general characteristics; 2) a peak embedding to capture the most discriminative information; and 3) an adaptive embedding to capture the internal long-range dependencies. By combining these informative features, we can harvest sufficient and rich guidance even from a single reference image. In addition to REF, we further propose a simple depth-priority context module to obtain useful contextual cues from the query image. This raises the performance of one-shot semantic segmentation to a new level. We conduct experiments on the pattern analysis, statistical modeling and computational learning (PASCAL) visual object classes (VOC) 2012 and common objects in context (COCO) datasets to demonstrate the effectiveness of our approach.
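The three perspectives named in the abstract can be illustrated with a minimal NumPy sketch. The abstract does not say which operators REF actually uses, so every choice here is an assumption made for illustration only: masked average pooling for the global embedding, masked max pooling for the peak embedding, and dot-product self-attention over foreground pixels for the adaptive embedding. The function names (`global_embedding`, `peak_embedding`, `adaptive_embedding`, `rich_embedding`) are hypothetical, and the sketch assumes a non-empty support mask.

```python
import numpy as np

def global_embedding(feat, mask):
    """Assumed form: masked average pooling over foreground pixels."""
    m = mask.astype(feat.dtype)                      # (H, W), broadcasts over channels
    return (feat * m).sum(axis=(1, 2)) / max(m.sum(), 1e-6)

def peak_embedding(feat, mask):
    """Assumed form: per-channel maximum over foreground pixels."""
    masked = np.where(mask.astype(bool), feat, -np.inf)
    return masked.max(axis=(1, 2))

def adaptive_embedding(feat, mask):
    """Assumed form: dot-product self-attention among foreground pixels, then averaging."""
    fg = feat[:, mask.astype(bool)].T                # (N, C) foreground feature vectors
    scores = fg @ fg.T / np.sqrt(fg.shape[1])        # (N, N) pairwise similarities
    scores -= scores.max(axis=1, keepdims=True)      # stabilise the softmax
    weights = np.exp(scores)
    weights /= weights.sum(axis=1, keepdims=True)
    return (weights @ fg).mean(axis=0)               # (C,)

def rich_embedding(feat, mask):
    """Concatenate the three embeddings into one guidance vector of length 3C."""
    return np.concatenate([
        global_embedding(feat, mask),
        peak_embedding(feat, mask),
        adaptive_embedding(feat, mask),
    ])

# Toy support feature map with C=2 channels on a 2x2 grid, and a diagonal mask.
feat = np.arange(8, dtype=float).reshape(2, 2, 2)
mask = np.array([[1, 0], [0, 1]])
guidance = rich_embedding(feat, mask)
```

Concatenation is only one way to combine the embeddings; how the paper fuses them with the query features is not described in the abstract.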
Pages: 6484-6493
Number of pages: 10