Decoupled Metric Network for Single-Stage Few-Shot Object Detection

Cited by: 37
Authors
Lu, Yue [1 ,2 ]
Chen, Xingyu [3 ]
Wu, Zhengxing [1 ,2 ]
Yu, Junzhi [1 ,4 ]
Affiliations
[1] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
[3] Kuaishou Technol, Ytech, Beijing 100085, Peoples R China
[4] Peking Univ, Coll Engn, BIC ESAT, Dept Adv Mfg & Robot,State Key Lab Turbulence & C, Beijing 100871, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Object detection; Feature extraction; Training; Head; Task analysis; Shape; Measurement; Computer vision; deep learning; few-shot learning; object detection;
DOI
10.1109/TCYB.2022.3149825
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Within the last few years, great efforts have been made to study few-shot learning. Although general object detection is advancing at a rapid pace, few-shot detection remains a very challenging problem. In this work, we propose a novel decoupled metric network (DMNet) for single-stage few-shot object detection. We design a decoupled representation transformation (DRT) and an image-level distance metric learning (IDML) module to solve the few-shot detection problem. The DRT eliminates the adverse effect of handcrafted prior knowledge by predicting objectness and anchor shape. Meanwhile, to alleviate the representation disagreement between classification and localization (i.e., translational invariance versus translational variance), the DRT generates adaptive representations in a decoupled manner, so that the model can learn more easily from only a few training samples. For few-shot classification in the detection task, we design an IDML module tailored to enhance generalization ability. This module performs metric learning on the whole visual feature, so it can be more efficient than traditional DML because it supports parallel inference for multiple objects. Based on the DRT and IDML, our DMNet efficiently realizes a novel paradigm for few-shot detection, called single-stage metric detection. Experiments are conducted on the PASCAL VOC and MS COCO datasets. Our method achieves state-of-the-art performance in few-shot object detection. The code is available at https://github.com/yrqs/DMNet.
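The idea of applying a distance metric to a whole feature map, rather than to one cropped object at a time, can be illustrated with a minimal sketch. This is not the DMNet implementation: the function names, the use of class prototypes, the squared Euclidean distance, and the temperature parameter are all assumptions made for illustration only.

```python
import math


def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dense_metric_classify(feature_map, prototypes, temperature=1.0):
    """Score every location of an H x W grid of embeddings in one pass.

    feature_map: list of rows, each row a list of embedding vectors.
    prototypes: dict mapping a class name to a representative embedding
                (hypothetical; a stand-in for learned class representatives).
    Returns an H x W grid of dicts mapping class name to probability.
    """
    names = list(prototypes)
    result = []
    for row in feature_map:
        out_row = []
        for embedding in row:
            # A smaller distance to a prototype gives a higher score.
            scores = [
                -squared_distance(embedding, prototypes[name]) / temperature
                for name in names
            ]
            out_row.append(dict(zip(names, softmax(scores))))
        result.append(out_row)
    return result


# Toy data: a 2 x 2 feature map with 2-dimensional embeddings.
prototypes = {"cat": [1.0, 0.0], "dog": [0.0, 1.0]}
feature_map = [
    [[0.9, 0.1], [0.1, 0.9]],
    [[0.5, 0.5], [1.0, 0.0]],
]
probs = dense_metric_classify(feature_map, prototypes)
```

In this toy setup, every location receives its class probabilities from the same pass over the feature map, so several objects in one image are scored together instead of being classified one crop at a time.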
Pages: 514-525
Number of pages: 12
Related papers
50 in total
  • [1] σ-Adaptive Decoupled Prototype for Few-Shot Object Detection
    Du, Jinhao
    Zhang, Shan
    Chen, Qiang
    Le, Haifeng
    Sun, Yanpeng
    Ni, Yao
    Wang, Jian
    He, Bin
    Wang, Jingdong
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 18904 - 18914
  • [2] Feature reconstruction and metric based network for few-shot object detection
    Li, Yuewen
    Feng, Wenquan
    Lyu, Shuchang
    Zhao, Qi
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 227
  • [3] Few-Shot Object Detection via Metric Learning
    Zhu Min
    Zhang Chongyang
    FOURTEENTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2021), 2022, 12084
  • [4] Few-Shot Air Object Detection Network
    Cai, Wei
    Wang, Xin
    Jiang, Xinhao
    Yang, Zhiyong
    Di, Xingyu
    Gao, Weijie
    ELECTRONICS, 2023, 12 (19)
  • [5] Temporal Speciation Network for Few-Shot Object Detection
    Zhao, Xiaowei
    Liu, Xianglong
    Ma, Yuqing
    Bai, Shihao
    Shen, Yifan
    Hao, Zeyu
    Liu, Aishan
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 8267 - 8278
  • [6] Orthogonal Progressive Network for Few-shot Object Detection
    Wang, Bingxin
    Yu, Dehong
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 264
  • [7] FSMT: Few-shot object detection via Multi-Task Decoupled
    Qin, Jiahui
    Xu, Yang
    Fu, Yifan
    Wu, Zebin
    Wei, Zhihui
    PATTERN RECOGNITION LETTERS, 2025, 192 : 8 - 14
  • [8] DeFRCN: Decoupled Faster R-CNN for Few-Shot Object Detection
    Qiao, Limeng
    Zhao, Yuxuan
    Li, Zhiyuan
    Qiu, Xi
    Wu, Jianan
    Zhang, Chi
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 8661 - 8670
  • [9] Few-Shot Object Detection: A Survey
    Antonelli, Simone
    Avola, Danilo
    Cinque, Luigi
    Crisostomi, Donato
    Foresti, Gian Luca
    Galasso, Fabio
    Marini, Marco Raoul
    Mecca, Alessio
    Pannone, Daniele
    ACM COMPUTING SURVEYS, 2022, 54 (11S)
  • [10] Few-Shot Object Counting and Detection
    Thanh Nguyen
    Chau Pham
    Khoi Nguyen
    Minh Hoai
    COMPUTER VISION, ECCV 2022, PT XX, 2022, 13680 : 348 - 365