ECEA: Extensible Co-Existing Attention for Few-Shot Object Detection

被引:5
作者
Xin, Zhimeng [1 ]
Wu, Tianxu [2 ]
Chen, Shiming [2 ]
Zou, Yixiong [3 ]
Shao, Ling [4 ]
You, Xinge [1 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Cyber Sci & Engn, Wuhan 430074, Peoples R China
[2] Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan 430074, Peoples R China
[3] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, Wuhan 430074, Peoples R China
[4] Univ Chinese Acad Sci UCAS, UCAS Terminus AI Lab, Beijing 100101, Peoples R China
关键词
Training; Detectors; Object detection; Feature extraction; Task analysis; Semantics; Adaptation models; Few-shot object detection; extensible attention; co-existing regions; NETWORK;
D O I
10.1109/TIP.2024.3411771
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Few-shot object detection (FSOD) identifies objects from extremely few annotated samples. Most existing FSOD methods, recently, apply the two-stage learning paradigm, which transfers the knowledge learned from abundant base classes to assist the few-shot detectors by learning the global features. However, such existing FSOD approaches seldom consider the localization of objects from local to global. Limited by the scarce training data in FSOD, the training samples of novel classes typically capture part of objects, resulting in such FSOD methods being unable to detect the completely unseen object during testing. To tackle this problem, we propose an Extensible Co-Existing Attention (ECEA) module to enable the model to infer the global object according to the local parts. Specifically, we first devise an extensible attention mechanism that starts with a local region and extends attention to co-existing regions that are similar and adjacent to the given local region. We then implement the extensible attention mechanism in different feature scales to progressively discover the full object in various receptive fields. In the training process, the model learns the extensible ability on the base stage with abundant samples and transfers it to the novel stage of continuous extensible learning, which can assist the few-shot model to quickly adapt in extending local regions to co-existing regions. Extensive experiments on the PASCAL VOC and COCO datasets show that our ECEA module can assist the few-shot detector to completely predict the object despite some regions failing to appear in the training samples and achieve the new state-of-the-art compared with existing FSOD methods. Code is released at https://github.com/zhimengXin/ECEA.
引用
收藏
页码:5564 / 5576
页数:13
相关论文
共 50 条
[31]   Holistic Prototype Attention Network for Few-Shot Video Object Segmentation [J].
Tang, Yin ;
Chen, Tao ;
Jiang, Xiruo ;
Yao, Yazhou ;
Xie, Guo-Sen ;
Shen, Heng-Tao .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (08) :6699-6709
[32]   Text-Guided Distribution Calibration for Few-Shot Object Detection in Remote Sensing Images [J].
Cao, Yu ;
Chen, Jingyi ;
Wang, Haoyu ;
Zhang, Lei ;
Ding, Chen ;
Wei, Wei ;
Cao, Shiqi ;
Xie, Meilin .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2025, 18 :17671-17686
[33]   SGFNet: Structure-Guided Few-Shot Object Detection [J].
Ma, Jingkai ;
Bai, Shuang .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (04) :3209-3221
[34]   Few-Shot Object Counting and Detection [J].
Thanh Nguyen ;
Chau Pham ;
Khoi Nguyen ;
Minh Hoai .
COMPUTER VISION, ECCV 2022, PT XX, 2022, 13680 :348-365
[35]   Category Knowledge-Guided Parameter Calibration for Few-Shot Object Detection [J].
Chen, Chaofan ;
Yang, Xiaoshan ;
Zhang, Jinpeng ;
Dong, Bo ;
Xu, Changsheng .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 :1092-1107
[36]   Decoupled Metric Network for Single-Stage Few-Shot Object Detection [J].
Lu, Yue ;
Chen, Xingyu ;
Wu, Zhengxing ;
Yu, Junzhi .
IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (01) :514-525
[37]   Retentive Compensation and Personality Filtering for Few-Shot Remote Sensing Object Detection [J].
Wu, Jiashan ;
Lang, Chunbo ;
Cheng, Gong ;
Xie, Xingxing ;
Han, Junwei .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (07) :5805-5817
[38]   Multi-View Part-Based Few-Shot Object Detection [J].
Ma, Jingkai ;
Bai, Shuang .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (08) :14749-14763
[39]   Semantic Prototyping With CLIP for Few-Shot Object Detection in Remote Sensing Images [J].
Liu, Tianying ;
Zhou, Shuigeng ;
Li, Wengen ;
Zhang, Yichao ;
Guan, Jihong .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63
[40]   CAMCFormer: Cross-Attention and Multicorrelation Aided Transformer for Few-Shot Object Detection in Optical Remote Sensing Images [J].
Wang, Lefan ;
Mei, Shaohui ;
Wang, Yi ;
Lian, Jiawei ;
Han, Zonghao ;
Feng, Yan .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63