A comprehensive review of few-shot object detection on aerial imagery

Citations: 0
Authors
Nguyen, Khang [1, 2]
Huynh, Nhat-Thanh [1, 2]
Le, Duc-Thanh [1, 2]
Huynh, Dien-Thuc [1, 2]
Bui, Thi-Thanh-Trang [1, 2]
Dinh, Truong [1, 2]
Nguyen, Khanh-Duy [1, 2]
Nguyen, Tam V. [3]
Affiliations
[1] Univ Informat Technol, Ho Chi Minh City, Vietnam
[2] Vietnam Natl Univ, Ho Chi Minh City, Vietnam
[3] Univ Dayton, Dayton, OH USA
Keywords
Few-shot object detection; Object detection; Aerial imagery; Convolutional neural network; REMOTE-SENSING IMAGES; ORIENTED GRADIENTS; CLASSIFICATION; HISTOGRAMS; BENCHMARK
DOI
10.1016/j.cosrev.2025.100760
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
With the development of technology, drones and satellites play an important role in human life. Related research problems receive great attention, especially in the computer vision community. Notably, object detection models for aerial imagery are used in many applications in both civil and military domains. Although object detection has great potential and has achieved a great deal, it still faces many challenges, such as the small size and the quality of training datasets. The few-shot paradigm has been explored to tackle these challenges. In this paper, we intensively investigate 55 state-of-the-art few-shot object detection methods that use different learning styles, such as meta-learning and transfer learning. Moreover, we analyze 12 aerial imagery datasets and benchmark state-of-the-art methods on three popular datasets, namely DIOR, NWPU VHR-10, and DOTA. These datasets reflect the richness of classes and the complexity of real-world conditions. From the experimental results and analysis, we discuss insights and outline future directions for this research.
Pages: 38