Few-Shot Object Detection Algorithm Based on Geometric Prior and Attention RPN

Cited by: 1
Authors
Chen, Xiu [1]
Li, Yujie [1]
Lu, Huimin [2]
Affiliations
[1] Yangzhou Univ, Sch Informat Engn, Yangzhou, Jiangsu, Peoples R China
[2] Southeast Univ, Inst Adv Ocean Res Nantong, Sch Automat, Nanjing, Peoples R China
Source
20TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE, IWCMC 2024 | 2024
Keywords
Few-Shot Object Detection; Geometric Similarity; Synthetic Data Sets; Attention RPN
DOI
10.1109/IWCMC61514.2024.10592343
Chinese Library Classification (CLC) number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Intelligent factories driven by deep vision technology use robotic arms to perform tasks such as picking and assembly in production environments. In practice, as the products on an industrial production line change continuously, the detection model must be retrained repeatedly to adapt to new application scenarios. Manually collecting training data when deploying a production line is time-consuming and labor-intensive, which prevents rapid adaptation in industrial scenarios. We therefore propose a few-shot object detection algorithm that combines an attention RPN (Region Proposal Network) with a geometric prior. The algorithm uses the attention RPN module to strengthen the feature extraction ability of the detection model. It also uses virtual simulation software to generate synthetic data whose geometry resembles that of the real objects; this synthetic data serves as the base-class data for training the feature extraction network, so that the network learns to extract geometric features from base-class objects. Using a contrastive learning strategy, the detection model is then trained a second time with only a small number of real samples. Experimental results show that, with only a small amount of real sample data, the algorithm detects more objects in industrial scenes than existing few-shot object detection algorithms, and its detection accuracy can reach 97%.
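The abstract does not describe the internals of the attention RPN module. As a rough illustration only, the sketch below follows one formulation that is common in the few-shot detection literature: each channel of a support (example) feature map is average-pooled to a single value, and that value is used as a 1x1 depth-wise kernel to reweight the matching channel of the query feature map before region proposals are generated. The function names `channel_means` and `attention_features` and the toy feature values are hypothetical and are not taken from the paper.

```python
def channel_means(support_feat):
    """Average-pool each support channel (an h x w grid) to a single value."""
    return [
        sum(sum(row) for row in channel) / (len(channel) * len(channel[0]))
        for channel in support_feat
    ]


def attention_features(query_feat, support_feat):
    """Reweight each query channel by the pooled support value of that channel.

    With a 1x1 kernel, depth-wise correlation reduces to per-channel scaling.
    Both inputs are nested lists shaped [channels][rows][cols] and must have
    the same number of channels.
    """
    kernel = channel_means(support_feat)
    return [
        [[value * weight for value in row] for row in channel]
        for channel, weight in zip(query_feat, kernel)
    ]


# Toy example with 2 channels of 2x2 features (values are illustrative only).
query = [[[1.0, 2.0], [3.0, 4.0]], [[1.0, 1.0], [1.0, 1.0]]]
support = [[[2.0, 2.0], [2.0, 2.0]], [[0.0, 1.0], [1.0, 0.0]]]
attended = attention_features(query, support)
print(attended)
```

In a full detector, the reweighted map would be passed to the RPN head in place of the raw query features, so that proposals favor regions resembling the support object. That wiring is an assumption here, not a detail confirmed by the abstract.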
Pages: 706-711
Number of pages: 6