Binocular Vision-Based Target Detection Algorithm

被引:0
作者
Zhang, Huiguo [1 ]
机构
[1] Xian Univ Architecture & Technol, Xian 710055, Shaanxi, Peoples R China
来源
IEEE ACCESS | 2025年 / 13卷
关键词
Feature extraction; Object detection; Accuracy; Convolution; Target recognition; Training; Attention mechanisms; Laser radar; Classification algorithms; Radar detection; Targeted detection; complex environmental tasks; highly robust; Feffol;
D O I
10.1109/ACCESS.2025.3563965
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the field of target detection, algorithms are challenged with multi-objective optimization problems in identifying detection targets, and it is also crucial to improve the recognition of small and insignificant targets. In this study, the Feffol network model is proposed to achieve excellent performance in complex detection tasks. In the feature extraction stage, the Feffol network model carefully chooses Efficient-v2 as the backbone network structure, which can provide a solid foundation for subsequent detection by taking advantage of its efficient feature extraction capability. In addition, an Ebifpn feature pyramid module with an spatial pyramid pooling-cross-stage partial connections(SppCSP) structure was innovatively introduced. This design not only effectively expands the feature sensing field, but also greatly enhances the fusion and expression of feature information of different sizes, so that the model can capture the target features more comprehensively. In addition, to solve the key problems in the detection process, the Feffol model adopts the Focal Loss and CIoU Loss functions. The former can effectively balance the positive and negative samples to avoid over-learning of easy-to-classify samples during the training process; the latter successfully solves the problem of the failure of the detection method when there is no intersection between the prediction frame and the real detection frame, which significantly improves the robustness of the model. The experimental results show that the Feffol network model has excellent performance, with a detection accuracy as high as 90.09% and a detection speed of 9.75it/s. Compared with other mainstream networks, the detection accuracy of Feffol network has been significantly improved, which fully proves its advancement and effectiveness in the task of target detection, and provides new ideas and methods for the further development of this field.
引用
收藏
页码:73440 / 73450
页数:11
相关论文
共 19 条
[1]  
Caiwu L., 2020, Optoelectron. Eng., V47, P40
[2]  
[曹莹 Cao Ying], 2013, [自动化学报, Acta Automatica Sinica], V39, P745
[3]   Survey on SVM and their application in image classification [J].
Chandra M.A. ;
Bedi S.S. .
International Journal of Information Technology, 2021, 13 (5) :1-11
[4]   Robust detection method for improving small traffic sign recognition based on spatial pyramid pooling [J].
Dewi, Christine ;
Chen, Rung-Ching ;
Yu, Hui ;
Jiang, Xiaoyi .
JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 14 (7) :8135-8152
[5]   The Pascal Visual Object Classes (VOC) Challenge [J].
Everingham, Mark ;
Van Gool, Luc ;
Williams, Christopher K. I. ;
Winn, John ;
Zisserman, Andrew .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) :303-338
[6]   Effective Fusion Factor in FPN for Tiny Object Detection [J].
Gong, Yuqi ;
Yu, Xuehui ;
Ding, Yao ;
Peng, Xiaoke ;
Zhao, Jian ;
Han, Zhenjun .
2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, :1159-1167
[7]  
Jiayin L., 2017, Robotics, V39, P638
[8]  
Mukhoti J., 2020, Adv. Neural Inf. Process. Syst, V33, P15288, DOI DOI 10.48550/ARXIV.2002.09437
[9]  
Pei W., 2017, Comput. Eng., V43, P303
[10]  
Prisacariu V., 2009, Tech. Rep. 2310/09