Research on Object Detection Algorithm Based on Coordinate Information Fusion and Spatial Perception

Cited by: 0

Authors:

Liuha, Youyong [1]

Luo, Liang [1]

Wang, Xusheng [1]

Zheng, Yuansheng [1]

Affiliation:

[1] Southwest Univ Sci & Technol, Sch Informat Engn, Mianyang, Sichuan, Peoples R China

Source:

2024 5TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND APPLICATION, ICCEA 2024 | 2024

Keywords:

Object detection; CoordConv; RoIAlign; Deep learning

DOI:

10.1109/ICCEA62105.2024.10604031

Chinese Library Classification (CLC) number:

TP39 [Computer applications]

Subject classification code:

081203; 0835

Abstract:

The widespread adoption of deep learning in object detection has increased the demand for higher detection precision and spatial fidelity. The proposed YOLO-CA detection algorithm incorporates two techniques: CoordConv and RoIAlign. First, CoordConv feeds the coordinates of pixel locations into the convolutional neural network, enabling the network to better capture pixel-wise spatial relationships and thereby improving performance on tasks that involve spatial structure. Second, RoIAlign is used to address the pixel misalignment that RoIPool can introduce, improving the spatial accuracy of the regions of interest. Experiments on the VOC dataset show a 0.025 increase in mAP for the refined model that incorporates both CoordConv and RoIAlign. These results indicate that the two techniques improve detection accuracy and spatial precision, and they offer strategies of both theoretical and practical value for deep-learning-based object detection.
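The two techniques named in the abstract can be illustrated with a minimal, dependency-free sketch. It is not the YOLO-CA implementation, which this record does not describe. The function names (`add_coord_channels`, `bilinear_sample`, `roi_align`), the nested-list tensor layout, the [-1, 1] coordinate normalization, and the convention that pixel centers sit at integer indices are all assumptions made for illustration.

```python
import math


def add_coord_channels(feature_map):
    """Append normalized x and y coordinate channels to a C x H x W map.

    feature_map is a nested list (channels -> rows -> values). In CoordConv
    the extended map would then be passed to an ordinary convolution.
    """
    height = len(feature_map[0])
    width = len(feature_map[0][0])

    def norm(i, n):
        # Map index 0..n-1 to [-1, 1]; a single row or column maps to 0.
        return 0.0 if n == 1 else 2.0 * i / (n - 1) - 1.0

    x_channel = [[norm(x, width) for x in range(width)] for _ in range(height)]
    y_channel = [[norm(y, height) for _ in range(width)] for y in range(height)]
    return feature_map + [x_channel, y_channel]


def bilinear_sample(channel, y, x):
    """Bilinearly interpolate one H x W channel at a fractional (y, x)."""
    height, width = len(channel), len(channel[0])
    # Clamp the sample point to the valid index range.
    y = min(max(y, 0.0), height - 1.0)
    x = min(max(x, 0.0), width - 1.0)
    y0, x0 = int(math.floor(y)), int(math.floor(x))
    y1, x1 = min(y0 + 1, height - 1), min(x0 + 1, width - 1)
    dy, dx = y - y0, x - x0
    top = channel[y0][x0] * (1 - dx) + channel[y0][x1] * dx
    bottom = channel[y1][x0] * (1 - dx) + channel[y1][x1] * dx
    return top * (1 - dy) + bottom * dy


def roi_align(channel, roi, out_size=2, samples=2):
    """Pool a region of interest without rounding its coordinates.

    roi is (y1, x1, y2, x2) in feature-map coordinates and may be fractional.
    Each output bin averages samples x samples bilinear samples, whereas
    RoIPool would first snap the region and bin edges to integers.
    """
    y1, x1, y2, x2 = roi
    bin_h = (y2 - y1) / out_size
    bin_w = (x2 - x1) / out_size
    output = []
    for by in range(out_size):
        row = []
        for bx in range(out_size):
            total = 0.0
            for sy in range(samples):
                for sx in range(samples):
                    yy = y1 + (by + (sy + 0.5) / samples) * bin_h
                    xx = x1 + (bx + (sx + 0.5) / samples) * bin_w
                    total += bilinear_sample(channel, yy, xx)
            row.append(total / (samples * samples))
        output.append(row)
    return output
```

On a feature channel whose value equals its column index, `roi_align` with a fractional region returns the fractional bin centers unchanged. This is the sub-pixel information that integer quantization in RoIPool would discard.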


Pages: 1289-1293

Page count: 5

