Research on Object Detection Algorithm Based on Coordinate Information Fusion and Spatial Perception

被引:0
作者
Liuha, Youyong [1 ]
Luo, Liang [1 ]
Wang, Xusheng [1 ]
Zheng, Yuansheng [1 ]
机构
[1] Southwest Univ Sci & Technol, Sch Informat Engn, Mianyang, Sichuan, Peoples R China
来源
2024 5TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND APPLICATION, ICCEA 2024 | 2024年
关键词
Object detection; CoordConv; RoIAlign; Deep learning;
D O I
10.1109/ICCEA62105.2024.10604031
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The extensive integration of deep learning within the domain of object detection has escalated the necessity for enhancing detection precision and spatial fidelity. The proposed YOLO-CA detection algorithm introduces two groundbreaking techniques: CoordConv and RoIAlign. Initially, CoordConv integrates the coordinate data of pixel locations into the convolutional neural network architecture, enabling the network to more effectively comprehend pixel-wise spatial relationships, thereby boosting performance in tasks involving spatial structures. Following this, RoIAlign is employed to address potential pixel misalignment issues associated with RoIPool, thereby escalating the spatial accuracy of the regions of interest. Empirical findings on the VOC dataset demonstrate a 0.025 increase in the mAP score for the refined model incorporating both CoordConv and RoIAlign, signifying substantial improvements in object detection performance. These innovations have been empirically validated to boost detection accuracy and spatial precision, offering novel insights and strategies for deep learning algorithms within the object detection realm, carrying both theoretical significance and practical implications.
引用
收藏
页码:1289 / 1293
页数:5
相关论文
共 17 条
  • [1] Object Detection Using Deep Learning, CNNs and Vision Transformers: A Review
    Amjoud, Ayoub Benali
    Amrouch, Mustapha
    [J]. IEEE ACCESS, 2023, 11 : 35479 - 35516
  • [2] Bochkovskiy A, 2020, Arxiv, DOI arXiv:2004.10934
  • [3] Boukabous M., 2023, Bull. Electr. Eng. Inform., V12, P1630
  • [4] Real-time construction demolition waste detection using state-of-the-art deep learning methods; single-stage vs two-stage detectors
    Demetriou, Demetris
    Mavromatidis, Pavlos
    Robert, Ponsian M.
    Papadopoulos, Harris
    Petrou, Michael F.
    Nicolaides, Demetris
    [J]. WASTE MANAGEMENT, 2023, 167 : 194 - 203
  • [5] He J., 2021, P 35 C NEUR INF PROC, P20230
  • [6] ISTDU-Net: Infrared Small-Target Detection U-Net
    Hou, Qingyu
    Zhang, Liuwei
    Tan, Fanjiao
    Xi, Yuyang
    Zheng, Haoliang
    Li, Na
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [7] Jocher G., CODE REPOSITORY
  • [8] A comprehensive review of object detection with deep learning
    Kaur, Ravpreet
    Singh, Sarbjeet
    [J]. DIGITAL SIGNAL PROCESSING, 2023, 132
  • [9] Krishnachaithanya N., 2023, 2023 International Conference on Computational Intelligence and Sustainable Engineering Solutions (CISES), P784, DOI 10.1109/CISES58720.2023.10183503
  • [10] A Review on Deep Learning-Based Approaches for Automatic Sonar Target Recognition
    Neupane, Dhiraj
    Seok, Jongwon
    [J]. ELECTRONICS, 2020, 9 (11) : 1 - 30