Target positioning method based on B-spline level set and GC Yolo-v3

被引:0
作者
Zhang, Lin [1 ,3 ]
Ji, Lizhen [2 ,3 ]
Ma, Zongfang [1 ,3 ]
机构
[1] Xian Univ Architecture & Technol, Dept Comp Sci & Technol, 13 Yanta Rd Middle Sect, Xian 710055, Shaanxi, Peoples R China
[2] Xi An Jiao Tong Univ, Sch Math & Stat, 28 Xianning West Rd, Xian 710049, Shaanxi, Peoples R China
[3] Xian Univ Architecture & Technol, Dept Artificial Intelligence & Automat Syst, 13 Yanta Rd Middle Sect, Xian 710055, Shaanxi, Peoples R China
来源
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS | 2024年 / 27卷 / 03期
基金
中国国家自然科学基金;
关键词
Object detection; Deep learning; Target positioning; Evaluation index;
D O I
10.1007/s10586-023-04164-x
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In industrial applications, mobile robot navigation involves autonomously moving towards specific target areas. Visual sensors are commonly used to identify and locate the target, enabling actions like manipulator grasping. However, when using the Yolo series for target positioning, two challenges arise. The first challenge is that the evaluation index Intersection over Union (IOU) only provides limited information and does not fully express the confidence in the target position. To overcome this, the Yolo layer utilizes a complete IOU as its loss function, which considers factors such as overlap, distance, and aspect ratio. Additionally, a Gaussian model layer is introduced between the non-maximum suppression layer and the output layer to estimate the coordinate uncertainty. The second challenge is that the prediction framework for target positioning lacks accuracy, resulting in significant deviations from the ground truth. To improve precision, this paper proposes combining the B-spline level set method for target segmentation. This technique adjusts the deviation from the minimum rectangular box surrounding the target. The proposed global optimization segmentation model, based on B-spline, effectively handles local optimal values in non-convex and traditional variational segmentation models. Experimental results demonstrate that: In the VOC dataset, compared with the Yolo-v3 method and Faster R-CNN method, the mAP of the GC Yolo-v3 method is increased by 2.95% and 5.85%, respectively, and the recall value is increased by 0.03 and 0.02, respectively. Compared with the Yolo-v3 method in VOC dataset, the average IOU value of the GC Yolo-v3 method is increased by 8.31%. In the real dataset, compared with the Yolo-v3 method and the Faster R-CNN method, the mAP of the GC Yolo-v3 method is increased by 3.32% and 7.41% respectively, and the recall value is increased by 0.08 and 0.05 respectively. The average IOU value of the GC Yolo-v3 method is 6.35% higher than that of the Yolo-v3 method in real dataset.
引用
收藏
页码:3575 / 3591
页数:17
相关论文
共 22 条
  • [1] Bochkovskiy A., ARXIV200410934
  • [2] Algorithms for finding global minimizers of image segmentation and denoising models
    Chan, Tony F.
    Esedoglu, Selim
    Nikolova, Mila
    [J]. SIAM JOURNAL ON APPLIED MATHEMATICS, 2006, 66 (05) : 1632 - 1648
  • [3] Gevorgyan Zhora, 2022, SIoU loss: More powerful learning for bounding box regression
  • [4] He K, 2017, IEEE INT WORKSH MULT
  • [5] Joseph RK, 2016, CRIT POL ECON S ASIA, P1
  • [6] Li C., 2022, arXiv, Vabs/2209.0
  • [7] Li J., 2017, IEEE T MULTIMED
  • [8] An improved target detection method based on multiscale features fusion
    Lu, Liping
    Li, Hanshan
    Ding, Zhe
    Guo, Quanmin
    [J]. MICROWAVE AND OPTICAL TECHNOLOGY LETTERS, 2020, 62 (09) : 3051 - 3059
  • [9] Lu Pengsen, 2023, 2023 IEEE 2nd International Conference on Electrical Engineering, Big Data and Algorithms (EEBDA), P547, DOI 10.1109/EEBDA56825.2023.10090701
  • [10] Luo Q., 2018, 3D SSD LEARNING HIER