Research on Real-time Detection of Stacked Objects Based on Deep Learning

被引:3
作者
Geng, Kaiguo [1 ,2 ]
Qiao, Jinwei [1 ,2 ]
Liu, Na [1 ,2 ]
Yang, Zhi [1 ,2 ]
Zhang, Rongmin [1 ,2 ]
Li, Huiling [3 ]
机构
[1] Qilu Univ Technol, Shandong Acad Sci, Sch Mech & Automot Engn, Jinan 250353, Peoples R China
[2] Shandong Inst Mech Design & Res, Jinan 250353, Peoples R China
[3] Shandong Inst Innovat & Dev, Jinan 250101, Peoples R China
关键词
Stacked objects detection; Computer vision; Deep learning; One stage; Convolutional neural networks; SEGMENTATION; NETWORK; NMS;
D O I
10.1007/s10846-023-02009-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep Learning has garnered significant attention in the field of object detection and is widely used in both industry and everyday life. The objective of this study is to investigate the applicability and targeted improvements of Deep Learning-based object detection in complex stacked environments. We analyzed the limitations in practical applications under such conditions, pinpointed the specific problems, and proposed corresponding improvement strategies. First, the study provided an overview of recent advancements in mainstream one-stage object detection algorithms, which included Anchor-based, Anchor-free, and Transformer-based architectures. The high real-time performance of these algorithms holds particular significance in practical engineering applications. It then looked at relevant technologies in three emerging research areas: Parts Recognition, Intelligent Driving, and Agricultural Picking. The study summarized existing limitations in real-time object detection within complex stacked environments and provided a comprehensive analysis of prevalent improvement strategies such as multi-level feature fusion, knowledge distillation, and hyperparameter optimization. Finally, after analyzing the performance of recent advanced one-stage algorithms on official datasets, this paper conducted empirical tests on a self-constructed industrial stacked dataset with algorithms of different structure and analyzed the experimental results in detail. A comprehensive analysis shows that Deep Learning-based object detection algorithms offer extensive applicability in complex stacked environments. In addressing diverse target sizes, overlapping occlusions, real-time constraints, and the need for lightweight solutions in complex stacked environments, each improvement strategy has its own advantages and limitations. Selecting and integrating appropriate enhancement strategies is critical and typically requires holistic evaluation, tailored to specific application contexts and challenges.
引用
收藏
页数:36
相关论文
共 205 条
[1]  
[Anonymous], 2023, Ultralytics: ultralytics's official github repository
[2]   YOLOv5 with ConvMixer Prediction Heads for Precise Object Detection in Drone Imagery [J].
Baidya, Ranjai ;
Jeong, Heon .
SENSORS, 2022, 22 (21)
[3]  
Bay H., 2006, EUROPEAN C COMPUTER, P1
[4]   Real-time vehicle detection algorithm based on a lightweight You-Only-Look-Once (YOLOv5n-L) approach [J].
Bie, Minglin ;
Liu, Yanyan ;
Li, Guoning ;
Hong, Jintao ;
Li, Jin .
EXPERT SYSTEMS WITH APPLICATIONS, 2023, 213
[5]  
Bochkovskiy A, 2020, Arxiv, DOI [arXiv:2004.10934, DOI 10.48550/ARXIV.2004.10934]
[6]   Soft-NMS - Improving Object Detection With One Line of Code [J].
Bodla, Navaneeth ;
Singh, Bharat ;
Chellappa, Rama ;
Davis, Larry S. .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :5562-5570
[7]  
Bouthillier X, 2021, Arxiv, DOI [arXiv:2103.03098, DOI 10.48550/ARXIV.2103.03098]
[8]  
Broy M., 1992, Software Pioneers, P10
[10]  
Canziani Alfredo, 2016, arXiv