Develop a novel deep learning-based framework using convolutional neural networks for real-time object detection and tracking in embedded systems

被引:0
作者
Zhang, Kai [1 ]
机构
[1] Zhengzhou Railway Vocat & Tech Coll, Innovat & Entrepreneurship Coll, 298 Tonghui Rd, Zhengzhou 451460, Henan, Peoples R China
关键词
embedded systems; real-time object detection; object tracking; Botox Optimization Algorithm-tuned Adaptive CNN (BOA-ACNN);
D O I
10.1177/14727978251346034
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Real-time object detection and tracking are critical for applications such as robotics and autonomous systems. Embedded platforms present challenges in balancing speed, accuracy, and efficiency. Existing approaches often struggle to achieve both high accuracy and real-time performance within resource-constrained embedded systems. The main challenge remains in balancing detection speed, tracking consistency, and hardware efficiency for practical deployment. This work proposes a deep learning (DL) framework optimized for embedded systems, ensuring high accuracy, minimal latency, and efficient resource utilization for real-world applications. The framework integrates a novel Botox Optimization Algorithm-tuned Adaptive CNN (BOA-ACNN) for real-time object recognition and tracking. The dataset comprises annotated video sequences capturing diverse scenarios involving vehicles, pedestrians, and dynamic camera movements. The framework employs a Kalman filter for real-time motion prediction and noise smoothing, thereby enhancing tracking stability. Additionally, SIFT features are utilized to improve detection robustness under varying scales and environmental conditions. The system incorporates BOA for hyper-parameter fine-tuning and ACNN for efficient real-time detection and tracking, achieving latency of 97.5 mu s, throughput of 200.4 activation/us, precision of 96.2%, recall of 97%, F1-score of 97%, mAP of 98.4%, and overall accuracy of 98.97%. This framework facilitates real-time object identification and tracking with high accuracy and low latency on embedded devices, demonstrating superior performance for practical applications.
引用
收藏
页数:17
相关论文
共 28 条
[1]   A novel real-time multiple objects detection and tracking framework for different challenges [J].
Abdulghafoor, Nuha H. ;
Abdullah, Hadeel N. .
ALEXANDRIA ENGINEERING JOURNAL, 2022, 61 (12) :9637-9647
[2]   IoT Enabled Deep Learning Based Framework for Multiple Object Detection in Remote Sensing Images [J].
Ahmed, Imran ;
Ahmad, Misbah ;
Chehri, Abdellah ;
Hassan, Mohammad Mehedi ;
Jeon, Gwanggil .
REMOTE SENSING, 2022, 14 (16)
[3]   FPGA-Based Real-Time Object Detection and Classification System Using YOLO for Edge Computing [J].
Al Amin, Rashed ;
Hasan, Mehrab ;
Wiese, Veit ;
Obermaisser, Roman .
IEEE ACCESS, 2024, 12 :73268-73278
[4]   Lightweight and computationally faster Hypermetropic Convolutional Neural Network for small size object detection [J].
Amudhan, A. N. ;
Sudheer, A. P. .
IMAGE AND VISION COMPUTING, 2022, 119
[5]   Core outcomes measures in dental computer vision studies (DentalCOMS) [J].
Buettner, Martha ;
Rokhshad, Rata ;
Brinz, Janet ;
Issa, Julien ;
Chaurasia, Akhilanand ;
Uribe, Sergio E. ;
Karteva, Teodora ;
Chala, Sanaa ;
Tichy, Antonin ;
Schwendicke, Falk .
JOURNAL OF DENTISTRY, 2024, 150
[6]   Benchmarking Object Detection Deep Learning Models in Embedded Devices [J].
Cantero, David ;
Esnaola-Gonzalez, Iker ;
Miguel-Alonso, Jose ;
Jauregi, Ekaitz .
SENSORS, 2022, 22 (11)
[7]   A large scale training sample database system for intelligent interpretation of remote sensing imagery [J].
Cao, Zhipeng ;
Jiang, Liangcun ;
Yue, Peng ;
Gong, Jianya ;
Hu, Xiangyun ;
Liu, Shuaiqi ;
Tan, Haofeng ;
Liu, Chang ;
Shangguan, Boyi ;
Yu, Dayu .
GEO-SPATIAL INFORMATION SCIENCE, 2024, 27 (05) :1489-1508
[8]   Using Physical Dynamics: Accurate and Real-Time Object Detection for High-Resolution Video Streaming on Internet of Things Devices [J].
Cao, Zhiqiang ;
Cheng, Yun ;
Hu, Youbing ;
Lu, Anqi ;
Liu, Jie ;
Li, Zhijun .
IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (12) :22494-22507
[9]   A robust multiclass 3D object recognition based on modern YOLO deep learning algorithms [J].
Francies, Mariam L. ;
Ata, Mohamed M. ;
Mohamed, Mohamed A. .
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (01)
[10]   LLM-Based Edge Intelligence: A Comprehensive Survey on Architectures, Applications, Security and Trustworthiness [J].
Friha, Othmane ;
Amine Ferrag, Mohamed ;
Kantarci, Burak ;
Cakmak, Burak ;
Ozgun, Arda ;
Ghoualmi-Zine, Nassira .
IEEE OPEN JOURNAL OF THE COMMUNICATIONS SOCIETY, 2024, 5 :5799-5856