An Improved Lightweight Network Using Attentive Feature Aggregation for Object Detection in Autonomous Driving

被引:2
作者
Kalgaonkar, Priyank [1 ]
El-Sharkawy, Mohamed [1 ]
机构
[1] Purdue Sch Engn & Technol, Dept Elect & Comp Engn, Indianapolis, IN 46254 USA
关键词
MobileNetV3; lightweight; object detection; autonomous driving; PyTorch; NXP;
D O I
10.3390/jlpea13030049
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Object detection, a more advanced application of computer vision than image classification, utilizes deep neural networks to predict objects in an input image and determine their locations through bounding boxes. The field of artificial intelligence has increasingly focused on the demands of autonomous driving, which require both high accuracy and fast inference speeds. This research paper aims to address this demand by introducing an efficient lightweight network for object detection specifically designed for self-driving vehicles. The proposed network, named MobDet3, incorporates a modified MobileNetV3 as its backbone, leveraging its lightweight convolutional neural network algorithm to extract and aggregate image features. Furthermore, the network integrates altered techniques in computer vision and adjusts to the most recent iteration of the PyTorch framework. The MobDet3 network enhances not only object positioning ability but also the reusability of feature maps across different scales. Extensive evaluations were conducted to assess the effectiveness of the proposed network, utilizing an autonomous driving dataset, as well as large-scale everyday human and object datasets. These evaluations were performed on NXP BlueBox 2.0, an advanced edge development platform designed for autonomous vehicles. The results demonstrate that the proposed lightweight object detection network achieves a mean precision of up to 58.30% on the BDD100K dataset and a high inference speed of up to 88.92 frames per second on NXP BlueBox 2.0, making it well-suited for real-time object detection in autonomous driving applications.
引用
收藏
页数:19
相关论文
共 44 条
[1]  
Aizenberg I.N., 2000, MULTIVALUED UNIVERSA
[2]  
[Anonymous], 2014, P IEEE C COMP VIS PA
[3]   Adaptive importance sampling to accelerate training of a neural probabilistic language model [J].
Bengio, Yoshua ;
Senecal, Jean-Sebastien .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2008, 19 (04) :713-722
[4]  
BERNERS-LEE C.M., 1968, Nature, V219, P202, DOI [10.1038/219202b0, 10.1038/219202B0, DOI 10.1038/219202B0]
[5]  
Cai Z., 2017, P IEEE C COMP VIS PA, P6154
[6]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[7]  
Cureton C., 2019, Bluebox Deep Dive-NXP's AD Processing Platform
[8]  
Dai J, 2016, PROCEEDINGS 2016 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY (ICIT), P1796, DOI 10.1109/ICIT.2016.7475036
[9]  
docs.ultralytics, YOLOv5 Documentation
[10]  
dspace, RTMaps Development and Validation Environment for Multisensor Applications (ADAS/AD, Robotics, etc.)