FPGA-oriented lightweight multi-modal free-space detection network

被引:1
作者
Fang, Feiyi [1 ]
Mao, Junzhu [1 ]
Yu, Wei [2 ]
Lu, Jianfeng [1 ,3 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing, Peoples R China
[2] Beijing RICH AI Informat Technol Co Ltd, Beijing, Peoples R China
[3] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Peoples R China
关键词
Pruning; free-space detection; lightweight network; multi-modal learning; FPGA;
D O I
10.1080/09540091.2022.2159333
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
For autonomous vehicles, free-space detection is an essential part of visual perception. With the development of multi-modal convolutional neural networks (CNNs) in recent years, the performance of driving scene semantic segmentation algorithms has been dramatically improved. Therefore most free-space detection algorithms are developed based on multiple sensors. However, multi-modal CNNs have high data throughput and contain a large number of computationally intensive convolution calculations, limiting their feasibility for real-time applications. Field Programmable Gate Arrays (FPGAs) provide a unique combination of flexibility, performance, and low power for these problems to accommodate multi-modal data and the computational acceleration of different compression algorithms. Network lightweight methods offer great assurance for facilitating the deployment of CNNs on such resource-constrained devices. In this paper, we propose a network lightweight method for a multi-modal free-space detection algorithm. We first propose an FPGA-friendly multi-modal free-space detection lightweight network. It comprises operators that FPGA prefers and achieves a 95.54% MaxF score on the test set of KITTI-Road free-space detection tasks and 81 ms runtime when running on 700 W GPU devices. Then we present a pruning approach for this network to reduce the number of parameters in case the complete model exceeds the FPGA chip memory. The pruning is in two parts. For the feature extractors, we propose a data-dependent filter pruner according to the principle that the low-rank feature map contains less information. To not compromise the integrity of the multi-modal information, the pruner is independent for each modality. For the segmentation decoder, we apply a channel pruning approach to remove redundant parameters. Finally, we implement our designs on an FPGA board using 8-bit quantisation, and the accelerator achieves outstanding performance. A real-time application of scene segmentation on KITTI-Road is used to evaluate our algorithm, and the model achieves a 94.39% MaxF score and minimum 14 ms runtime on 20W FPGA devices.
引用
收藏
页数:21
相关论文
共 71 条
[1]  
[Anonymous], 2018, CORR
[2]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[3]   RoadNet-RT: High Throughput CNN Architecture and SoC Design for Real-Time Road Segmentation [J].
Bai, Lin ;
Lyu, Yecheng ;
Huang, Xinming .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2021, 68 (02) :704-714
[4]   Explainable deep learning for efficient and robust pattern recognition: A survey of recent developments [J].
Bai, Xiao ;
Wang, Xiang ;
Liu, Xianglong ;
Liu, Qiang ;
Song, Jingkuan ;
Sebe, Nicu ;
Kim, Been .
PATTERN RECOGNITION, 2021, 120
[5]   LIDAR-camera fusion for road detection using fully convolutional neural networks [J].
Caltagirone, Luca ;
Bellone, Mauro ;
Svensson, Lennart ;
Wande, Mattias .
ROBOTICS AND AUTONOMOUS SYSTEMS, 2019, 111 :125-131
[6]   "Learning-Compression" Algorithms for Neural Net Pruning [J].
Carreira-Perpinan, Miguel A. ;
Idelbayev, Yerlan .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :8532-8541
[7]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[8]  
Chen X., 2020, ARXIV
[9]   Progressive LiDAR Adaptation for Road Detection [J].
Chen, Zhe ;
Zhang, Jing ;
Tao, Dacheng .
IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2019, 6 (03) :693-702
[10]   RBNet: A Deep Neural Network for Unified Road and Road Boundary Detection [J].
Chen, Zhe ;
Chen, Zijing .
NEURAL INFORMATION PROCESSING, ICONIP 2017, PT I, 2017, 10634 :677-687