A Low-Rank CNN Architecture for Real-Time Semantic Segmentation in Visual SLAM Applications

被引:7
作者
Falaschetti, Laura [1 ]
Manoni, Lorenzo [1 ]
Turchetti, Claudio [1 ]
机构
[1] Univ Politecn Marche, Dept Informat Engn, I-60131 Ancona, Italy
来源
IEEE OPEN JOURNAL OF CIRCUITS AND SYSTEMS | 2022年 / 3卷
关键词
Semantics; Computer architecture; Image segmentation; Simultaneous localization and mapping; Convolutional neural networks; Task analysis; Visualization; Embedded systems; semantic segmentation; smart robots; smart vehicles; tensor decomposition; TENSOR DECOMPOSITIONS; RECOGNITION; NETWORK;
D O I
10.1109/OJCAS.2022.3174632
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Real-time semantic segmentation on embedded devices has recently enjoyed significant gain in popularity, due to the increasing interest in smart vehicles and smart robots. In particular, with the emergence of autonomous driving, low latency and computation-intensive operations lead to new challenges for vehicles and robots, such as excessive computing power and energy consumption. The aim of this paper is to address semantic segmentation, one of the most critical tasks for the perception of the environment, and its implementation in a low power core, by preserving the required performance of accuracy and low complexity. To reach this goal a low-rank convolutional neural network (CNN) architecture for real-time semantic segmentation is proposed. The main contributions of this paper are: i) a tensor decomposition technique has been applied to the kernel of a generic convolutional layer, ii) three versions of an optimized architecture, that combines UNet and ResNet models, have been derived to explore the trade-off between model complexity and accuracy, iii) the low-rank CNN architectures have been implemented in a Raspberry Pi 4 and NVIDIA Jetson Nano 2 GB embedded platforms, as severe benchmarks to meet the low-power, low-cost requirements, and in the high-cost GPU NVIDIA Tesla P100 PCIe 16 GB to meet the best performance.
引用
收藏
页码:115 / 133
页数:19
相关论文
共 115 条
[1]  
Aldegheri S, 2019, IEEE INT C INT ROBOT, P5370, DOI [10.1109/IROS40897.2019.8967814, 10.1109/iros40897.2019.8967814]
[2]   Error Preserving Correction: A Method for CP Decomposition at a Target Error Bound [J].
Anh-Huy Phan ;
Tichavsky, Petr ;
Cichocki, Andrzej .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2019, 67 (05) :1175-1190
[3]  
[Anonymous], 2016, ROB SCI SYST RSS 201
[4]  
[Anonymous], 2014, COMPUT RES REPOSITOR
[5]  
[Anonymous], 2009, IEEE I CONF COMP VIS, DOI 10.1109/ICCV.2009.5459175
[6]  
[Anonymous], 2016, ENET DEEP NEURAL NET
[7]  
[Anonymous], 2015, Advances in neural information processing systems
[8]  
[Anonymous], MOTION BASED SEGMENT
[9]  
[Anonymous], 1988, Advances in Neural Information Processing Systems
[10]  
[Anonymous], 2016, ARXIV160404339