LASNet: A Light-Weight Asymmetric Spatial Feature Network for Real-Time Semantic Segmentation

被引:5
作者
Chen, Yu [1 ]
Zhan, Weida [1 ]
Jiang, Yichun [1 ]
Zhu, Depeng [1 ]
Guo, Renzhong [1 ]
Xu, Xiaoyu [1 ]
机构
[1] Changchun Univ Sci & Technol, Natl Demonstrat Ctr Expt Elect, Sch Elect & Informat Engn, Changchun 130022, Peoples R China
关键词
asymmetric convolution; real-time semantic segmentation; attention mechanism; residual unit; FEATURE FUSION NETWORK;
D O I
10.3390/electronics11193238
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, deep learning models have achieved great success in the field of semantic segmentation, which achieve satisfactory performance by introducing a large number of parameters. However, this achievement usually leads to high computational complexity, which seriously limits the deployment of semantic segmented applications on mobile devices with limited computing and storage resources. To address this problem, we propose a lightweight asymmetric spatial feature network (LASNet) for real-time semantic segmentation. We consider the network parameters, inference speed, and performance to design the structure of LASNet, which can make the LASNet applied to embedded devices and mobile devices better. In the encoding part of LASNet, we propose the LAS module, which retains and utilize spatial information. This module uses a combination of asymmetric convolution, group convolution, and dual-stream structure to reduce the number of network parameters and maintain strong feature extraction ability. In the decoding part of LASNet, we propose the multivariate concatenate module to reuse the shallow features, which can improve the segmentation accuracy and maintain a high inference speed. Our network attains precise real-time segmentation results in a wide range of experiments. Without additional processing and pre-training, LASNet achieves 70.99% mIoU and 110.93 FPS inference speed in the CityScapes dataset with only 0.8 M model parameters.
引用
收藏
页数:18
相关论文
共 51 条
[1]  
[Anonymous], 2015, P IEEE C COMP VIS PA
[2]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[3]   Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].
Chen, Liang-Chieh ;
Zhu, Yukun ;
Papandreou, George ;
Schroff, Florian ;
Adam, Hartwig .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851
[4]  
Chen LB, 2017, IEEE INT SYMP NANO, P1, DOI 10.1109/NANOARCH.2017.8053709
[5]   RGAM: A novel network architecture for 3D point cloud semantic segmentation in indoor scenes [J].
Chen, Xue-Tao ;
Li, Ying ;
Fan, Jia-Hao ;
Wang, Rui .
INFORMATION SCIENCES, 2021, 571 :87-103
[6]   The Cityscapes Dataset for Semantic Urban Scene Understanding [J].
Cordts, Marius ;
Omran, Mohamed ;
Ramos, Sebastian ;
Rehfeld, Timo ;
Enzweiler, Markus ;
Benenson, Rodrigo ;
Franke, Uwe ;
Roth, Stefan ;
Schiele, Bernt .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3213-3223
[7]   Knowledge and Spatial Pyramid Distance-Based Gated Graph Attention Network for Remote Sensing Semantic Segmentation [J].
Cui, Wei ;
He, Xin ;
Yao, Meng ;
Wang, Ziwei ;
Hao, Yuanjie ;
Li, Jie ;
Wu, Weijie ;
Zhao, Huilin ;
Xia, Cong ;
Li, Jin ;
Cui, Wenqi .
REMOTE SENSING, 2021, 13 (07)
[8]   Real-Time High-Performance Semantic Image Segmentation of Urban Street Scenes [J].
Dong, Genshun ;
Yan, Yan ;
Shen, Chunhua ;
Wang, Hanzi .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (06) :3258-3274
[9]   DSANet: Dilated spatial attention for real-time semantic segmentation in urban street scenes [J].
Elhassan, Mohammed A. M. ;
Huang, Chenxi ;
Yang, Chenhui ;
Munea, Tewodros Legesse .
EXPERT SYSTEMS WITH APPLICATIONS, 2021, 183
[10]   Interactive Few-Shot Learning: Limited Supervision, Better Medical Image Segmentation [J].
Feng, Ruiwei ;
Zheng, Xiangshang ;
Gao, Tianxiang ;
Chen, Jintai ;
Wang, Wenzhe ;
Chen, Danny Z. ;
Wu, Jian .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2021, 40 (10) :2575-2588