BSSNet: A Real-Time Semantic Segmentation Network for Road Scenes Inspired From AutoEncoder

被引：9

作者：

Shi, Xiaoqiang ^{[1
,2
,3
]}

Yin, Zhenyu ^{[2
,3
]}

Han, Guangjie ^{[4
]}

Liu, Wenzhuo ^{[5
]}

Qin, Li ^{[1
,2
,3
]}

Bi, Yuanguo ^{[6
]}

Li, Shurui ^{[7
]}

机构：

[1] Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing 100049, Peoples R China

[2] Chinese Acad Sci, Shenyang Inst Comp Technol, Shenyang 110168, Peoples R China

[3] Liaoning Key Lab Domest Ind Control Platform Techn, Shenyang 110168, Peoples R China

[4] Hohai Univ, Dept Internet Things Engn, Changzhou 213022, Peoples R China

[5] China Univ Min & Technol Beijing, Sch Artificial Intelligence, Beijing 100083, Peoples R China

[6] Northeastern Univ, Sch Comp Sci & Engn, Shenyang 110167, Peoples R China

[7] Shenyang Aerosp Univ, Sch Comp Sci, Shenyang 110136, Peoples R China

来源：

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2024年 / 34卷 / 05期

关键词：

Real-time semantic segmentation; convolution neural networks; AutoEncoder; feature fusion; FUSION NETWORK;

D O I：

10.1109/TCSVT.2023.3325360

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Although semantic segmentation methods have made remarkable progress so far, their long inference process limits their use in practical applications. Recently, some two-branch and three-branch real-time segmentation networks have been proposed to improve segmentation accuracy by adding branches to extract spatial or border information. For the design of extracting spatial information branches, preserving high-resolution features or adding segmentation loss to guide spatial branches are commonly used methods to extract spatial information. However, these approaches are not the most efficient. To solve the problem, we design the spatial information extraction branch as an AutoEncoder structure, which allows us to extract the spatial structure and features of the image during the encoding and decoding process of the AutoEncoder. Border, semantic and spatial information are all helpful for segmentation tasks, and efficiently fusing these three kinds of information can obtain better feature representation compared to the fusion of two types of information in the dual-branch network. However, existing three-branch networks have yet to explore this aspect deeply. Therefore, this paper designs a new three-branch network based on this starting point. In addition, we also propose a feature fusion module called the Unified Multi-Feature Fusion module (UMF), which can fuse multiple features efficiently. Our method achieves a state-of-the-art trade-off between inference speed and accuracy on the Cityscapes, CamVid, and NightCity datasets. Specifically, BSSNet-T achieves 78.8% mIoU at 115.8 FPS on the Cityscapes dataset, 79.5% mIoU at 170.8 FPS on the CamVid dataset, and 52.6% mIoU at 172.3 FPS on the NightCity dataset. Code is available at https://github.com/SXQ-STUDY/BSSNet.

引用

页码：3424 / 3438

页数：15

共 50 条

[41] A Multi-level Feature Fusion Network for Real-time Semantic Segmentation
Wang, Lu
Xu, Qinzhen
Xiong, Zixiang
Huang, Yongming
Yang, Luxi
2019 11TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2019,
[42] Faster BiSeNet : A Faster Bilateral Segmentation Network for Real-time Semantic Segmentation
Xu, Qi
Ma, Yinan
Wu, Jing
Long, Chengnian
2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
[43] ASFNet: Adaptive multiscale segmentation fusion network for real-time semantic segmentation
Zha, Hengfeng
Liu, Rui
Yang, Xin
Zhou, Dongsheng
Zhang, Qiang
Wei, Xiaopeng
COMPUTER ANIMATION AND VIRTUAL WORLDS, 2021, 32 (3-4)
[44] Exploring Scale-Aware Features for Real-Time Semantic Segmentation of Street Scenes
Li, Kaige
Geng, Qichuan
Zhou, Zhong
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (05) : 3575 - 3587
[45] DSANet: Dilated spatial attention for real-time semantic segmentation in urban street scenes
Elhassan, Mohammed A. M.
Huang, Chenxi
Yang, Chenhui
Munea, Tewodros Legesse
EXPERT SYSTEMS WITH APPLICATIONS, 2021, 183
[46] Adjacent Feature Propagation Network (AFPNet) for Real-Time Semantic Segmentation
Hyun, Junhyuk
Seong, Hongje
Kim, Sangki
Kim, Euntai
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (09): : 5877 - 5888
[47] EFRNet: Efficient Feature Reuse Network for Real-time Semantic Segmentation
Li, Yaqian
Li, Moran
Li, Zhongliang
Xiao, Cunjun
Li, Haibin
NEURAL PROCESSING LETTERS, 2022, 54 (06) : 4647 - 4659
[48] EFRNet: Efficient Feature Reuse Network for Real-time Semantic Segmentation
Yaqian Li
Moran Li
Zhongliang Li
Cunjun Xiao
Haibin Li
Neural Processing Letters, 2022, 54 : 4647 - 4659
[49] SCMNet: Shared Context Mining Network for Real-time Semantic Segmentation
Singha, Tanmay
Bergemann, Moritz
Duc-Son Pham
Krishna, Aneesh
2021 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA 2021), 2021, : 582 - 589
[50] Attention based lightweight asymmetric network for real-time semantic segmentation
Liu, Qian
Wang, Cunbao
Li, Zhensheng
Qi, Youwei
Fang, Jiongtao
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 130

← 1 2 3 4 5 →