BSSNet: A Real-Time Semantic Segmentation Network for Road Scenes Inspired From AutoEncoder

被引:9
|
作者
Shi, Xiaoqiang [1 ,2 ,3 ]
Yin, Zhenyu [2 ,3 ]
Han, Guangjie [4 ]
Liu, Wenzhuo [5 ]
Qin, Li [1 ,2 ,3 ]
Bi, Yuanguo [6 ]
Li, Shurui [7 ]
机构
[1] Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing 100049, Peoples R China
[2] Chinese Acad Sci, Shenyang Inst Comp Technol, Shenyang 110168, Peoples R China
[3] Liaoning Key Lab Domest Ind Control Platform Techn, Shenyang 110168, Peoples R China
[4] Hohai Univ, Dept Internet Things Engn, Changzhou 213022, Peoples R China
[5] China Univ Min & Technol Beijing, Sch Artificial Intelligence, Beijing 100083, Peoples R China
[6] Northeastern Univ, Sch Comp Sci & Engn, Shenyang 110167, Peoples R China
[7] Shenyang Aerosp Univ, Sch Comp Sci, Shenyang 110136, Peoples R China
关键词
Real-time semantic segmentation; convolution neural networks; AutoEncoder; feature fusion; FUSION NETWORK;
D O I
10.1109/TCSVT.2023.3325360
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Although semantic segmentation methods have made remarkable progress so far, their long inference process limits their use in practical applications. Recently, some two-branch and three-branch real-time segmentation networks have been proposed to improve segmentation accuracy by adding branches to extract spatial or border information. For the design of extracting spatial information branches, preserving high-resolution features or adding segmentation loss to guide spatial branches are commonly used methods to extract spatial information. However, these approaches are not the most efficient. To solve the problem, we design the spatial information extraction branch as an AutoEncoder structure, which allows us to extract the spatial structure and features of the image during the encoding and decoding process of the AutoEncoder. Border, semantic and spatial information are all helpful for segmentation tasks, and efficiently fusing these three kinds of information can obtain better feature representation compared to the fusion of two types of information in the dual-branch network. However, existing three-branch networks have yet to explore this aspect deeply. Therefore, this paper designs a new three-branch network based on this starting point. In addition, we also propose a feature fusion module called the Unified Multi-Feature Fusion module (UMF), which can fuse multiple features efficiently. Our method achieves a state-of-the-art trade-off between inference speed and accuracy on the Cityscapes, CamVid, and NightCity datasets. Specifically, BSSNet-T achieves 78.8% mIoU at 115.8 FPS on the Cityscapes dataset, 79.5% mIoU at 170.8 FPS on the CamVid dataset, and 52.6% mIoU at 172.3 FPS on the NightCity dataset. Code is available at https://github.com/SXQ-STUDY/BSSNet.
引用
收藏
页码:3424 / 3438
页数:15
相关论文
共 50 条
  • [41] A Multi-level Feature Fusion Network for Real-time Semantic Segmentation
    Wang, Lu
    Xu, Qinzhen
    Xiong, Zixiang
    Huang, Yongming
    Yang, Luxi
    2019 11TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2019,
  • [42] Faster BiSeNet : A Faster Bilateral Segmentation Network for Real-time Semantic Segmentation
    Xu, Qi
    Ma, Yinan
    Wu, Jing
    Long, Chengnian
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [43] ASFNet: Adaptive multiscale segmentation fusion network for real-time semantic segmentation
    Zha, Hengfeng
    Liu, Rui
    Yang, Xin
    Zhou, Dongsheng
    Zhang, Qiang
    Wei, Xiaopeng
    COMPUTER ANIMATION AND VIRTUAL WORLDS, 2021, 32 (3-4)
  • [44] Exploring Scale-Aware Features for Real-Time Semantic Segmentation of Street Scenes
    Li, Kaige
    Geng, Qichuan
    Zhou, Zhong
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (05) : 3575 - 3587
  • [45] DSANet: Dilated spatial attention for real-time semantic segmentation in urban street scenes
    Elhassan, Mohammed A. M.
    Huang, Chenxi
    Yang, Chenhui
    Munea, Tewodros Legesse
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 183
  • [46] Adjacent Feature Propagation Network (AFPNet) for Real-Time Semantic Segmentation
    Hyun, Junhyuk
    Seong, Hongje
    Kim, Sangki
    Kim, Euntai
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (09): : 5877 - 5888
  • [47] EFRNet: Efficient Feature Reuse Network for Real-time Semantic Segmentation
    Li, Yaqian
    Li, Moran
    Li, Zhongliang
    Xiao, Cunjun
    Li, Haibin
    NEURAL PROCESSING LETTERS, 2022, 54 (06) : 4647 - 4659
  • [48] EFRNet: Efficient Feature Reuse Network for Real-time Semantic Segmentation
    Yaqian Li
    Moran Li
    Zhongliang Li
    Cunjun Xiao
    Haibin Li
    Neural Processing Letters, 2022, 54 : 4647 - 4659
  • [49] SCMNet: Shared Context Mining Network for Real-time Semantic Segmentation
    Singha, Tanmay
    Bergemann, Moritz
    Duc-Son Pham
    Krishna, Aneesh
    2021 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA 2021), 2021, : 582 - 589
  • [50] Attention based lightweight asymmetric network for real-time semantic segmentation
    Liu, Qian
    Wang, Cunbao
    Li, Zhensheng
    Qi, Youwei
    Fang, Jiongtao
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 130