Improvement of park drivable area segmentation method based on STDCSeg network

被引:0
作者
Zhang, Ting [1 ]
Liu, Yuansheng [2 ]
Fan, Yangyang [1 ]
Lu, Ming [3 ]
机构
[1] Beijing Union Univ, Beijing Key Lab Informat Serv Engn, Beijing 100101, Peoples R China
[2] Beijing Union Univ, Coll Robot, Beijing 100101, Peoples R China
[3] Beijing Union Univ, Coll Appl Sci Technol, Beijing 100101, Peoples R China
基金
中国国家自然科学基金;
关键词
Semantic segmentation; Deep learning; Convolutional neural network; Park drivable area;
D O I
10.1007/s42452-025-06767-y
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
With the increasing demand for smart mobile devices in park environments, there are higher requirements for their ability to adapt to complex scenarios. In unique park scenes such as lakeside paths and irregular stairs, the blurred road edges make it difficult for traditional algorithms to accurately distinguish between drivable and restricted areas, posing a significant challenge to the path planning and stable operation of smart mobile devices. To enhance the safe operation of these devices on complex paths, this study proposes an improved semantic segmentation method for park drivable areas, based on the Short-Term Dense Connection (STDC) network. This method effectively addresses issues such as imprecise road edge segmentation, insufficient road position awareness, and the limited generalization capability of existing datasets. Utilizing the STDC network as a backbone for key image feature extraction, a Multi-scale Coordinate Attention Module (MCAM) is introduced to bolster road position awareness and refine edge segmentation. Moreover, an Atrous Spatial Pyramid Pooling (ASPP) module is incorporated, allowing the network to manage local details and global contextual information within images. Besides, a dataset tailored to park environments is constructed using real-world and simulated park scene data for training and validation. The experimental results show that the algorithm achieved a mIoU of 98.91% on the self-built dataset, which is 3.07% higher than the STDC network. Additionally, its mIoU on the public dataset is 2.91% higher than the STDC network.
引用
收藏
页数:15
相关论文
共 27 条
[1]   Deep semantic segmentation of natural and medical images: a review [J].
Asgari Taghanaki, Saeid ;
Abhishek, Kumar ;
Cohen, Joseph Paul ;
Cohen-Adad, Julien ;
Hamarneh, Ghassan .
ARTIFICIAL INTELLIGENCE REVIEW, 2021, 54 (01) :137-178
[2]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[3]   Vestibular migraine or Meniere's disease: a diagnostic dilemma [J].
Chen, Jing-Yu ;
Guo, Zhao-Qi ;
Wang, Jun ;
Liu, Dan ;
Tian, E. ;
Guo, Jia-Qi ;
Kong, Wei-Jia ;
Zhang, Su-Lin .
JOURNAL OF NEUROLOGY, 2023, 270 (04) :1955-1968
[4]  
Chen LC, 2018, Arxiv, DOI [arXiv:1802.02611, 10.48550/ARXIV.1802.02611, DOI 10.1007/978-3-030-01234-249]
[5]  
Chen LC, 2017, Arxiv, DOI arXiv:1706.05587
[6]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[7]   The Cityscapes Dataset for Semantic Urban Scene Understanding [J].
Cordts, Marius ;
Omran, Mohamed ;
Ramos, Sebastian ;
Rehfeld, Timo ;
Enzweiler, Markus ;
Benenson, Rodrigo ;
Franke, Uwe ;
Roth, Stefan ;
Schiele, Bernt .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3213-3223
[8]   Rethinking BiSeNet For Real-time Semantic Segmentation [J].
Fan, Mingyuan ;
Lai, Shenqi ;
Huang, Junshi ;
Wei, Xiaoming ;
Chai, Zhenhua ;
Luo, Junfeng ;
Wei, Xiaolin .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :9711-9720
[9]  
Hao Y., 2022, arXiv
[10]   Coordinate Attention for Efficient Mobile Network Design [J].
Hou, Qibin ;
Zhou, Daquan ;
Feng, Jiashi .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :13708-13717