TBiSeg: A transformer-based network with bi-level routing attention for inland waterway segmentation

被引:2
作者
Fu, Chuanmao [1 ]
Li, Meng [1 ]
Zhang, Bo [2 ]
机构
[1] Jilin Univ, Coll Elect Sci & Engn, State Key Lab Integrated Optoelect, Changchun 130000, Jilin, Peoples R China
[2] China Ship Sci Res Ctr, Taihu Lab Deepsea, Wuxi 214082, Jiangsu, Peoples R China
关键词
Inland waterway segmentation; Vision transformer; Deep learning; Attention mechanism; UNMANNED SURFACE VEHICLES; USV;
D O I
10.1016/j.oceaneng.2024.119011
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
Unmanned surface vehicles (USVs) for inland waterways have recently attracted increasing attention in various fields. Accurate detection in navigable regions is crucial for ensuring USV safety in autonomous navigation. However, the complex and variable environment of inland waterways, such as confusable textures and irregular edge details, continues to pose some problems in existing methods. Therefore, to acquire navigable regions, this study proposed TBiSeg, a Vision Transformer-based efficient inland waterway segmentation network, for obtaining pixel-level results. Bi-level routing attention is used to improve the Transformer block, which enhances the understanding of inland water textures. Additionly, this study combined global and local attention through a hierarchical encoder-decoder architecture. To simulate inland waterway scenes as accurately as possible, this study used two representative public datasets for data integration and data augmentation, and conducted testing and cross-validating using multiple inland waterway datasets. Results demonstrated that the model performed better than current state-of-the-art models in segmentation accuracy and robustness in complex inland waterway environments while showing impressive generalization. The datasets and code used in this paper is available at https://github.com/dawnnazzz/TBiSeg.
引用
收藏
页数:15
相关论文
共 59 条
[21]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[22]   RaRWS: A Radar-assisted Real-time Water Segmentation Network to Meet the Autonomous Navigation of USV in Inland Waterways [J].
He, Weiye ;
Jiang, Xianliang ;
Jin, Guang .
2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, :4168-4174
[23]   Enabling Assistance Functions for the Safe Navigation of Inland Waterways [J].
Hesselbarth, Anja ;
Medina, Daniel ;
Ziebold, Ralf ;
Sandler, Martin ;
Hoppe, Michael ;
Uhlemann, Maik .
IEEE INTELLIGENT TRANSPORTATION SYSTEMS MAGAZINE, 2020, 12 (03) :123-135
[24]   Horizon detection in maritime images using scene parsing network [J].
Jeong, C. Y. ;
Yang, H. S. ;
Moon, K. D. .
ELECTRONICS LETTERS, 2018, 54 (12) :760-761
[25]   DilateFormer: Multi-Scale Dilated Transformer for Visual Recognition [J].
Jiao, Jiayu ;
Tang, Yu-Ming ;
Lin, Kun-Yu ;
Gao, Yipeng ;
Ma, Andy J. ;
Wang, Yaowei ;
Zheng, Wei-Shi .
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 :8906-8919
[26]  
Kim J., 2023, IEEE Robot. Autom. Lett.
[27]   Optimal Task-UAV-Edge Matching for Computation Offloading in UAV Assisted Mobile Edge Computing [J].
Kim, Kitae ;
Hong, Choong Seen .
2019 20TH ASIA-PACIFIC NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM (APNOMS), 2019,
[28]  
Kingma D. P., ADAM METHOD STOCHAST
[29]   AUTOMATIC WATERLINE EXTRACTION FROM SMARTPHONE IMAGES [J].
Kroehnert, M. .
XXIII ISPRS CONGRESS, COMMISSION V, 2016, 41 (B5) :857-863
[30]  
Lipschutz I., 2013, INT J COMPUT ENG RES, V3, P1197