Attention-Guided Multi-Scale Fusion Network for Similar Objects Semantic Segmentation

被引:0
|
作者
Yao, Fengqin [1 ]
Wang, Shengke [1 ]
Ding, Laihui [2 ]
Zhong, Guoqiang [1 ]
Li, Shu [1 ]
Xu, Zhiwei [2 ]
机构
[1] Ocean Univ China, Qingdao 266100, Peoples R China
[2] Shandong Willand Intelligent Technol Co Ltd, Qingdao 266100, Peoples R China
关键词
Semantic segmentation; Attention-guided; Multi-scale fusion; High inter-class similarity;
D O I
10.1007/s12559-023-10206-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Image segmentation accuracy is critical in marine ecological detection utilizing unmanned aerial vehicles (UAVs). By flying a drone around, we can swiftly determine the location of a variety of species. However, remote sensing photos, particularly those of inter-class items, are remarkably similar, and there are a significant number of little objects. The universal segmentation network is ineffective. This research constructs attentional networks that imitate the human cognitive system, inspired by camouflaged object detection and the management of human attentional mechanisms in the recognition of diverse things. This research proposes TriseNet, an attention-guided multi-scale fusion semantic segmentation network that solves the challenges of high item similarity and poor segmentation accuracy in UAV settings. To begin, we employ a bidirectional feature extraction network to extract low-level spatial and high-level semantic information. Second, we leverage the attention-induced cross-level fusion module (ACFM) to create a new multi-scale fusion branch that performs cross-level learning and enhances the representation of inter-class comparable objects. Finally, the receptive field block (RFB) module is used to increase the receptive field, resulting in richer characteristics in specific layers. The inter-class similarity increases the difficulty of segmentation accuracy greatly, whereas the three modules improve feature expression and segmentation results. Experiments are conducted using our UAV dataset, UAV-OUC-SEG (55.61% MIoU), and the public dataset, Cityscapes (76.10% MIoU), to demonstrate the efficacy of our strategy. In two datasets, the TriseNet delivers the best results when compared to other prominent segmentation algorithms.
引用
收藏
页码:366 / 376
页数:11
相关论文
共 50 条
  • [21] Semantic Segmentation of Remote Sensing Images Based on Dual Attention and Multi-scale Feature Fusion
    Weng, Mengqian
    Hu, Zhibo
    Xie, Xiaopeng
    Li, Yunhong
    Hu, Lei
    TWELFTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2020), 2021, 11720
  • [22] Multi-scale Adaptive Feature Fusion Network for Semantic Segmentation in Remote Sensing Images
    Shang, Ronghua
    Zhang, Jiyu
    Jiao, Licheng
    Li, Yangyang
    Marturi, Naresh
    Stolkin, Rustam
    REMOTE SENSING, 2020, 12 (05)
  • [23] Dual attention-guided distillation for class incremental semantic segmentation
    Xu, Pengju
    Wang, Yan
    Wang, Bingye
    Zhao, Haiying
    APPLIED INTELLIGENCE, 2025, 55 (07)
  • [24] Attention-guided and resource-saving modules for semantic segmentation
    Yu, Beike
    Wang, Dafang
    Cao, Jiang
    Liu, Lei
    Wang, Tao
    JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (04)
  • [25] Boundary-Guided Lightweight Semantic Segmentation With Multi-Scale Semantic Context
    Zhou, Quan
    Wang, Linjie
    Gao, Guangwei
    Kang, Bin
    Ou, Weihua
    Lu, Huimin
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 7887 - 7900
  • [26] Multi-Scale Kolmogorov-Arnold Network (KAN)-Based Linear Attention Network: Multi-Scale Feature Fusion with KAN and Deformable Convolution for Urban Scene Image Semantic Segmentation
    Li, Yuanhang
    Liu, Shuo
    Wu, Jie
    Sun, Weichao
    Wen, Qingke
    Wu, Yibiao
    Qin, Xiujuan
    Qiao, Yanyou
    REMOTE SENSING, 2025, 17 (05)
  • [27] Multi-Scale Fusion With Matching Attention Model: A Novel Decoding Network Cooperated With NAS for Real-Time Semantic Segmentation
    Xie, Bangquan
    Yang, Zongming
    Yang, Liang
    Luo, Ruifa
    Wei, Ailin
    Weng, Xiaoxiong
    Li, Bing
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (08) : 12622 - 12632
  • [28] Attention-Guided Label Refinement Network for Semantic Segmentation of Very High Resolution Aerial Orthoimages
    Huang, Jianfeng
    Zhang, Xinchang
    Sun, Ying
    Xin, Qinchuan
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 (14) : 4490 - 4503
  • [29] Remote sensing image semantic segmentation network based on multi-scale feature enhancement fusion
    Wang, Feiting
    Zhang, Yuan
    Hu, Qiongqiong
    Zhu, Yu
    GEOCARTO INTERNATIONAL, 2024, 39 (01)
  • [30] Multi-Scale Recursive Context Aggregation Network for Semantic Segmentation
    Yalcin, Abdullah
    Keskinoz, Mehmet
    32ND IEEE SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU 2024, 2024,