Attention-Guided Multi-Scale Fusion Network for Similar Objects Semantic Segmentation

被引:0
|
作者
Yao, Fengqin [1 ]
Wang, Shengke [1 ]
Ding, Laihui [2 ]
Zhong, Guoqiang [1 ]
Li, Shu [1 ]
Xu, Zhiwei [2 ]
机构
[1] Ocean Univ China, Qingdao 266100, Peoples R China
[2] Shandong Willand Intelligent Technol Co Ltd, Qingdao 266100, Peoples R China
关键词
Semantic segmentation; Attention-guided; Multi-scale fusion; High inter-class similarity;
D O I
10.1007/s12559-023-10206-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Image segmentation accuracy is critical in marine ecological detection utilizing unmanned aerial vehicles (UAVs). By flying a drone around, we can swiftly determine the location of a variety of species. However, remote sensing photos, particularly those of inter-class items, are remarkably similar, and there are a significant number of little objects. The universal segmentation network is ineffective. This research constructs attentional networks that imitate the human cognitive system, inspired by camouflaged object detection and the management of human attentional mechanisms in the recognition of diverse things. This research proposes TriseNet, an attention-guided multi-scale fusion semantic segmentation network that solves the challenges of high item similarity and poor segmentation accuracy in UAV settings. To begin, we employ a bidirectional feature extraction network to extract low-level spatial and high-level semantic information. Second, we leverage the attention-induced cross-level fusion module (ACFM) to create a new multi-scale fusion branch that performs cross-level learning and enhances the representation of inter-class comparable objects. Finally, the receptive field block (RFB) module is used to increase the receptive field, resulting in richer characteristics in specific layers. The inter-class similarity increases the difficulty of segmentation accuracy greatly, whereas the three modules improve feature expression and segmentation results. Experiments are conducted using our UAV dataset, UAV-OUC-SEG (55.61% MIoU), and the public dataset, Cityscapes (76.10% MIoU), to demonstrate the efficacy of our strategy. In two datasets, the TriseNet delivers the best results when compared to other prominent segmentation algorithms.
引用
收藏
页码:366 / 376
页数:11
相关论文
共 50 条
  • [41] Semantic Segmentation of Remote Sensing Image via Self-Attention-Based Multi-Scale Feature Fusion
    Guo D.
    Fu Y.
    Zhu Y.
    Wen W.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2023, 35 (08): : 1259 - 1268
  • [42] Semantic Segmentation of Urban Airborne LiDAR Point Clouds Based on Fusion Attention Mechanism and Multi-Scale Features
    Wang, Jingxue
    Li, Huan
    Xu, Zhenghui
    Xie, Xiao
    REMOTE SENSING, 2023, 15 (21)
  • [43] Attention Guided Encoder-Decoder Network With Multi-Scale Context Aggregation for Land Cover Segmentation
    Wang, Shuyang
    Mu, Xiaodong
    Yang, Dongfang
    He, Hao
    Zhao, Peng
    IEEE ACCESS, 2020, 8 : 215299 - 215309
  • [44] DoubleU-NetPlus: a novel attention and context-guided dual U-Net with multi-scale residual feature fusion network for semantic segmentation of medical images
    Md. Rayhan Ahmed
    Adnan Ferdous Ashrafi
    Raihan Uddin Ahmed
    Swakkhar Shatabda
    A. K. M. Muzahidul Islam
    Salekul Islam
    Neural Computing and Applications, 2023, 35 : 14379 - 14401
  • [45] AMF-NET: Attention-aware Multi-scale Fusion Network for Retinal Vessel Segmentation
    Yang, Qi
    Ma, Bingqi
    Cui, Hui
    Ma, Jiquan
    2021 43RD ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY (EMBC), 2021, : 3277 - 3280
  • [46] MLFNet- Point Cloud Semantic Segmentation Convolution Network Based on Multi-Scale Feature Fusion
    Yang, Jingfang
    Zou, Bochang
    Qiu, Huadong
    Li, Zhi
    IEEE ACCESS, 2021, 9 : 44950 - 44962
  • [47] Multi-Scale Feature Aggregation Network for Semantic Segmentation of Land Cover
    Shen, Xu
    Weng, Liguo
    Xia, Min
    Lin, Haifeng
    REMOTE SENSING, 2022, 14 (23)
  • [48] Multi-Scale Convolutional Features Network for Semantic Segmentation in Indoor Scenes
    Wang, Yanran
    Chen, Qingliang
    Chen, Shilang
    Wu, Junjun
    IEEE ACCESS, 2020, 8 : 89575 - 89583
  • [49] MFSNet: Enhancing Semantic Segmentation of Urban Scenes with a Multi-Scale Feature Shuffle Network
    Qian, Xiaohong
    Shu, Chente
    Jin, Wuyin
    Yu, Yunxiang
    Yang, Shengying
    ELECTRONICS, 2024, 13 (01)
  • [50] Enhanced multi-scale feature adaptive fusion sparse convolutional network for large-scale scenes semantic segmentation☆
    Shen, Lingfeng
    Cao, Yanlong
    Zhu, Wenbin
    Ren, Kai
    Shou, Yejun
    Wang, Haocheng
    Xu, Zhijie
    COMPUTERS & GRAPHICS-UK, 2025, 126