Attention-Guided Multi-Scale Fusion Network for Similar Objects Semantic Segmentation

被引:0
|
作者
Yao, Fengqin [1 ]
Wang, Shengke [1 ]
Ding, Laihui [2 ]
Zhong, Guoqiang [1 ]
Li, Shu [1 ]
Xu, Zhiwei [2 ]
机构
[1] Ocean Univ China, Qingdao 266100, Peoples R China
[2] Shandong Willand Intelligent Technol Co Ltd, Qingdao 266100, Peoples R China
关键词
Semantic segmentation; Attention-guided; Multi-scale fusion; High inter-class similarity;
D O I
10.1007/s12559-023-10206-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Image segmentation accuracy is critical in marine ecological detection utilizing unmanned aerial vehicles (UAVs). By flying a drone around, we can swiftly determine the location of a variety of species. However, remote sensing photos, particularly those of inter-class items, are remarkably similar, and there are a significant number of little objects. The universal segmentation network is ineffective. This research constructs attentional networks that imitate the human cognitive system, inspired by camouflaged object detection and the management of human attentional mechanisms in the recognition of diverse things. This research proposes TriseNet, an attention-guided multi-scale fusion semantic segmentation network that solves the challenges of high item similarity and poor segmentation accuracy in UAV settings. To begin, we employ a bidirectional feature extraction network to extract low-level spatial and high-level semantic information. Second, we leverage the attention-induced cross-level fusion module (ACFM) to create a new multi-scale fusion branch that performs cross-level learning and enhances the representation of inter-class comparable objects. Finally, the receptive field block (RFB) module is used to increase the receptive field, resulting in richer characteristics in specific layers. The inter-class similarity increases the difficulty of segmentation accuracy greatly, whereas the three modules improve feature expression and segmentation results. Experiments are conducted using our UAV dataset, UAV-OUC-SEG (55.61% MIoU), and the public dataset, Cityscapes (76.10% MIoU), to demonstrate the efficacy of our strategy. In two datasets, the TriseNet delivers the best results when compared to other prominent segmentation algorithms.
引用
收藏
页码:366 / 376
页数:11
相关论文
共 50 条
  • [31] MAF-Net: A multi-scale attention fusion network for automatic surgical instrument segmentation?
    Yang, Lei
    Gu, Yuge
    Bian, Guibin
    Liu, Yanhong
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 85
  • [32] MULTI-SCALE FUSION ATTENTION NETWORK FOR MULTISPECTRAL WORLDVIEW3 DATA ROAD SEGMENTATION
    Tong, Zhonggui
    Li, Yuxia
    Zhang, Jinglin
    Gong, Yushu
    He, Lei
    IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 5720 - 5723
  • [33] Semantic Segmentation Method Based on Residual and Multi-Scale Feature Fusion
    Xiu, Chunbo
    Su, Huan
    Su, Xuemiao
    PROCEEDINGS OF THE 32ND 2020 CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2020), 2020, : 2078 - 2083
  • [34] Semantic Segmentation on Remote Sensing Images with Multi-Scale Feature Fusion
    Zhang J.
    Jin Q.
    Wang H.
    Da C.
    Xiang S.
    Pan C.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2019, 31 (09): : 1509 - 1517
  • [35] TAG-fusion: Two-stage attention guided multi-modal fusion network for semantic segmentation
    Zhang, Zhizhou
    Wang, Wenwu
    Zhu, Lei
    Tang, Zhibin
    DIGITAL SIGNAL PROCESSING, 2025, 156
  • [36] Attention based multi-scale parallel network for polyp segmentation
    Song, Pengfei
    Li, Jinjiang
    Fan, Hui
    COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 146
  • [37] Semantic Segmentation Network of Pathological Images of Liver Tissue Based on Multi-scale Feature and Attention Mechanism
    Zhang A.
    Kang Y.
    Wu Z.
    Cui L.
    Bu Q.
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2021, 34 (04): : 375 - 384
  • [38] Parallel multi-scale network with attention mechanism for pancreas segmentation
    Long, Jianwu
    Song, Xinlei
    An, Yong
    Li, Tong
    Zhu, Jiangzhou
    IEEJ TRANSACTIONS ON ELECTRICAL AND ELECTRONIC ENGINEERING, 2022, 17 (01) : 110 - 119
  • [39] CMAA: Channel-wise multi-scale adaptive attention network for metallographic image semantic segmentation
    Sun, Yongliang
    Huang, Xiangyang
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 276
  • [40] DoubleU-NetPlus: a novel attention and context-guided dual U-Net with multi-scale residual feature fusion network for semantic segmentation of medical images
    Ahmed, Md. Rayhan
    Ashrafi, Adnan Ferdous
    Ahmed, Raihan Uddin
    Shatabda, Swakkhar
    Islam, A. K. M. Muzahidul
    Islam, Salekul
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (19) : 14379 - 14401