Semantic Labeling of High-Resolution Images Using EfficientUNets and Transformers

被引:13
|
作者
Almarzouqi, Hasan [1 ]
Saoud, Lyes Saad [2 ]
机构
[1] Khalifa Univ, Elect Engn & Comp Sci Dept, Abu Dhabi 127788, U Arab Emirates
[2] Khalifa Univ, Mech Engn Dept, Abu Dhabi 127788, U Arab Emirates
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2023年 / 61卷
关键词
Transformers; Feature extraction; Remote sensing; Semantics; Semantic segmentation; Image resolution; Data models; Convolutional neural networks (CNNs); EfficientNet; fusion networks; semantic segmentation; transformers; SEGMENTATION; NETWORK; CLASSIFICATION; FOREST;
D O I
10.1109/TGRS.2023.3268159
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Semantic segmentation necessitates approaches that learn high-level characteristics while dealing with enormous quantities of data. Convolutional neural networks (CNNs) can learn unique and adaptive features to achieve this aim. However, due to the large size and high spatial resolution of remote sensing images, these networks cannot efficiently analyze an entire scene. Recently, deep transformers have proven their capability to record global interactions between different objects in the image. In this article, we propose a new segmentation model that combines CNNs with transformers and show that this mixture of local and global feature extraction techniques provides significant advantages in remote sensing segmentation. In addition, the proposed model includes two fusion layers that are designed to efficiently represent multimodal inputs and outputs of the network. The input fusion layer extracts feature maps summarizing the relationship between image content and elevation maps [digital surface model (DSM)]. The output fusion layer uses a novel multitask segmentation strategy where class labels are identified using class-specific feature extraction layers and loss functions. Finally, a fast-marching method (FMM) is used to convert unidentified class labels into their closest known neighbors. Our results demonstrate that the proposed method improves segmentation accuracy compared with state-of-the-art techniques.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Multiscale Feature Weighted-Aggregating and Boundary Enhancement Network for Semantic Segmentation of High-Resolution Remote Sensing Images
    Zhao, Yingying
    Zheng, Guizhou
    Xu, Zhangyan
    Qiu, Zhonghang
    Chen, Zhixing
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2022, 15 : 8118 - 8130
  • [22] SEMANTIC SEGMENTATION OF HIGH-RESOLUTION REMOTE SENSING IMAGES USING AN IMPROVED TRANSFORMER
    Liu, Yuheng
    Mei, Shaohui
    Zhang, Shun
    Wang, Ye
    He, Mingyi
    Du, Qian
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 3496 - 3499
  • [23] FSegNet: A Semantic Segmentation Network for High-Resolution Remote Sensing Images That Balances Efficiency and Performance
    Luo, Wen
    Deng, Fei
    Jiang, Peifan
    Dong, Xiujun
    Zhang, Gulan
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21 : 1 - 5
  • [24] HCANet: A Hierarchical Context Aggregation Network for Semantic Segmentation of High-Resolution Remote Sensing Images
    Bai, Haiwei
    Cheng, Jian
    Huang, Xia
    Liu, Siyu
    Deng, Changjian
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [25] High-Resolution Aerial Imagery Semantic Labeling with Dense Pyramid Network
    Pan, Xuran
    Gao, Lianru
    Zhang, Bing
    Yang, Fan
    Liao, Wenzhi
    SENSORS, 2018, 18 (11)
  • [26] MsanlfNet: Semantic Segmentation Network With Multiscale Attention and Nonlocal Filters for High-Resolution Remote Sensing Images
    Bai, Lin
    Lin, Xiangyuan
    Ye, Zhen
    Xue, Dongling
    Yao, Cheng
    Hui, Meng
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [27] SCAttNet: Semantic Segmentation Network With Spatial and Channel Attention Mechanism for High-Resolution Remote Sensing Images
    Li, Haifeng
    Qiu, Kaijian
    Chen, Li
    Mei, Xiaoming
    Hong, Liang
    Tao, Chao
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2021, 18 (05) : 905 - 909
  • [28] MFALNet: A Multiscale Feature Aggregation Lightweight Network for Semantic Segmentation of High-Resolution Remote Sensing Images
    Lv, Liang
    Guo, Yiyou
    Bao, Tengfei
    Fu, Chenqin
    Huo, Hong
    Fang, Tao
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2021, 18 (12) : 2172 - 2176
  • [29] SegCLIP: Multimodal Visual-Language and Prompt Learning for High-Resolution Remote Sensing Semantic Segmentation
    Zhang, Shijie
    Zhang, Bin
    Wu, Yuntao
    Zhou, Huabing
    Jiang, Junjun
    Ma, Jiayi
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [30] See, Perceive, and Answer: A Unified Benchmark for High-Resolution Postdisaster Evaluation in Remote Sensing Images
    Zhao, Danpei
    Lu, Jiankai
    Yuan, Bo
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 14