Multimodal Remote Sensing Image Segmentation With Intuition-Inspired Hypergraph Modeling

被引:42
|
作者
He, Qibin [1 ,2 ]
Sun, Xian [1 ,2 ]
Diao, Wenhui [1 ,3 ]
Yan, Zhiyuan [1 ,3 ]
Yao, Fanglong [1 ,2 ]
Fu, Kun [1 ,2 ]
机构
[1] Chinese Acad Sci, Aerosp Informat Res Inst, Key Lab Network Informat Syst Technol NIST, Beijing 1100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Elect Elect & Commun Engn, Beijing 100190, Peoples R China
[3] Chinese Acad Sci, Inst Elect, Key Lab Network Informat Syst Technol NIST, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Cognition; Semantics; Image segmentation; Remote sensing; Optical interferometry; Vegetation; Optical sensors; Multimodal remote sensing; intuitive reasoning; hypergraph learning; semantic segmentation; SEMANTIC SEGMENTATION; FUSION NETWORK; ATTENTION;
D O I
10.1109/TIP.2023.3245324
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multimodal remote sensing (RS) image segmentation aims to comprehensively utilize multiple RS modalities to assign pixel-level semantics to the studied scenes, which can provide a new perspective for global city understanding. Multimodal segmentation inevitably encounters the challenge of modeling intra-and inter-modal relationships, i.e., object diversity and modal gaps. However, the previous methods are usually designed for a single RS modality, limited by the noisy collection environment and poor discrimination information. Neuropsychology and neuroanatomy confirm that the human brain performs the guiding perception and integrative cognition of multimodal semantics through intuitive reasoning. Therefore, establishing a semantic understanding framework inspired by intuition to realize multimodal RS segmentation becomes the main motivation of this work. Drived by the superiority of hypergraphs in modeling high-order relationships, we propose an intuition-inspired hypergraph network ((IH)-H-2 N) for multimodal RS segmentation. Specifically, we present a hypergraph parser to imitate guiding perception to learn intra-modal object-wise relationships. It parses the input modality into irregular hyper graphs to mine semantic clues and generate robust mono modal representations. In addition, we also design a hypergraph matcher to dynamically update the hypergraph structure from the explicit correspondence of visual concepts, similar to integrative cognition, to improve cross-modal compatibility when fusing multimodal features. Extensive experiments on two multimodal RS datasets show that the proposed I2H N outperforms the stateof-the-art models, achieving F-1/mIoU accuracy 91.4%/82.9% on the ISPRS Vaihingen dataset, and 92.1%/84.2% on the MSAW dataset.
引用
收藏
页码:1474 / 1487
页数:14
相关论文
共 50 条
  • [1] A Light-Weighted Hypergraph Neural Network for Multimodal Remote Sensing Image Retrieval
    Yu, Hongfeng
    Deng, Chubo
    Zhao, Liangjin
    Hao, Lingxiang
    Liu, Xiaoyu
    Lu, Wanxuan
    You, Hongjian
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, 16 : 2690 - 2702
  • [2] Remote Sensing Image Semantic Segmentation Network Based on Multimodal Fusion
    Hu, Yuxiang
    Yu, Changhong
    Gao, Ming
    Computer Engineering and Applications, 60 (15): : 234 - 242
  • [3] Brain Image Segmentation Based on Hypergraph Modeling
    Hu, Jicheng
    Wei, Xiaofeng
    He, Honglin
    2014 IEEE 12TH INTERNATIONAL CONFERENCE ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING (DASC)/2014 IEEE 12TH INTERNATIONAL CONFERENCE ON EMBEDDED COMPUTING (EMBEDDEDCOM)/2014 IEEE 12TH INTERNATIONAL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING (PICOM), 2014, : 327 - 332
  • [4] A Mamba-Diffusion Framework for Multimodal Remote Sensing Image Semantic Segmentation
    Du, Wen-Liang
    Gu, Yang
    Zhao, Jiaqi
    Zhu, Hancheng
    Yao, Rui
    Zhou, Yong
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21
  • [5] Hypergraph-Guided Multimodal Prototype for Remote Sensing Scene Understanding
    Liu, Chenglong
    Deng, Chubo
    Yu, Hongfeng
    Yan, Qiwei
    Xu, Liangyu
    Zhang, Ting
    Sun, Xian
    Fu, Kun
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63
  • [6] FDGSNet: A Multimodal Gated Segmentation Network for Remote Sensing Image Based on Frequency Decomposition
    Cui, Jian
    Liu, Jiahang
    Ni, Yue
    Wang, Jinjin
    Li, Manchun
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 19756 - 19770
  • [7] Hypergraph Representation Learning for Remote Sensing Image Change Detection
    Cui, Zhoujuan
    Zu, Yueran
    Duan, Yiping
    Tao, Xiaoming
    REMOTE SENSING, 2024, 16 (18)
  • [8] Semantic Co-Occurrence and Relationship Modeling for Remote Sensing Image Segmentation
    Zhang, Yinxing
    Song, Haochen
    Wang, Qingwang
    Jin, Pengcheng
    Shen, Tao
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2025, 18 : 6630 - 6640
  • [9] LMFNet: Lightweight Multimodal Fusion Network for high-resolution remote sensing image segmentation
    Wang, Tong
    Chen, Guanzhou
    Zhang, Xiaodong
    Liu, Chenxi
    Wang, Jiaqi
    Tan, Xiaoliang
    Zhou, Wenlin
    He, Chanjuan
    PATTERN RECOGNITION, 2025, 164
  • [10] Deep Multimodal Fusion Network for Semantic Segmentation Using Remote Sensing Image and LiDAR Data
    Sun, Yangjie
    Fu, Zhongliang
    Sun, Chuanxia
    Hu, Yinglei
    Zhang, Shengyuan
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60