Multimodal Remote Sensing Image Segmentation With Intuition-Inspired Hypergraph Modeling

Cited by: 42
Authors
He, Qibin [1 ,2 ]
Sun, Xian [1 ,2 ]
Diao, Wenhui [1 ,3 ]
Yan, Zhiyuan [1 ,3 ]
Yao, Fanglong [1 ,2 ]
Fu, Kun [1 ,2 ]
Affiliations
[1] Chinese Acad Sci, Aerosp Informat Res Inst, Key Lab Network Informat Syst Technol NIST, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Elect Elect & Commun Engn, Beijing 100190, Peoples R China
[3] Chinese Acad Sci, Inst Elect, Key Lab Network Informat Syst Technol NIST, Beijing 100190, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Cognition; Semantics; Image segmentation; Remote sensing; Optical interferometry; Vegetation; Optical sensors; Multimodal remote sensing; intuitive reasoning; hypergraph learning; semantic segmentation; SEMANTIC SEGMENTATION; FUSION NETWORK; ATTENTION;
DOI
10.1109/TIP.2023.3245324
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Multimodal remote sensing (RS) image segmentation aims to comprehensively utilize multiple RS modalities to assign pixel-level semantics to the studied scenes, which can provide a new perspective for global city understanding. Multimodal segmentation inevitably encounters the challenge of modeling intra- and inter-modal relationships, i.e., object diversity and modal gaps. However, previous methods are usually designed for a single RS modality and are limited by the noisy collection environment and poor discrimination information. Neuropsychology and neuroanatomy confirm that the human brain performs the guiding perception and integrative cognition of multimodal semantics through intuitive reasoning. Therefore, establishing a semantic understanding framework inspired by intuition to realize multimodal RS segmentation is the main motivation of this work. Driven by the superiority of hypergraphs in modeling high-order relationships, we propose an intuition-inspired hypergraph network (I²HN) for multimodal RS segmentation. Specifically, we present a hypergraph parser that imitates guiding perception to learn intra-modal object-wise relationships. It parses the input modality into irregular hypergraphs to mine semantic clues and generate robust monomodal representations. In addition, we design a hypergraph matcher that dynamically updates the hypergraph structure from the explicit correspondence of visual concepts, similar to integrative cognition, to improve cross-modal compatibility when fusing multimodal features. Extensive experiments on two multimodal RS datasets show that the proposed I²HN outperforms state-of-the-art models, achieving F1/mIoU scores of 91.4%/82.9% on the ISPRS Vaihingen dataset and 92.1%/84.2% on the MSAW dataset.
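The abstract's reference to hypergraphs modeling high-order relationships can be illustrated with a minimal sketch. It is a generic toy example, not the I²HN parser or matcher: the vertices, hyperedge names, feature values, and the helper functions `incidence_matrix` and `propagate` are all assumptions made for illustration. It shows only that a hyperedge can join more than two vertices at once, and that features can be passed from vertices to hyperedges and back by simple averaging.

```python
# Toy hypergraph: 5 vertices (think of them as image regions) and 2 hyperedges.
# Unlike an ordinary graph edge, a hyperedge may join any number of vertices.
# All names and values here are hypothetical, chosen only for illustration.
hyperedges = {
    "e0": [0, 1, 2],  # three regions grouped together
    "e1": [2, 3, 4],  # vertex 2 belongs to both groups
}
num_vertices = 5


def incidence_matrix(hyperedges, num_vertices):
    """Return H where H[v][j] = 1 if vertex v belongs to the j-th hyperedge."""
    names = sorted(hyperedges)
    H = [[0] * len(names) for _ in range(num_vertices)]
    for j, name in enumerate(names):
        for v in hyperedges[name]:
            H[v][j] = 1
    return H


def propagate(H, features):
    """One round of vertex -> hyperedge -> vertex mean aggregation."""
    n, m = len(H), len(H[0])
    # Each hyperedge feature is the mean of its member vertices.
    edge_feat = []
    for j in range(m):
        members = [v for v in range(n) if H[v][j]]
        edge_feat.append(sum(features[v] for v in members) / len(members))
    # Each vertex feature is the mean of the hyperedges that contain it;
    # a vertex in no hyperedge keeps its original feature.
    out = []
    for v in range(n):
        edges = [j for j in range(m) if H[v][j]]
        if edges:
            out.append(sum(edge_feat[j] for j in edges) / len(edges))
        else:
            out.append(features[v])
    return out


H = incidence_matrix(hyperedges, num_vertices)
features = [1.0, 2.0, 3.0, 4.0, 5.0]  # one scalar feature per vertex
smoothed = propagate(H, features)
print(H)
print(smoothed)
```

Vertex 2 sits in both hyperedges, so its updated feature mixes information from both groups. This is the kind of many-to-many relationship that a pairwise graph edge cannot express directly.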
Pages: 1474-1487
Page count: 14