A Foreground-Driven Fusion Network for Gully Erosion Extraction Utilizing UAV Orthoimages and Digital Surface Models

Cited by: 1
Authors
Shen, Yi [1]
Su, Nan [1]
Zhao, Chunhui [1]
Yan, Yiming [1]
Feng, Shou [1]
Liu, Yong [2]
Xiang, Wei [3, 4]
Affiliations
[1] Harbin Engn Univ, Coll Informat & Commun Engn, Harbin 150001, Peoples R China
[2] Heilongjiang Prov Hydraul Res Inst, Harbin 150000, Peoples R China
[3] La Trobe Univ, Sch Comp Engn & Math Sci, Melbourne, Vic 3086, Australia
[4] James Cook Univ, Coll Sci & Engn, Cairns, Qld 4878, Australia
Source
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024 / Vol. 62
Funding
National Natural Science Foundation of China;
Keywords
Germanium; Autonomous aerial vehicles; Feature extraction; Data mining; Erosion; Vegetation mapping; Vegetation; Digital surface models (DSMs); foreground prototypes; gully erosion (GE) extraction; multimodal segmentation; semantic segmentation; unmanned aerial vehicle (UAV) orthoimages;
DOI
10.1109/TGRS.2024.3417398
Chinese Library Classification (CLC) number
P3 [Geophysics]; P59 [Geochemistry];
Subject classification codes
0708; 070902;
Abstract
Unmanned aerial vehicle (UAV) orthoimages and digital surface models (DSMs) can provide valuable insights for semantic segmentation methods in comprehending gully erosion (GE) from diverse perspectives. While the integration of these two modalities has the potential to improve the GE extraction performance, the extent of enhancement primarily depends on the quality of modality-specific features and the synergistic fusion manner employed for integrating features from both modalities. Toward this end, we propose a novel multimodal segmentation method, which is called foreground-driven fusion network (FFNet). Guided by the prototypes of foreground objects (i.e., gullies), the network effectively tackles the challenges from the modality itself and between different modalities, ultimately achieving high-quality GE extraction results. Specifically, a foreground prototype sampling (FPS) module is first devised for precisely sampling foreground prototypes related to gullies from two modalities. Then, a local-global hybrid purification (LHP) module is proposed to effectively mitigate the erroneous activation within each modality at multiple dimensions by leveraging foreground prototypes. Finally, a multimodal foreground synergy (MFS) module is introduced to further activate foreground features and facilitate full complementarity between multimodal foreground features. To validate our network, a comprehensive multimodal dataset for GE extraction is constructed based on UAV orthoimages and DSMs from northeastern China. Furthermore, a public road extraction dataset is employed to evaluate the generalizability of this network. In the experiments conducted on these two datasets, the proposed FFNet exhibits obvious superiority, outperforming the second-best method with an average improvement of 2.55% in terms of intersection over union (IoU) and 2.77% in terms of $F_1$-score. These experimental results not only demonstrate the practicality of FFNet in GE extraction tasks, but also highlight its significant advantage in similar road extraction tasks.
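The abstract reports results in terms of IoU and F1-score for a foreground class (gullies or roads). As a reminder of how these two metrics are commonly computed for binary segmentation masks, here is a minimal sketch in plain Python. The function name `iou_and_f1`, the nested-list mask representation, and the convention of returning 1.0 when both masks are empty are illustrative assumptions; they are not taken from the paper, whose evaluation code may differ.

```python
def iou_and_f1(pred, target):
    """Return (IoU, F1) for the foreground class of two binary masks.

    `pred` and `target` are equally sized nested lists of 0/1 values
    (hypothetical representation chosen for this sketch).
    """
    tp = fp = fn = 0
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            if p and t:
                tp += 1          # foreground predicted and present
            elif p and not t:
                fp += 1          # foreground predicted but absent
            elif t and not p:
                fn += 1          # foreground present but missed

    union = tp + fp + fn         # |pred OR target|
    f1_denominator = 2 * tp + fp + fn

    # Assumed convention: two empty masks count as a perfect match.
    iou = tp / union if union else 1.0
    f1 = 2 * tp / f1_denominator if f1_denominator else 1.0
    return iou, f1


# Toy 2x3 masks: 2 true positives, 1 false positive, 1 false negative.
pred = [[1, 1, 0],
        [0, 1, 0]]
target = [[1, 0, 0],
          [0, 1, 1]]
iou, f1 = iou_and_f1(pred, target)
print(f"IoU = {iou:.4f}, F1 = {f1:.4f}")
```

For a single class the two metrics are related by F1 = 2·IoU / (1 + IoU), so a gain in one implies a gain in the other, although the size of the gain differs.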
Pages: 16