Multi-modal deep learning approaches to semantic segmentation of mining footprints with multispectral satellite imagery

Cited by: 1
Authors
Saputra, Muhamad Risqi U. [1]
Bhaswara, Irfan Dwiki [1]
Nasution, Bahrul Ilmi [1]
Ern, Michelle Ang Li [2]
Husna, Nur Laily Romadhotul [1]
Witra, Tahjudil [1]
Feliren, Vicky [1]
Owen, John R. [3]
Kemp, Deanna [4]
Lechner, Alex M. [1]
Affiliations
[1] Monash Univ Indonesia, Min Spatial Data Intelligence Res Hub, Green Off Pk 9, Tangerang Selatan 15345, Banten, Indonesia
[2] Univ Nottingham Malaysia, Sch Environm & Geog Sci, Landscape Ecol & Conservat Lab, Semenyih 43500, Malaysia
[3] Univ Free State, Ctr Dev Support, 205 Nelson Mandela Dr,Pk West, ZA-9301 Bloemfontein, South Africa
[4] Univ Queensland, Sustainable Minerals Inst, Ctr Social Responsibil Min, Brisbane, Qld 4072, Australia
Funding
Australian Research Council;
Keywords
Semantic segmentation; Global mining footprints; Multispectral; Deep learning; IMPACTS;
DOI
10.1016/j.rse.2024.114584
Chinese Library Classification (CLC) number
X [Environmental science, safety science];
Discipline classification code
08 ; 0830 ;
Abstract
Existing remote sensing applications in mining are often limited in scope: they typically map multiple mining land covers for a single mine, or map only mining extents or a single feature (e.g., a tailings dam) for multiple mines across a region. Many of these works focus narrowly on specific mine land covers rather than encompassing the variety of mining and non-mining land uses within a mine site. This study presents a pioneering effort in deep learning-based semantic segmentation of 37 mining locations worldwide, representing a range of commodities from gold to coal, using multispectral satellite imagery to automate the mapping of mining and non-mining land covers. In the absence of a dedicated training dataset, we built a customized multispectral dataset for training and testing deep learning models, leveraging existing datasets and refining their boundaries, shapes, and class labels. We trained and tested multimodal semantic segmentation models based on the U-Net, DeepLabV3+, Feature Pyramid Network (FPN), SegFormer, and IBM-NASA geospatial foundation model (Prithvi) architectures, focusing on different model configurations, input band combinations, and the effectiveness of transfer learning. For multimodality, we used various image bands, including Red, Green, Blue, and Near-Infrared (NIR), as well as the Normalized Difference Vegetation Index (NDVI), to determine which combination of inputs yields the most accurate segmentation. Results indicated that, among the configurations tested, FPN with a DenseNet-121 backbone, pre-trained on ImageNet and trained using both RGB and NIR bands, performed best. We concluded the study with a comprehensive assessment of the model's performance across climate classification categories and diverse mining commodities. We believe that this work lays a robust foundation for further analysis of the complex relationship between mining projects, communities, and the environment.
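The input band combinations mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' pipeline: the function names (`ndvi`, `stack_bands`), the toy reflectance values, and the small `eps` guard are assumptions for illustration only. It uses the standard definition NDVI = (NIR - Red) / (NIR + Red) and plain Python lists; in practice an array library would normally be used.

```python
from typing import Dict, List, Sequence

Band = List[List[float]]  # one band as rows of reflectance values


def ndvi(nir: Band, red: Band, eps: float = 1e-8) -> Band:
    """Per-pixel NDVI = (NIR - Red) / (NIR + Red); eps avoids division by zero."""
    return [
        [(n - r) / (n + r + eps) for n, r in zip(nir_row, red_row)]
        for nir_row, red_row in zip(nir, red)
    ]


def stack_bands(bands: Dict[str, Band], names: Sequence[str]) -> List[Band]:
    """Build a channel-first input (C x H x W) from the requested band names."""
    available = dict(bands)
    # Derive NDVI on demand from the NIR and Red bands.
    if "ndvi" in names and "ndvi" not in available:
        available["ndvi"] = ndvi(available["nir"], available["red"])
    return [available[name] for name in names]


# Toy 2x2 reflectance values (hypothetical, for illustration only).
bands = {
    "red":   [[0.10, 0.30], [0.20, 0.40]],
    "green": [[0.20, 0.30], [0.25, 0.35]],
    "blue":  [[0.05, 0.25], [0.15, 0.30]],
    "nir":   [[0.50, 0.30], [0.60, 0.40]],
}

# Two candidate input combinations: RGB + NIR and RGB + NDVI.
rgb_nir = stack_bands(bands, ["red", "green", "blue", "nir"])
rgb_ndvi = stack_bands(bands, ["red", "green", "blue", "ndvi"])
print(len(rgb_nir), len(rgb_ndvi))
```

Each stacked result has one channel per requested band, so a segmentation model consuming it would need its first layer sized to that channel count (four in both combinations shown here).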
Pages: 16