Multi-modal deep learning approaches to semantic segmentation of mining footprints with multispectral satellite imagery

被引:1
作者
Saputra, Muhamad Risqi U. [1 ]
Bhaswara, Irfan Dwiki [1 ]
Nasution, Bahrul Ilmi [1 ]
Ern, Michelle Ang Li [2 ]
Husna, Nur Laily Romadhotul [1 ]
Witra, Tahjudil [1 ]
Feliren, Vicky [1 ]
Owen, John R. [3 ]
Kemp, Deanna [4 ]
Lechner, Alex M. [1 ]
机构
[1] Monash Univ Indonesia, Min Spatial Data Intelligence Res Hub, Green Off Pk 9, Tangerang Selatan 15345, Banten, Indonesia
[2] Univ Nottingham Malaysia, Sch Environm & Geog Sci, Landscape Ecol & Conservat Lab, Semenyih 43500, Malaysia
[3] Univ Free State, Ctr Dev Support, 205 Nelson Mandela Dr,Pk West, ZA-9301 Bloemfontein, South Africa
[4] Univ Queensland, Sustainable Minerals Inst, Ctr Social Responsibil Min, Brisbane, Qld 4072, Australia
基金
澳大利亚研究理事会;
关键词
Semantic segmentation; Global mining footprints; Multispectral; Deep learning; IMPACTS;
D O I
10.1016/j.rse.2024.114584
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Existing remote sensing applications in mining are often of limited scope, typically mapping multiple mining land covers for a single mine or only mapping mining extents or a single feature (e.g., tailings dam) for multiple mines across a region. Many of these works have a narrow focus on specific mine land covers rather than encompassing the variety of mining and non-mining land use in a mine site. This study presents a pioneering effort in performing deep learning-based semantic segmentation of 37 mining locations worldwide, representing a range of commodities from gold to coal, using multispectral satellite imagery, to automate mapping of mining and non-mining land covers. Due to the absence of a dedicated training dataset, we crafted a customized multispectral dataset for training and testing deep learning models, leveraging and refining existing datasets in terms of boundaries, shapes, and class labels. We trained and tested multimodal semantic segmentation models, particularly based on U-Net, DeepLabV3+, Feature Pyramid Network (FPN), SegFormer, and IBM-NASA foundational geospatial model (Prithvi) architecture, with a focus on evaluating different model configurations, input band combinations, and the effectiveness of transfer learning. In terms of multimodality, we utilized various image bands, including Red, Green, Blue, and Near Infra-Red (NIR) and Normalized Difference Vegetation Index (NDVI), to determine which combination of inputs yields the most accurate segmentation. Results indicated that among different configurations, FPN with DenseNet-121 backbone, pre-trained on ImageNet, and trained using both RGB and NIR bands, performs the best. We concluded the study with a comprehensive assessment of the model's performance based on climate classification categories and diverse mining commodities. We believe that this work lays a robust foundation for further analysis of the complex relationship between mining projects, communities, and the environment.
引用
收藏
页数:16
相关论文
共 86 条
[21]   Google Earth Engine: Planetary-scale geospatial analysis for everyone [J].
Gorelick, Noel ;
Hancher, Matt ;
Dixon, Mike ;
Ilyushchenko, Simon ;
Thau, David ;
Moore, Rebecca .
REMOTE SENSING OF ENVIRONMENT, 2017, 202 :18-27
[22]   MS-Celeb-1M: A Dataset and Benchmark for Large-Scale Face Recognition [J].
Guo, Yandong ;
Zhang, Lei ;
Hu, Yuxiao ;
He, Xiaodong ;
Gao, Jianfeng .
COMPUTER VISION - ECCV 2016, PT III, 2016, 9907 :87-102
[23]   A review of semantic segmentation using deep neural networks [J].
Guo, Yanming ;
Liu, Yu ;
Georgiou, Theodoros ;
Lew, Michael S. .
INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2018, 7 (02) :87-93
[24]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[25]   Probabilistic SAR-based water segmentation with adapted Bayesian convolutional neural network [J].
Hertel, Victor ;
Chow, Candace ;
Wani, Omar ;
Wieland, Marc ;
Martinis, Sandro .
REMOTE SENSING OF ENVIRONMENT, 2023, 285
[26]  
Holscher K., 2021, Urban Transformations, V3, DOI [DOI 10.1186/S42854-021-00019-Z, 10.1186/s42854-021-00019-z]
[27]   Searching for MobileNetV3 [J].
Howard, Andrew ;
Sandler, Mark ;
Chu, Grace ;
Chen, Liang-Chieh ;
Chen, Bo ;
Tan, Mingxing ;
Wang, Weijun ;
Zhu, Yukun ;
Pang, Ruoming ;
Vasudevan, Vijay ;
Le, Quoc V. ;
Adam, Hartwig .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :1314-1324
[28]   Densely Connected Convolutional Networks [J].
Huang, Gao ;
Liu, Zhuang ;
van der Maaten, Laurens ;
Weinberger, Kilian Q. .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :2261-2269
[29]   Applicability of Earth Observation for Identifying Small-Scale Mining Footprints in a Wet Tropical Region [J].
Isidro, Celso M. ;
McIntyre, Neil ;
Lechner, Alex M. ;
Callow, Ian .
REMOTE SENSING, 2017, 9 (09)
[30]  
Jakubik Johannes, 2023, arXiv, DOI 10.48550/arXiv.2310.18660