Unlabeled Sample Selection for Mineral Prospectivity Mapping by Semi-supervised Support Vector Machine

被引:8
作者
Tao, Jintao [1 ,2 ,3 ,4 ]
Zhang, Nannan [1 ,2 ,3 ,4 ]
Chang, Jinyu [1 ,2 ,3 ,4 ]
Chen, Li [1 ,2 ,3 ,4 ]
Zhang, Hao [1 ,2 ,3 ,4 ]
Chi, Yujin [1 ,2 ,3 ,4 ]
机构
[1] Chinese Acad Sci, State Key Lab Desert & Oasis Ecol, Xinjiang Inst Ecol & Geog, Urumqi 830011, Peoples R China
[2] Xinjiang Key Lab Mineral Resources & Digital Geol, Urumqi 830011, Peoples R China
[3] Chinese Acad Sci, Xinjiang Res Ctr Mineral Resources, Urumqi 830011, Peoples R China
[4] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
关键词
Mineral prospectivity mapping; Euclidean distance-based similarity measure; Unlabeled sample selection; Semi-supervised support vector machine (S3VM); Honghai Cu-Zn deposit; EASTERN TIANSHAN OROGEN; CU-ZN DEPOSIT; NEURAL-NETWORK; VMS MINERALIZATION; KALATAG DISTRICT; BAGUIO DISTRICT; 3-D INVERSION; RANDOM FOREST; FUZZY-LOGIC; CLASSIFICATION;
D O I
10.1007/s11053-022-10093-0
中图分类号
P [天文学、地球科学];
学科分类号
07 ;
摘要
Semi-supervised learning (SSL) algorithms can use unlabeled data to improve the performance of supervised learning algorithms for mineral prospectivity mapping with few known mineral deposits or mineralized blocks. However, SSL algorithms are sensitive to unlabeled samples and, in some cases, perform worse than supervised algorithms. In this study, a quasi-Newton method for semi-supervised support vector machine (QN-S3VM) was used in the 3D mineral prospectivity mapping of the Honghai volcanogenic massive sulfide Cu-Zn deposit in eastern Tianshan, northwestern China. Three Euclidean distance-based similarity measures of unlabeled samples to known mineral deposits or mineralized blocks were proposed to select unlabeled samples. The influence of the similarity and number of unlabeled samples on the performance of the QN-S3VM was investigated. The results showed that lower similarity in unlabeled samples yielded enhanced QN-S3VM performance. The performance of the QN-S3VM was affected by the number of unlabeled samples used. However, there was no consistent pattern among them. Compared with random selection, the QN-S3VM trained with unlabeled samples selected by the similarity measure had higher generalization and stability. Among the maximum, minimum, and average similarities, the minimum similarity had the best generalization while the average similarity had the best stability. Therefore, similarity to known mineral deposits or mineralized blocks is a good tool for unlabeled sample selection. This can effectively guarantee the performance of SSL for mineral prospectivity mapping.
引用
收藏
页码:2247 / 2269
页数:23
相关论文
共 112 条
[1]  
Abe S, 2010, ADV PATTERN RECOGNIT, P1, DOI 10.1007/978-1-84996-098-4_1
[2]   Support vector machine for multi-classification of mineral prospectivity areas [J].
Abedi, Maysam ;
Norouzi, Gholam-Hossain ;
Bahroudi, Abbas .
COMPUTERS & GEOSCIENCES, 2012, 46 :272-283
[3]  
Agterberg F.P., 2011, Natural Resources Research, V20, P95, DOI [10.1007/s11053-011-9138-0, DOI 10.1007/S11053-011-9138-0]
[4]  
Agterberg F.P., 1992, Nonrenewable Resources, V1, P39, DOI 10.1007/BF01782111
[5]  
[Anonymous], 2011, MINERAL EXPLORATION
[6]  
[Anonymous], 2000, Numerical Optimization
[7]  
Ben-David S., 2008, C LEARN THEOR COLT
[8]  
Bishop C.M., 2006, PATTERN RECOGNITION, DOI [DOI 10.18637/JSS.V017.B05, 10.1117/1.2819119]
[9]  
Bonham-Carter GraemeF., 1994, Geographic Information Systems for Geoscientists, DOI DOI 10.1016/C2013-0-03864-9
[10]   Artificial neural networks: a new method for mineral prospectivity mapping [J].
Brown, WM ;
Gedeon, TD ;
Groves, DI ;
Barnes, RG .
AUSTRALIAN JOURNAL OF EARTH SCIENCES, 2000, 47 (04) :757-770