Geological Domaining with Unsupervised Clustering and Ensemble Support Vector Classification

被引：2

作者：

Koruk, Kasimcan ^{[1
,2
]}

Ortiz, Julian M. ^{[2
]}

机构：

[1] Gen Directorate Mineral Res & Explorat, Dept Feasibil Studies, Ankara, Turkiye

[2] Queens Univ, Robert M Buchan Dept Min, Kingston, ON K7L 3N6, Canada

来源：

MINING METALLURGY & EXPLORATION | 2023年 / 40卷 / 6期

基金：

加拿大自然科学与工程研究理事会;

关键词：

Machine learning; Ensemble learning; Geological domaining; Support vector classification; Geostatistics; MINERAL-RESOURCES;

D O I：

10.1007/s42461-023-00858-3

中图分类号：

TF [冶金工业];

学科分类号：

0806 ;

摘要：

Building a geological model is important in resource estimation, as it defines the extent of domains for estimation. Geological models are built by assigning a domain to the samples, based on logging of different features related to lithology, alteration, and mineralization. Then, the extent of these domains is determined using geometric or geostatistical tools. However, these models should account for uncertainty, since the actual extent of these domains is not known and must be extrapolated from the labelled samples. With the availability of geochemical datasets and machine learning techniques, employing multiple variables to build resource models should be considered. This article proposes a two-step machine learning approach with an ensemble implementation to define geological domains and their uncertainties. First, an unsupervised binary clustering method labels the geochemical samples into two domains. These labelled samples are used to inform a geological domain model built by support vector classification. This application is repeated with subsets of variables and subsets of samples, leading to an ensemble learning method. Weak models are produced with each subset of samples and variables, and combined to devise a stronger learner. The final model accounts for uncertainties thanks to the ensemble implementation. The proposed workflow is demonstrated on a dataset from a porphyry copper deposit, applied hierarchically to define four domains. Performance is comparable with traditional methods, but provides the advantage of allowing domain knowledge in the selection of the variables, and generates domain boundaries that can be controlled in terms of their continuity and smoothness.

引用

页码：2537 / 2549

页数：13

共 25 条

[1]

Abzalov M., 2016, Applied Mining Geology, DOI [DOI 10.1007/978-3-319-39264-6, 10.1007/978-3-319-39264-6_2, 10.1007/978-3-319-39264-6]

[2]

Armstrong M, 2011, PLURIGAUSSIAN SIMULATIONS IN GEOSCIENCES, P1, DOI 10.1007/978-3-642-19607-2

[3] A combined multivariate approach analyzing geochemical data for knowledge discovery: The Vazante - Paracatu Zinc District, Minas Gerais, Brazil [J].

Cevik, Ilkay S. ;

Olivo, Gema R. ;

Ortiz, Julian M. .

JOURNAL OF GEOCHEMICAL EXPLORATION, 2021, 221

[4] A sequential indicator simulation program for categorical variables with point and block data: BlockSIS [J].

Deutsch, Clayton V. .

COMPUTERS & GEOSCIENCES, 2006, 32 (10) :1669-1681

[5]

Duke J.H., 2001, Monograph Ser.-Australian Inst. of Min. and Metall, V23, P147

[6] Machine Learning-A Review of Applications in Mineral Resource Estimation [J].

Dumakor-Dupey, Nelson K. ;

Arya, Sampurna .

ENERGIES, 2021, 14 (14)

[7]

Emery X, 2005, J S AFR I MIN METALL, V105, P247

[8] A Simple Unsupervised Classification Workflow for Defining Geological Domains Using Multivariate Data [J].

Faraj, Fouad ;

Ortiz, Julian M. .

MINING METALLURGY & EXPLORATION, 2021, 38 (03) :1609-1623

[9]

Friedman J., 2001, The elements of statistical learning: Data mining, inference, and prediction, DOI DOI 10.1007/978

[10]

GALLI A, 1994, QUANT GEO G, V7, P217

← 1 2 3 →