Using machine learning to estimate a key missing geochemical variable in mining exploration: Application of the Random Forest algorithm to multisensor core logging data

被引:24
|
作者
Schnitzler, N. [1 ,2 ]
Ross, P-S [1 ]
Gloaguen, E. [1 ]
机构
[1] Inst Natl Rech Sci, 490 Couronne, Quebec City, PQ G1K 9A9, Canada
[2] Sirius Resources Inc, 1000 St Antoine Ouest,Suite 410, Montreal, PQ H3C 3R7, Canada
关键词
Artificial intelligence; Geochemistry; Supervised method; Mineral exploration; RAY-FLUORESCENCE MEASUREMENTS; IMPROVING LITHOLOGICAL DISCRIMINATION; MASSIVE SULFIDE DEPOSIT; ABITIBI GREENSTONE-BELT; DRILL-CORES; MINERAL PROSPECTIVITY; COMPOSITIONAL DATA; HOST ROCKS; CAMP; DISTRICT;
D O I
10.1016/j.gexplo.2019.106344
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Mining exploration increasingly relies on large, multivariate databases storing data ranging from drill core geochemical analysis to geophysical data or geological descriptions. Utilizing these large datasets to their full potential implies the use of multivariate statistical analysis such as machine learning. The Random Forest algorithm has proved its efficiency in mining applications. In this study we use it to estimate a key geochemical element, sodium, using a multivariate chemo-physical dataset measured on drill cores in the Matagami mining district of Quebec, Canada. Sodium is important to characterize hydrothermal alteration in volcanogenic massive sulfide settings, since Na depletion can be used to vector towards ore, but this element is not readily measured by portable X-ray fluorescence (pXRF). We first test the algorithm on a database of over 8000 traditional laboratory geochemistry analyses and find a correlation of 0.95 between estimated and measured Na. We then test the algorithm on the multi-sensor core logging data, including density, magnetic susceptibility, and 15 geochemical elements by pXRF, but borrowing Na from traditional geochemistry (n = 260). This yields correlations of 0.66 to 0.75 depending on the training and testing sets. Finally the algorithm is applied to the whole multiparameter database (n = 9675) to estimate Na downcore. There is a good general correspondence with the downcore Na patterns seen through traditional geochemistry, and the estimated Na which has much greater spatial resolution. Random Forest appears to be a very good estimation tool when using large amounts of data and variables, as it uses all variables and automatically prioritizes the most useful. This method also allows visualization of the weight of each variable in the estimation. Future studies should compare RF with other methods.
引用
收藏
页数:20
相关论文
共 9 条
  • [1] AUTOCLASSIFICATION OF THE VARIABLE 3XMM SOURCES USING THE RANDOM FOREST MACHINE LEARNING ALGORITHM
    Farrell, Sean A.
    Murphy, Tara
    Lo, Kitty K.
    ASTROPHYSICAL JOURNAL, 2015, 813 (01):
  • [2] Novel Approach for Generating the Key of Stream Cipher System Using Random Forest Data Mining Algorithm
    Ali, Samaher Hussein
    2013 SIXTH INTERNATIONAL CONFERENCE ON DEVELOPMENTS IN ESYSTEMS ENGINEERING (DESE), 2014, : 259 - 269
  • [3] Overview of Machine Learning Techniques in Cybersecurity Data Science using Gradient Boosting and Random Forest Algorithm
    Gulhane, Kimsy
    Saxena, Surabhi
    Deogaonkar, Anant
    Kumar, Vinod
    Vichoray, Chandan
    Goyal, Shweta
    2024 SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT CYBER PHYSICAL SYSTEMS AND INTERNET OF THINGS, ICOICI 2024, 2024, : 19 - 24
  • [4] Identifying the key factors influencing Chinese carbon intensity using machine learning, the random forest algorithm, and evolutionary analysis
    Liu W.
    Tang Z.
    Xia Y.
    Han M.
    Jiang W.
    Dili Xuebao/Acta Geographica Sinica, 2019, 74 (12): : 2592 - 2603
  • [5] Application of the Random Forest Machine Learning Algorithm for Recognizing Patient Arm Movements While Using a Bionic Prosthesis
    Burtsev, N., I
    Shagdurov, V. C.
    Demkin, I. O.
    XIV RUSSIAN-GERMANY CONFERENCE ON BIOMEDICAL ENGINEERING (RGC-2019), 2019, 2140
  • [6] Application of Machine Learning Algorithm and Remote-sensed Data to Estimate Forest Gross Primary Production at Multi-sites Level
    Lee, Bora
    Kim, Eunsook
    Lim, Jong-Hwan
    Kang, Minseok
    Kim, Joon
    KOREAN JOURNAL OF REMOTE SENSING, 2019, 35 (06) : 1117 - 1132
  • [7] Inner parameters' optimization in the artificial neural network for the traffic data classification in radiofrequency applications Classification of nonstationary data using the machine learning algorithm "random forest"
    Alexandrovich, Kalashnikov Evgeniy
    Baurzhanovna, Kondybayeva Almagul
    Petrovna, Ositis Anastasia
    2018 SYSTEMS OF SIGNALS GENERATING AND PROCESSING IN THE FIELD OF ON BOARD COMMUNICATIONS, 2018,
  • [8] A Predictive Model to identify possible affected Bipolar disorder students using Naive Baye's, Random Forest and SVM machine learning techniques of data mining and Building a Sequential Deep Learning Model using Keras
    Peerbasha, S.
    Surputheen, M. Mohamed
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2021, 21 (05): : 267 - 274
  • [9] Ground visibility prediction using tree-based and random-forest machine learning algorithm: Comparative study based on atmospheric pollution and atmospheric boundary layer data
    Wang, Fuzeng
    Liu, Ruolan
    Yan, Hao
    Liu, Duanyang
    Han, Lin
    Yuan, Shujie
    ATMOSPHERIC POLLUTION RESEARCH, 2024, 15 (11)