Multinomial Logistic Regression and Random Forest Classifiers in Digital Mapping of Soil Classes in Western Haiti

被引:15
|
作者
Jeune, Wesly [1 ]
Francelino, Marcio Rocha [2 ]
de Souza, Eliana [3 ]
Fernandes Filho, Elpidio Inacio [2 ]
Rocha, Genelicio Crusoe [2 ]
机构
[1] Univ Quisqueya, Fac Sci Agr & Environm, Port Au Prince, Ouest, Haiti
[2] Univ Fed Vicosa, Dept Solos, Vicosa, MG, Brazil
[3] Univ Fed Vicosa, Dept Solos, Programa Posgrad Solos & Nutr Plantas, Vicosa, MG, Brazil
来源
REVISTA BRASILEIRA DE CIENCIA DO SOLO | 2018年 / 42卷
关键词
auxiliary data; digital soil mapping; soil survey; data-mining; CLASSIFICATION; MAP; CLIMATE; STATE;
D O I
10.1590/18069657rbcs20170133
中图分类号
S15 [土壤学];
学科分类号
0903 ; 090301 ;
摘要
Digital soil mapping (DSM) has been increasingly used to provide quick and accurate spatial information to support decision-makers in agricultural and environmental planning programs. In this study, we used a DSM approach to map soils in western Haiti and compare the performance of the Multinomial Logistic Regression (MLR) with Random Forest (RF) to classify the soils. The study area of 4,300 km(2) is mostly composed of diverse limestone rocks, alluvial deposits, and, to a lesser extent, basalt. A soil survey was conducted whereby soils were described and classified at 258 sites. Soil samples were collected and subjected to physical and chemical analyses. Recursive Feature Elimination (RFE) was used to select the most important covariates from auxiliary data, such as climate, lithology, and morphometric properties to describe the soil-landscape relationship. Mapping performance was assessed by the Kappa index and overall accuracy derived from a confusion matrix generated using a 5-fold cross validation process. In addition, an external mapping validation was carried out using an independent soil dataset. Accordingly, the soil dataset was split into 80 % and 20 % for training and validation of the models, respectively. No significant statistical difference (Z = 0.56< |1.96|) was found between maps generated with both classifiers (Kappa index 0.45 for MLR and 0.42 for RF). Based on the Kappa values, the classification performance can be characterized as moderate for both algorithms. Surprisingly, the RF classifier outperformed MLR in the validation process (Kappa values of 0.55 and 0.33, respectively). These results suggest a higher generalization ability of RF. However, no significant statistical difference (Z = 1.83< |1.96|) was observed. The soil map derived from RF indicated the occurrence of Leptosols (48.5 %), Gleysols (19.6 %), Chernozems (8 %), and Fluvisols (6.6 %) in most of the study area. The DSM approaches proved suitable for mapping soils in western Haiti and could be used in other parts of the country, thereby closing information gaps with regard to Haitian soils.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] Spatial prediction of WRB soil classes in an arid floodplain using multinomial logistic regression and random forest models, south-east of Iran
    Forghani, Seyed Javad
    Pahlavan-Rad, Mohammad Reza
    Esfandiari, Mehrdad
    Torkashvand, Ali Mohammadi
    ARABIAN JOURNAL OF GEOSCIENCES, 2020, 13 (13)
  • [2] Generation of digital soil mapping for Coimbatore districts using multinomial logistic regression approach
    Shankar, S. Vishnu
    Kumaraperumal, R.
    Radha, M.
    Kannan, Balaji
    Patil, S. G.
    Vanitha, G.
    Raj, M. Nivas
    Athira, M.
    Ananthakrishnan, S.
    ENVIRONMENTAL EARTH SCIENCES, 2024, 83 (24)
  • [3] Downscaling legacy soil information for hydrological soil mapping using multinomial logistic regression
    Smit, I. E.
    Van Zijl, G. M.
    Riddell, E. S.
    Van Tol, J. J.
    GEODERMA, 2023, 436
  • [4] Digital soil mapping at pilot sites in the northwest coast of Egypt: A multinomial logistic regression approach
    Abdel-Kader, Fawzy Hassan
    EGYPTIAN JOURNAL OF REMOTE SENSING AND SPACE SCIENCES, 2011, 14 (01) : 29 - 40
  • [5] Spatial prediction of WRB soil classes in an arid floodplain using multinomial logistic regression and random forest models, south-east of Iran
    Seyed Javad Forghani
    Mohammad Reza Pahlavan-Rad
    Mehrdad Esfandiari
    Ali Mohammadi Torkashvand
    Arabian Journal of Geosciences, 2020, 13
  • [6] Digital mapping of soil texture classes using Random Forest classification algorithm
    Dharumarajan, Subramanian
    Hegde, Rajendra
    SOIL USE AND MANAGEMENT, 2022, 38 (01) : 135 - 149
  • [7] Multinomial logistic regression with soil diagnostic features and land surface parameters for soil mapping of Latium (Central Italy)
    Piccini, Chiara
    Marchetti, Alessandro
    Rivieccio, Rosa
    Napoli, Rosario
    GEODERMA, 2019, 352 : 385 - 394
  • [8] Multivariate random forest for digital soil mapping
    van der Westhuizen, Stephan
    Heuvelink, Gerard B. M.
    Hofmeyr, David P.
    GEODERMA, 2023, 431
  • [9] Forest Fire Probability Mapping in Eastern Serbia: Logistic Regression versus Random Forest Method
    Milanovic, Slobodan
    Markovic, Nenad
    Pamucar, Dragan
    Gigovic, Ljubomir
    Kostic, Pavle
    Milanovic, Sladjan D.
    FORESTS, 2021, 12 (01): : 1 - 17
  • [10] Provincial-scale digital soil mapping using a random forest approach for British Columbia
    Heung, Brandon
    Bulmer, Chuck E.
    Schmidt, Margaret G.
    Zhang, Jin
    CANADIAN JOURNAL OF SOIL SCIENCE, 2022, 102 (03) : 597 - 620