Machine-learning-based quantitative estimation of soil organic carbon content by VIS/NIR spectroscopy

Cited by: 50
Authors
Ding, Jianli [1, 2]
Yang, Aixia [1, 3]
Wang, Jingzhe [1, 2]
Sagan, Vasit [4]
Yu, Danlin [5, 6]
Affiliations
[1] Xinjiang Univ, Coll Resources & Environm Sci, Higher Educ Inst, Key Lab Smart City & Environm Modelling, Urumqi, Peoples R China
[2] Xinjiang Univ, Key Lab Oasis Ecol, Urumqi, Peoples R China
[3] Qinzhou Univ, Coll Resources & Environm Sci, Qinzhou, Peoples R China
[4] St Louis Univ, Dept Earth & Atmospher Sci, St Louis, MO 63103 USA
[5] Montclair State Univ, Dept Earth & Environm Studies, Montclair, NJ USA
[6] Renmin Univ China, Sch Sociol & Populat Studies, Beijing, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Ebinur Lake wetland; Desert wetland soil; Soil organic carbon; Machine learning; Near-infrared spectroscopy; Artificial neural network; Reflectance spectroscopy; Random forest; Spatial variability; Regional scale; Least squares; Land use; Matter; Regression
DOI
10.7717/peerj.5714
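Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Discipline classification code
07; 0710; 09
Abstract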
Soil organic carbon (SOC) is an important soil property that has a profound impact on soil quality and plant growth. Using 140 soil samples collected from the Ebinur Lake Wetland National Nature Reserve, Xinjiang Uyghur Autonomous Region, China, this research evaluated the feasibility of using visible/near-infrared (VIS/NIR) spectroscopy data (350-2,500 nm) and simulated EO-1 Hyperion data to estimate SOC in arid wetland regions. Three machine learning algorithms, Ant Colony Optimization-interval Partial Least Squares (ACO-iPLS), Recursive Feature Elimination-Support Vector Machine (RF-SVM), and Random Forest (RF), were employed to select spectral features and then estimate SOC. Results indicated that the feature wavelengths pertaining to SOC were mainly within the ranges of 745-910 nm and 1,911-2,254 nm. The combination of RF-SVM and first-derivative pre-processing produced the highest estimation accuracy, with optimal values of R_t (correlation coefficient of the testing set), RMSE_t and RPD of 0.91, 0.27% and 2.41, respectively. With the simulated EO-1 Hyperion data, the Support Vector Machine (SVM) based recursive feature elimination algorithm produced the most accurate estimate of SOC content; for the testing set, R_t was 0.79, RMSE_t was 0.19%, and RPD was 1.61. This practice provides an efficient, low-cost approach with potentially high accuracy to estimate SOC content and hence supports better management and protection strategies for desert wetland ecosystems.
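The three accuracy measures reported in the abstract can be illustrated with a minimal sketch. It assumes the commonly used definitions: R is the Pearson correlation between observed and predicted values, RMSE is the root mean squared error, and RPD is the sample standard deviation of the observed values divided by the RMSE. The abstract does not state which variants the authors used, so these definitions, the function name `evaluate`, and the sample values are illustrative assumptions, not the paper's data or code.

```python
import math
import statistics


def evaluate(observed, predicted):
    """Return (r, rmse, rpd) for paired observed and predicted values.

    Assumed definitions (not confirmed by the source):
      r    = Pearson correlation coefficient
      rmse = root mean squared error
      rpd  = sample standard deviation of observed values / rmse
    """
    if len(observed) != len(predicted) or len(observed) < 2:
        raise ValueError("need two equally sized sequences with at least 2 values")

    n = len(observed)
    mean_o = statistics.fmean(observed)
    mean_p = statistics.fmean(predicted)

    # Pearson correlation from centered sums of products and squares.
    cov = sum((o - mean_o) * (p - mean_p) for o, p in zip(observed, predicted))
    ss_o = sum((o - mean_o) ** 2 for o in observed)
    ss_p = sum((p - mean_p) ** 2 for p in predicted)
    r = cov / math.sqrt(ss_o * ss_p)

    # Root mean squared error of the predictions.
    rmse = math.sqrt(sum((o - p) ** 2 for o, p in zip(observed, predicted)) / n)

    # Ratio of performance to deviation.
    rpd = statistics.stdev(observed) / rmse
    return r, rmse, rpd


# Hypothetical SOC values in percent, invented purely for illustration.
observed = [1.0, 2.0, 3.0, 4.0]
predicted = [1.5, 2.5, 3.5, 4.5]
r, rmse, rpd = evaluate(observed, predicted)
```

In this toy case every prediction is offset by the same 0.5, so R is 1 even though the RMSE is not zero. This is one reason RMSE and RPD are usually reported alongside the correlation coefficient.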
Pages: 24