A MACHINE LEARNING APPROACH TO ANALYZING THE RELATIONSHIP BETWEEN TEMPERATURES AND MULTI-PROXY TREE-RING RECORDS

被引:16
作者
Jevsenak, Jernej [1 ]
Dzeroski, Saso [2 ,3 ]
Zavadlav, Sasa [1 ]
Levanic, Tom [1 ]
机构
[1] Slovenian Forestry Inst, Vecna Pot 2, Ljubljana 1000, Slovenia
[2] Jozef Stefan Inst, Jamova Cesta 39, Ljubljana 1000, Slovenia
[3] Jozef Stefan Int Postgrad Sch, Jamova Cesta 39, Ljubljana 1000, Slovenia
基金
欧盟第七框架计划;
关键词
multiple linear regression; machine learning; random forests; bagging; model trees; artificial neural networks; dendroclimatology; ARTIFICIAL NEURAL-NETWORKS; SUMMER TEMPERATURES; LATEWOOD-WIDTH; RANDOM FORESTS; WOOD FORMATION; R PACKAGE; CLASSIFICATION; OAK; PRECIPITATION; DENSITY;
D O I
10.3959/1536-1098-74.2.210
中图分类号
S7 [林业];
学科分类号
0829 ; 0907 ;
摘要
Machine learning (ML) is a widely unexplored field in dendroclimatology, but it is a powerful tool that might improve the accuracy of climate reconstructions. In this paper, different ML algorithms are compared to climate reconstruction from tree-ring proxies. The algorithms considered are multiple linear regression (MLR), artificial neural networks (ANN), model trees (MT), bagging of model trees (BMT), and random forests of regression trees (RF). April-May mean temperature at a Quercus robur stand in Slovenia is predicted with mean vessel area (MVA, correlation coefficient with April-May mean temperature, r = 0.70, p < 0.001) and earlywood width (EW, r = -0.28, p < 0.05). Similarly, June-August mean temperature is predicted with stable carbon isotope (delta C-13, r = 0.72, p < 0.001), stable oxygen (delta O-18, r = isotope 0.32, p < 0.05) and tree-ring width (TRW, r = 0.11, p > 0.05 (ns)) chronologies. The predictive performance of ML algorithms was estimated by 3-fold cross-validation repeated 100 times. In both spring and summer temperature models, BMT performed best respectively in 62% and 52% of the 100 repetitions. The second-best method was ANN. Although BMT gave the best validation results, the differences in the models' performances were minor. We therefore recommend always comparing different ML regression techniques and selecting the optimal one for applications in dendroclimatology.
引用
收藏
页码:210 / 224
页数:15
相关论文
共 76 条
  • [1] [Anonymous], 1956, Statistical forecasting project-scientific report No. 1
  • [2] [Anonymous], TOPOLA
  • [3] [Anonymous], 1976, Tree Rings and Climate
  • [4] A comparison of supervised learning techniques in the classification of bat echolocation calls
    Armitage, David W.
    Ober, Holly K.
    [J]. ECOLOGICAL INFORMATICS, 2010, 5 (06) : 465 - 473
  • [5] Balybina A.S., 2010, GEOGR NAT RESOUR, V31, P30, DOI DOI 10.1016/J.GNR.2010.03.006
  • [6] Neural networks in materials science
    Bhadeshia, HKDH
    [J]. ISIJ INTERNATIONAL, 1999, 39 (10) : 966 - 979
  • [7] Bishop C.M., 1995, Neural networks for pattern recognition
  • [8] SmcHD1, containing a structural-maintenance-of-chromosomes hinge domain, has a critical role in X inactivation
    Blewitt, Marnie E.
    Gendrel, Anne-Valerie
    Pang, Zhenyi
    Sparrow, Duncan B.
    Whitelaw, Nadia
    Craig, Jeffrey M.
    Apedaile, Anwyn
    Hilton, Douglas J.
    Dunwoodie, Sally L.
    Brockdorff, Neil
    Kay, Graham F.
    Whitelaw, Emma
    [J]. NATURE GENETICS, 2008, 40 (05) : 663 - 669
  • [9] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [10] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32