Forest Height Mapping Using Feature Selection and Machine Learning by Integrating Multi-Source Satellite Data in Baoding City, North China

被引:39
作者
Zhang, Nan [1 ]
Chen, Mingjie [1 ]
Yang, Fan [2 ]
Yang, Cancan [1 ,3 ]
Yang, Penghui [1 ]
Gao, Yushan [1 ]
Shang, Yue [1 ]
Peng, Daoli [1 ]
机构
[1] Beijing Forestry Univ, Coll Forestry, State Forestry & Grassland Adm Key Lab Forest Res, Beijing 100083, Peoples R China
[2] Acad Inventory & Planning, Natl Forestry & Grassland Adm, Beijing 100714, Peoples R China
[3] Chuzhou Univ, Anhui Prov Key Lab Phys Geog Environm, Chuzhou 239000, Peoples R China
基金
国家重点研发计划;
关键词
forest height; multi-source data; feature selection; machine learning algorithm; ABOVEGROUND BIOMASS; SAR; ALGORITHMS;
D O I
10.3390/rs14184434
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Accurate estimation of forest height is crucial for the estimation of forest aboveground biomass and monitoring of forest resources. Remote sensing technology makes it achievable to produce high-resolution forest height maps in large geographical areas. In this study, we produced a 25 m spatial resolution wall-to-wall forest height map in Baoding city, north China. We evaluated the effects of three factors on forest height estimation utilizing four types of remote sensing data (Sentinel-1, Sentinel-2, ALOS PALSAR-2, and SRTM DEM) with the National Forest Resources Continuous Inventory (NFCI) data, three feature selection methods (stepwise regression analysis (SR), recursive feature elimination (RFE), and Boruta), and six machine learning algorithms (k-nearest neighbor (k-NN), support vector machine regression (SVR), random forest (RF), gradient boosting decision tree (GBDT), extreme gradient boosting (XGBoost), and categorical boosting (CatBoost)). ANOVA was adopted to quantify the effects of three factors, including data source, feature selection method, and modeling algorithm, on forest height estimation. The results showed that all three factors had a significant influence. The combination of multiple sensor data improved the estimation accuracy. Boruta's overall performance was better than SR and RFE, and XGBoost outperformed the other five machine learning algorithms. The variables selected based on Boruta, including Sentinel-1, Sentinel-2, and topography metrics, combined with the XGBoost algorithm, provided the optimal model (R-2 = 0.67, RMSE = 2.2 m). Then, we applied the best model to create the forest height map. There were several discrepancies between the generated forest height map and the existing map product, and the values with large differences between the two maps were mostly distributed in the steep areas with high slope values. Overall, we proposed a methodological framework for quantifying the importance of data source, feature selection method, and machine learning algorithm in forest height estimation, and it was proved to be effective in estimating forest height by using freely accessible multi-source data, advanced feature selection method, and machine learning algorithm.
引用
收藏
页数:27
相关论文
共 86 条
[81]   An Evaluation of Eight Machine Learning Regression Algorithms for Forest Aboveground Biomass Estimation from Multiple Satellite Data Products [J].
Zhang, Yuzhen ;
Ma, Jun ;
Liang, Shunlin ;
Li, Xisheng ;
Li, Manyao .
REMOTE SENSING, 2020, 12 (24) :1-26
[82]   Forest Biomass Mapping of Northeastern China Using GLAS and MODIS Data [J].
Zhang, Yuzhen ;
Liang, Shunlin ;
Sun, Guoqing .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2014, 7 (01) :140-152
[83]   Biomass retrieval from L-band polarimetric UAVSAR backscatter and PRISM stereo imagery [J].
Zhang, Zhiyu ;
Ni, Wenjian ;
Sun, Guoqing ;
Huang, Wenli ;
Ranson, Kenneth J. ;
Cook, Bruce D. ;
Guo, Zhifeng .
REMOTE SENSING OF ENVIRONMENT, 2017, 194 :331-346
[84]   Forest aboveground biomass estimation in Zhejiang Province using the integration of Landsat TM and ALOS PALSAR data [J].
Zhao, Panpan ;
Lu, Dengsheng ;
Wang, Guangxing ;
Liu, Lijuan ;
Li, Dengqiu ;
Zhu, Jinru ;
Yu, Shuquan .
INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2016, 53 :1-15
[85]   Comparison of machine learning algorithms for forest parameter estimations and application for forest quality assessments [J].
Zhao, Qingxia ;
Yu, Shichuan ;
Zhao, Fei ;
Tian, Linghong ;
Zhao, Zhong .
FOREST ECOLOGY AND MANAGEMENT, 2019, 434 :224-234
[86]   Structure damage detection based on random forest recursive feature elimination [J].
Zhou, Qifeng ;
Zhou, Hao ;
Zhou, Qingqing ;
Yang, Fan ;
Luo, Linkai .
MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2014, 46 (01) :82-90