The roles of nearest neighbor methods in imputing missing data in forest inventory and monitoring databases

被引:129
作者
Eskelson, Bianca N. I. [1 ]
Temesgen, Hailemariam [1 ]
Lemay, Valerie [2 ]
Barrett, Tara M. [3 ]
Crookston, Nicholas L. [4 ]
Hudak, Andrew T. [4 ]
机构
[1] Oregon State Univ, Dept Forest Engn Resources & Management, Corvallis, OR 97331 USA
[2] Univ British Columbia, Dept Forest Resources, Vancouver, BC V5Z 1M9, Canada
[3] US Forest Serv, Pacific NW Res Stn, USDA, Anchorage, AK USA
[4] US Forest Serv, Rocky Mt Res Stn, USDA, Moscow, ID USA
关键词
Consistent notation; forest measurements; input data for forest planning; nearest neighbor imputation; registration error; sources of X-variables; REGENERATION IMPUTATION MODELS; INDIVIDUAL TREE GROWTH; DISCRETE-RETURN LIDAR; REMOTELY-SENSED DATA; BASAL AREA; STAND CHARACTERISTICS; SATELLITE IMAGERY; COMPLEX STANDS; ATTRIBUTES; REGRESSION;
D O I
10.1080/02827580902870490
中图分类号
S7 [林业];
学科分类号
0829 ; 0907 ;
摘要
Almost universally, forest inventory and monitoring databases are incomplete, ranging from missing data for only a few records and a few variables, common for small land areas, to missing data for many observations and many variables, common for large land areas. For a wide variety of applications, nearest neighbor (NN) imputation methods have been developed to fill in observations of variables that are missing on some records (Y-variables), using related variables that are available for all records (X-variables). This review attempts to summarize the advantages and weaknesses of NN imputation methods and to give an overview of the NN approaches that have most commonly been used. It also discusses some of the challenges of NN imputation methods. The inclusion of NN imputation methods into standard software packages and the use of consistent notation may improve further development of NN imputation methods. Using X-variables from different data sources provides promising results, but raises the issue of spatial and temporal registration errors. Quantitative measures of the contribution of individual X-variables to the accuracy of imputing the Y-variables are needed. In addition, further research is warranted to verify statistical properties, modify methods to improve statistical properties, and provide variance estimators.
引用
收藏
页码:235 / 246
页数:12
相关论文
共 70 条
[31]   Estimating stand structural details using nearest neighbor analyses to link ground data, forest cover maps, and Landsat imagery [J].
LeMay, Valerie ;
Maedel, Jerry ;
Coops, Nicholas C. .
REMOTE SENSING OF ENVIRONMENT, 2008, 112 (05) :2578-2591
[32]  
Little R.J.A., 2002, Statistical analysis with missing data, DOI [10.1002/9781119013563, DOI 10.1002/9781119013563]
[33]   Model-based mean square error estimators for k-nearest neighbour predictions and applications using remotely sensed data for forest inventories [J].
Magnussen, Steen ;
McRoberts, Ronald E. ;
Tomppo, Erkki O. .
REMOTE SENSING OF ENVIRONMENT, 2009, 113 (03) :476-488
[34]   Locally adaptable non-parametric methods for estimating stand characteristics for wood procurement planning [J].
Malinen, J .
SILVA FENNICA, 2003, 37 (01) :109-120
[35]   Nonparametric estimation of stem volume using airborne laser scanning, aerial photography, and stand-register data [J].
Maltamo, M ;
Malinen, J ;
Packaln, P ;
Suvanto, A ;
Kangas, J .
CANADIAN JOURNAL OF FOREST RESEARCH, 2006, 36 (02) :426-436
[36]   Methods based on k-nearest neighbor regression in the prediction of basal area diameter distribution [J].
Maltamo, M ;
Kangas, A .
CANADIAN JOURNAL OF FOREST RESEARCH, 1998, 28 (08) :1107-1115
[37]   Most similar neighbour-based stand variable estimation for use in inventory by compartments in Finland [J].
Maltamo, M ;
Malinen, J ;
Kangas, A ;
Härkönen, S ;
Pasanen, AM .
FORESTRY, 2003, 76 (04) :449-463
[38]   The most similar neighbour reference in the yield prediction of Pinus kesiya stands in Zambia [J].
Maltamo, M ;
Eerikäinen, K .
SILVA FENNICA, 2001, 35 (04) :437-451
[39]   Estimation of Mediterranean forest attributes by the application of k-NN procedures to multitemporal Landsat ETM plus images [J].
Maselli, F ;
Chirici, G ;
Bottai, L ;
Corona, P ;
Marchetti, M .
INTERNATIONAL JOURNAL OF REMOTE SENSING, 2005, 26 (17) :3781-3796
[40]   Extension of environmental parameters over the land surface by improved fuzzy classification of remotely sensed data [J].
Maselli, F .
INTERNATIONAL JOURNAL OF REMOTE SENSING, 2001, 22 (17) :3597-3610