Metrics for evaluating the performance of machine learning based automated valuation models

被引:58
作者
Steurer, Miriam [1 ]
Hill, Robert J. [1 ]
Pfeifer, Norbert [1 ]
机构
[1] Karl Franzens Univ Graz, Dept Econ, Graz, Austria
关键词
Performance metrics; automated valuation; model selection; machine learning; house price prediction; REGRESSION;
D O I
10.1080/09599916.2020.1858937
中图分类号
TU98 [区域规划、城乡规划];
学科分类号
0814 ; 082803 ; 0833 ;
摘要
Automated Valuation Models (AVMs) based on Machine Learning (ML) algorithms are widely used for predicting house prices. While there is consensus in the literature that cross-validation (CV) should be used for model selection in this context, the interdisciplinary nature of the subject has made it hard to reach consensus over which metrics to use at each stage of the CV exercise. We collect 48 metrics (from the AVM literature and elsewhere) and classify them into seven groups according to their structure. Each of these groups focuses on a particular aspect of the error distribution. Depending on the type of data and the purpose of the AVM, the needs of users may be met by some classes, but not by others. In addition, we show in an empirical application how the choice of metric can influence the choice of model, by applying each metric to evaluate five commonly used AVM models. Finally - since it is not always practicable to produce 48 different performance metrics - we provide a short list of 7 metrics that are well suited to evaluate AVMs. These metrics satisfy a symmetry condition that we find is important for AVM performance, and can provide a good overall model performance ranking.
引用
收藏
页码:99 / 129
页数:31
相关论文
共 62 条
[1]   Assessment and prediction of tropospheric ozone concentration levels using artificial neural networks [J].
Abdul-Wahab, SA ;
Al-Alawi, SM .
ENVIRONMENTAL MODELLING & SOFTWARE, 2002, 17 (03) :219-228
[2]   Can regulation de-bias appraisers? [J].
Agarwal, Sumit ;
Ambrose, Brent W. ;
Yao, Vincent W. .
JOURNAL OF FINANCIAL INTERMEDIATION, 2020, 44
[3]   Using ridge regression with genetic algorithm to enhance real estate appraisal forecasting [J].
Ahn, Jae Joon ;
Byun, Hyun Woo ;
Oh, Kyong Joo ;
Kim, Tae Yoon .
EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (09) :8369-8379
[4]  
Akaike H., 1973, 2 INT S INFORM THEOR, P267
[5]  
[Anonymous], MACH LEARN, DOI [10.1023/A:1010933404324, DOI 10.1023/A:1010933404324]
[6]   Mass appraisal of residential apartments: An application of Random forest for valuation and a CART-based approach for model diagnostics [J].
Antipov, Evgeny A. ;
Pokryshevskaya, Elena B. .
EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (02) :1772-1778
[7]   Machine Learning Methods for Demand Estimation [J].
Bajari, Patrick ;
Nekipelov, Denis ;
Ryan, Stephen P. ;
Yang, Miaoyu .
AMERICAN ECONOMIC REVIEW, 2015, 105 (05) :481-485
[8]   SmcHD1, containing a structural-maintenance-of-chromosomes hinge domain, has a critical role in X inactivation [J].
Blewitt, Marnie E. ;
Gendrel, Anne-Valerie ;
Pang, Zhenyi ;
Sparrow, Duncan B. ;
Whitelaw, Nadia ;
Craig, Jeffrey M. ;
Apedaile, Anwyn ;
Hilton, Douglas J. ;
Dunwoodie, Sally L. ;
Brockdorff, Neil ;
Kay, Graham F. ;
Whitelaw, Emma .
NATURE GENETICS, 2008, 40 (05) :663-669
[9]  
Bogin A.N., FHFA STAFF WORKING P
[10]  
Ceh M, 2018, ISPRS INT J GEO-INF, V7, DOI [10.3390/ijgi7050168, 10.3390/ijgi705168]