How many samples are needed? An investigation of binary logistic regression for selective omission in a road network

被引:3
作者
Zhou, Qi [1 ]
Li, Zhilin [2 ]
机构
[1] China Univ Geosci, Fac Informat Engn, Wuhan, Peoples R China
[2] Hong Kong Polytech Univ, Dept Land Surveying & Geoinformat, Kowloon, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Road network; selective omission; map generalization; binary logistic regression;
D O I
10.1080/15230406.2015.1104265
中图分类号
P9 [自然地理学]; K9 [地理];
学科分类号
0705 ; 070501 ;
摘要
Selective omission in a road network (or road selection) means to retain more important roads, and it is a necessary operator to transform a road network at a large scale to that at a smaller scale. This study discusses the use of the supervised learning approach to road selection, and investigates how many samples are needed for a good performance of road selection. More precisely, the binary logistic regression is employed and three road network data with different sizes and different target scales are involved for testing. The different percentages and numbers of strokes are randomly chosen for training a logistic regression model, which is further applied into the untrained strokes for validation. The performances of using the different sample sizes are mainly evaluated by an error rate estimate. Significance tests are also employed to investigate whether the use of different sample sizes shows statistically significant differences. The experimental results show that in most cases, the error rate estimate is around 0.1-0.2; more importantly, only a small number (e.g., 50-100) of training samples is needed, which indicates the usability of binary logistic regression for road selection.
引用
收藏
页码:405 / 416
页数:12
相关论文
共 28 条
[1]  
[Anonymous], 2009, ENCY DATABASE SYSTEM, DOI DOI 10.1007/978-0-387-39940-9_565
[2]  
ArcGIS Resources, 2012, ARCGIS HELP 10 1
[3]   Selective omission of road features based on mesh density for automatic map generalization [J].
Chen, Jun ;
Hu, Yungang ;
Li, Zhilin ;
Zhao, Renliang ;
Meng, Liqiu .
INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE, 2009, 23 (08) :1013-1032
[4]   Centrality measures in spatial networks of urban streets [J].
Crucitti, P ;
Latora, V ;
Porta, S .
PHYSICAL REVIEW E, 2006, 73 (03)
[5]   NEW TABLES FOR MULTIPLE COMPARISONS WITH CONTROL [J].
DUNNETT, CW .
BIOMETRICS, 1964, 20 (03) :482-&
[7]   Generalization-oriented road line classification by means of an artificial neural network [J].
Garcia Balboa, Jose Luis ;
Ariza Lopez, Francisco Javier .
GEOINFORMATICA, 2008, 12 (03) :289-312
[8]  
Garson G.D., 2012, Logistic regression: Binary multinomial
[9]  
Greene William, 2018, Econometric Analysis, V8th
[10]   A block-based selection method for road network generalization [J].
Gulgen, F. ;
Gokgoz, T. .
INTERNATIONAL JOURNAL OF DIGITAL EARTH, 2011, 4 (02) :133-153