Sensing spatial distribution of urban land use by integrating points-of-interest and Google Word2Vec model

被引:381
作者
Yao, Yao [1 ]
Li, Xia [1 ]
Liu, Xiaoping [1 ]
Liu, Penghua [2 ]
Liang, Zhaotang [2 ]
Zhang, Jinbao [2 ]
Mai, Ke [2 ]
机构
[1] Sun Yat Sen Univ, Guangdong Key Lab Urbanizat & Geosimulat, Sch Geog & Planning, Guangzhou 510275, Guangdong, Peoples R China
[2] Sun Yat Sen Univ, Sch Geog & Planning, Guangzhou, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
Land use; Word2Vec; point-of-interest; deep learning; topic model; SCENE CLASSIFICATION; IMPLEMENTATION;
D O I
10.1080/13658816.2016.1244608
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Urban land use information plays an essential role in a wide variety of urban planning and environmental monitoring processes. During the past few decades, with the rapid technological development of remote sensing (RS), geographic information systems (GIS) and geospatial big data, numerous methods have been developed to identify urban land use at a fine scale. Points-of-interest (POIs) have been widely used to extract information pertaining to urban land use types and functional zones. However, it is difficult to quantify the relationship between spatial distributions of POIs and regional land use types due to a lack of reliable models. Previous methods may ignore abundant spatial features that can be extracted from POIs. In this study, we establish an innovative framework that detects urban land use distributions at the scale of traffic analysis zones (TAZs) by integrating Baidu POIs and a Word2Vec model. This framework was implemented using a Google open-source model of a deep-learning language in 2013. First, data for the Pearl River Delta (PRD) are transformed into a TAZ-POI corpus using a greedy algorithm by considering the spatial distributions of TAZs and inner POIs. Then, high-dimensional characteristic vectors of POIs and TAZs are extracted using the Word2Vec model. Finally, to validate the reliability of the POI/TAZ vectors, we implement a K-Means-based clustering model to analyze correlations between the POI/TAZ vectors and deploy TAZ vectors to identify urban land use types using a random forest algorithm (RFA) model. Compared with some state-of-the-art probabilistic topic models (PTMs), the proposed method can efficiently obtain the highest accuracy (OA = 0.8728, kappa = 0.8399). Moreover, the results can be used to help urban planners to monitor dynamic urban land use and evaluate the impact of urban planning schemes.
引用
收藏
页码:825 / 848
页数:24
相关论文
共 54 条
[1]   An information-theoretic perspective of tf-idf measures [J].
Aizawa, A .
INFORMATION PROCESSING & MANAGEMENT, 2003, 39 (01) :45-65
[2]  
[Anonymous], 2007, ENCY EARTH
[3]   Toward mapping land-use patterns from volunteered geographic information [J].
Arsanjani, Jamal Jokar ;
Helbich, Marco ;
Bakillah, Mohamed ;
Hagenauer, Julian ;
Zipf, Alexander .
INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE, 2013, 27 (12) :2264-2278
[4]   A Study on Sentiment Computing and Classification of Sina Weibo with Word2vec [J].
Bai Xue ;
Chen Fu ;
Zhan Shaobin .
2014 IEEE INTERNATIONAL CONGRESS ON BIG DATA (BIGDATA CONGRESS), 2014, :358-363
[5]  
Biau G, 2012, J MACH LEARN RES, V13, P1063
[6]   Building text classifiers using positive and unlabeled examples [J].
Bing, L ;
Yang, D ;
Li, XL ;
Lee, WS ;
Yu, PS .
THIRD IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2003, :179-186
[7]   Object based image analysis for remote sensing [J].
Blaschke, T. .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2010, 65 (01) :2-16
[8]   Geographic Object-Based Image Analysis - Towards a new paradigm [J].
Blaschke, Thomas ;
Hay, Geoffrey J. ;
Kelly, Maggi ;
Lang, Stefan ;
Hofmann, Peter ;
Addink, Elisabeth ;
Feitosa, Raul Queiroz ;
van der Meer, Freek ;
van der Werff, Harald ;
van Coillie, Frieke ;
Tiede, Dirk .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2014, 87 :180-191
[9]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[10]  
Bosch A, 2006, LECT NOTES COMPUT SC, V3954, P517