A framework for extracting urban functional regions based on multiprototype word embeddings using points-of-interest data

被引:76
作者
Hu, Sheng [1 ]
He, Zhanjun [1 ,2 ]
Wu, Liang [1 ,2 ]
Yin, Li [3 ]
Xu, Yongyang [1 ]
Cui, Haifu [1 ]
机构
[1] China Univ Geosci, Fac Informat Engn, Wuhan 430074, Peoples R China
[2] Natl Engn Res Ctr Geog Informat Syst, Wuhan 430074, Peoples R China
[3] SUNY Buffalo, Dept Urban & Reg Planning, Buffalo, NY 14214 USA
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Urban functional regions; Word embeddings; Points-of-interest; Spatial dusters; LAND-USE CLASSIFICATION; AREAS;
D O I
10.1016/j.compenvurbsys.2019.101442
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Many studies are in an effort to explore urban spatial structure, and urban functional regions have become the subject of increasing attention among planners, engineers and public officials. Attempts have been made to identify urban functional regions using high spatial resolution (HSR) remote sensing images and extensive geo-data. However, the research scale and throughput have also been limited by the accessibility of HSR remote sensing data. Recently, big geo-data are becoming increasingly popular for urban studies since research is still accessible and objective with regard to the use of these data. This study aims to build a novel framework to provide an alternative solution for sensing urban spatial structure and discovering urban functional regions based on emerging geo-data - points of interest (POIs) data and an embedding learning method in the natural language processing (NLP) field. We started by constructing the intraurban functional corpus using a center-context pairs-based approach. A word embeddings representation model for training that corpus was used to extract multiprototype vectors in the second step, and the last step aggregated the functional parcels based on an introduced spatial clustering method, hierarchical density-based spatial clustering of applications with noise (HDBSCAN). The clustering results suggested that our proposed framework used in this study is capable of discovering the utilization of urban space with a reasonable level of accuracy. The limitation and potential improvement of the proposed framework arc also discussed.
引用
收藏
页数:15
相关论文
共 54 条
[1]   An information-theoretic perspective of tf-idf measures [J].
Aizawa, A .
INFORMATION PROCESSING & MANAGEMENT, 2003, 39 (01) :45-65
[2]  
[Anonymous], MAP POIS POINT INT D
[3]  
[Anonymous], 29 AAAI C ART INT 29
[4]  
Antikainen Janne., 2005, Informationen Zur Raumentwickl, V7, P447
[5]   Toward mapping land-use patterns from volunteered geographic information [J].
Arsanjani, Jamal Jokar ;
Helbich, Marco ;
Bakillah, Mohamed ;
Hagenauer, Julian ;
Zipf, Alexander .
INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE, 2013, 27 (12) :2264-2278
[6]   Human mobility: Models and applications [J].
Barbosa, Hugo ;
Barthelemy, Marc ;
Ghoshal, Gourab ;
James, Charlotte R. ;
Lenormand, Maxime ;
Louail, Thomas ;
Menezes, Ronaldo ;
Ramasco, Jose J. ;
Simini, Filippo ;
Tomasini, Marcello .
PHYSICS REPORTS-REVIEW SECTION OF PHYSICS LETTERS, 2018, 734 :1-74
[7]   Geographic Object-Based Image Analysis - Towards a new paradigm [J].
Blaschke, Thomas ;
Hay, Geoffrey J. ;
Kelly, Maggi ;
Lang, Stefan ;
Hofmann, Peter ;
Addink, Elisabeth ;
Feitosa, Raul Queiroz ;
van der Meer, Freek ;
van der Werff, Harald ;
van Coillie, Frieke ;
Tiede, Dirk .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2014, 87 :180-191
[8]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[9]  
Bracken I., 2014, URBAN PLANNING METHO, DOI [10.4324/9781315823904, DOI 10.4324/9781315823904]
[10]   Hierarchical Density Estimates for Data Clustering, Visualization, and Outlier Detection [J].
Campello, Ricardo J. G. B. ;
Moulavi, Davoud ;
Zimek, Arthur ;
Sander, Joerg .
ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2015, 10 (01)