Integrating Household Travel Survey and Social Media Data to Improve the Quality of OD Matrix: A Comparative Case Study

被引:18
作者
Cheng, Zesheng [1 ]
Jian, Sisi [2 ]
Rashidi, Taha Hossein [1 ]
Maghrebi, Mojtaba [1 ]
Waller, Steven Travis [1 ]
机构
[1] Univ New South Wales, Sch Civil & Environm Engn, Sydney, NSW 2052, Australia
[2] Hong Kong Univ Sci & Technol, Dept Civil & Environm Engn, Hong Kong, Peoples R China
关键词
Urban areas; Data models; Twitter; Data mining; High-temperature superconductors; Forestry; Foursquare; random forest regression; travel demand estimation; multi-city model; LOCATION; CLASSIFICATION;
D O I
10.1109/TITS.2019.2958673
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Collecting effective data is a fundamental step in developing transport networks and related research. Social media have become an emerging source of data for traffic analyses. In this paper, we demonstrate that the function of a city influences the utility of social media data in travel demand models by generating models for eight US cities with different functions. Data from Twitter and Foursquare, as well as other socio-demographic information, are considered as independent variables in Origin-Destination trip regression models generated via a Random Forest regression technique. Model performance with and without use of social media data are compared via 10-fold cross-validation. The results indicate that the accuracy of the models for all eight cities improved when independent variables based on social media data were included. The performance was most improved in metropolitan areas, followed by rural and tourist areas. Inspired by this finding, we conclude that the city function influences the utility of social media data in travel demand models. Meanwhile, we create models based on trip purpose and transport mode to explore other factors that may impact the efficiency of applying social media data in transport research.
引用
收藏
页码:2628 / 2636
页数:9
相关论文
共 43 条
[1]  
[Anonymous], 2015, P 14 INT C TRAV BEH
[2]  
[Anonymous], 2009, The Elements of Statistical Learning: Data Mining, Inference, and Prediction, DOI DOI 10.1007/978
[3]  
[Anonymous], 2012, P INT AAAI C WEB SOC
[4]  
[Anonymous], 2019, ANN REPORT
[5]  
Asakura Yasuo, 2009, EXPANDING SPHERE TRA, P207
[6]  
Batty Michael, 2013, Dialogues Hum Geogr, V3, P274, DOI 10.1177/2043820613513390
[7]  
Bregman S., 2012, 99 TRANSP RES BOARD
[8]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[9]  
Breiman L, 1984, Classification and Regression Trees, V1st, DOI DOI 10.1201/9781315139470
[10]  
Bureau USC, 2019, ANN EST RES POP