Computing Urban Traffic Congestions by Incorporating Sparse GPS Probe Data and Social Media Data

被引:63
作者
Wang, Senzhang [1 ,2 ]
Zhang, Xiaoming [3 ]
Cao, Jianping [4 ]
He, Lifang [5 ]
Stenneth, Leon [6 ]
Yu, Philip S. [7 ,8 ]
Li, Zhoujun [3 ]
Huang, Zhiqiu [1 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing, Jiangsu, Peoples R China
[2] Collaborat Innovat Ctr Novel Software Technol & I, Nanjing, Jiangsu, Peoples R China
[3] Beihang Univ, State Key Lab Software Dev Environm, Beijing, Peoples R China
[4] Natl Univ Def Technol, Changsha, Hunan, Peoples R China
[5] Shenzhen Univ, Shenzhen, Peoples R China
[6] BMW Audi & Daimlers HERE Connected Driving, Shenzhen, Peoples R China
[7] Univ Illinois, Sch Comp Sci, Chicago, IL USA
[8] Tsinghua Univ, Inst Data Sci, Beijing, Peoples R China
关键词
Social media; traffic congestion; matrix factorization; data fusion; REAL-TIME DETECTION; DENSITY-ESTIMATION;
D O I
10.1145/3057281
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Estimating urban traffic conditions of an arterial network with GPS probe data is a practically important while substantially challenging problem, and has attracted increasing research interests recently. Although GPS probe data is becoming a ubiquitous data source for various traffic related applications currently, they are usually insufficient for fully estimating traffic conditions of a large arterial network due to the low sampling frequency. To explore other data sources for more effectively computing urban traffic conditions, we propose to collect various traffic events such as traffic accident and jam from social media as complementary information. In addition, to further explore other factors that might affect traffic conditions, we also extract rich auxiliary information including social events, road features, Point of Interest (POI), and weather. With the enriched traffic data and auxiliary information collected from different sources, we first study the traffic co-congestion pattern mining problem with the aim of discovering which road segments geographically close to each other are likely to co-occur traffic congestion. A search tree based approach is proposed to efficiently discover the co-congestion patterns. These patterns are then used to help estimate traffic congestions and detect anomalies in a transportation network. To fuse the multisourced data, we finally propose a coupled matrix and tensor factorization model named TCE_R to more accurately complete the sparse traffic congestion matrix by collaboratively factorizing it with other matrices and tensors formed by other data. We evaluate the proposed model on the arterial network of downtown Chicago with 1,257 road segments whose total length is nearly 700 miles. The results demonstrate the superior performance of TCE_ R by comprehensive comparison with existing approaches.
引用
收藏
页数:30
相关论文
共 42 条
[1]  
[Anonymous], 2013, P 21 ACM SIGSPATIAL, DOI DOI 10.1145/2525314.2525343
[2]  
[Anonymous], 2014, P 22 ACM SIGSPATIAL
[3]  
[Anonymous], 2001, WWW, DOI 10.1145/371920.372071
[4]  
[Anonymous], 2010, P 4 ACM C RECOMMEND
[5]  
[Anonymous], 2014, ACM SIGKDD INT C KNO, DOI DOI 10.1145/2623330.2623653
[6]  
[Anonymous], 2011, HPL2011109
[7]  
[Anonymous], EL ENG INF ICEEI 201
[8]  
Carli R, 2015, IEEE INT CON AUTO SC, P967, DOI 10.1109/CoASE.2015.7294224
[9]   Road Traffic Congestion Monitoring in Social Media with Hinge-Loss Markov Random Fields [J].
Chen, Po-Ta ;
Chen, Feng ;
Qian, Zhen .
2014 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2014, :80-89
[10]   Real-Time Detection of Traffic From Twitter Stream Analysis [J].
D'Andrea, Eleonora ;
Ducange, Pietro ;
Lazzerini, Beatrice ;
Marcelloni, Francesco .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2015, 16 (04) :2269-2283