Efficient Missing Counts Imputation of a Bike-Sharing System by Generative Adversarial Network

被引:10
作者
Xiao, Xiao [1 ]
Zhang, Yunlong [1 ]
Yang, Shu [2 ]
Kong, Xiaoqiang [1 ]
机构
[1] Texas A&M Univ, Zachry Dept Civil Engn, Dwight Look Coll Engn, College Stn, TX 77843 USA
[2] Southeast Univ, Dept Geog Informat Syst, Sch Transportat, Nanjing 211189, Peoples R China
关键词
Generative adversarial networks; Transportation; Neural networks; Training; Planning; Traffic control; Memory; Traffic data imputation; bike-sharing system; missing data; generative adversarial network; TRAFFIC FLOW; PREDICTION; MANAGEMENT;
D O I
10.1109/TITS.2021.3124409
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
The issue of missing data is common in a bike-sharing system due to various reasons, such as the failure of data collection devices. To better utilize the bike-sharing data and guide the operation and planning of the public transportation system, missing data need to be imputed. When data are missing to a rate as high as 50%, or when the training set to calibrate a model is incomplete, many commonly used methods dealing with missing data may fail. Our concerns are how to incorporate the temporal-spatial relations from counts from a bike-sharing system and ensure a stable performance when the training set is incomplete. To solve these issues, a method is proposed using the strengths of a Generative Adversarial Network (GAN), which learns the distribution of missingness and generates data close to ground-truth values to impute the missing counts from a bike-sharing system. Traffic counts data are collected from Bluebikes, Boston. With limited available observations, the proposed method imputes missing traffic counts when concerning two scenarios: not missing at random (NMAR) problem and MCAR (Missing Completely at Random problem). The proposed method shows robustness with increasing missingness ratios in the dataset. In our experiment, the RMSE values used to measure the missing data imputation accuracy are smaller than 0.15, while the missingness ratio raises from 20% to 80%. Compared to other baseline methods, the method is robust and efficient for the missing data imputation problem for a bike-sharing system.
引用
收藏
页码:13443 / 13451
页数:9
相关论文
共 47 条
[1]   Dynamic linear models to predict bike availability in a bike sharing system [J].
Almannaa, Mohammed H. ;
Elhenawy, Mohammed ;
Rakha, Hesham A. .
INTERNATIONAL JOURNAL OF SUSTAINABLE TRANSPORTATION, 2020, 14 (03) :232-242
[2]   Network and station-level bike-sharing system prediction: a San Francisco bay area case study [J].
Ashqar, Huthaifa I. ;
Elhenawy, Mohammed ;
Rakha, Hesham A. ;
Almannaa, Mohammed ;
House, Leanna .
JOURNAL OF INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 26 (05) :602-612
[3]   Multiple imputation by chained equations: what is it and how does it work? [J].
Azur, Melissa J. ;
Stuart, Elizabeth A. ;
Frangakis, Constantine ;
Leaf, Philip J. .
INTERNATIONAL JOURNAL OF METHODS IN PSYCHIATRIC RESEARCH, 2011, 20 (01) :40-49
[4]   Detecting errors and imputing missing data for single-loop surveillance systems [J].
Chen, C ;
Kwon, J ;
Rice, J ;
Skabardonis, A ;
Varaiya, P .
TRANSPORTATION DATA RESEARCH: PLANNING AND ADMINISTRATION, 2003, (1855) :160-167
[5]   Traffic Flow Imputation Using Parallel Data and Generative Adversarial Networks [J].
Chen, Yuanyuan ;
Lv, Yisheng ;
Wang, Fei-Yue .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (04) :1624-1630
[6]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[7]   An efficient realization of deep learning for traffic data imputation [J].
Duan, Yanjie ;
Lv, Yisheng ;
Liu, Yu-Liang ;
Wang, Fei-Yue .
TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2016, 72 :168-181
[8]  
Duan YJ, 2014, 2014 IEEE 17TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), P912, DOI 10.1109/ITSC.2014.6957805
[9]   Analysing bicycle-sharing system user destination choice preferences: Chicago's Divvy system [J].
Faghih-Imani, Ahmadreza ;
Eluru, Naveen .
JOURNAL OF TRANSPORT GEOGRAPHY, 2015, 44 :53-64
[10]  
Gang Chang, 2012, Tsinghua Science and Technology, V17, P304