Estimating Traffic Volume for Local Streets with Imbalanced Data

被引:8
作者
Chen, Peng [1 ,2 ]
Hu, Songhua [1 ,2 ]
Shen, Qing [3 ]
Lin, Hangfei [1 ,2 ]
Xie, Chi [1 ,2 ]
机构
[1] Tongji Univ, Coll Transportat Engn, Shanghai, Peoples R China
[2] Tongji Univ, Minist Educ, Key Lab Rd & Traff Engn, Shanghai, Peoples R China
[3] Univ Washington, Dept Urban Design & Planning, Seattle, WA 98195 USA
关键词
AVERAGE; PREDICTION; NETWORK; COUNTY;
D O I
10.1177/0361198119833347
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Annual average daily traffic (AADT) is an important measurement used in traffic engineering. Local streets are major components of a road network. However, automatic traffic recorders (ATRs) used to collect AADT are often limited to arterial roads, and such information is, therefore, often unavailable for local streets. Estimating AADT on local streets becomes a necessity as local street traffic continues to grow and the capacity of arterial roads becomes insufficient. A challenge is that an under-represented sample of local street AADT may result in biased estimation. A synthetic minority oversampling technique (SMOTE) is applied to oversample local streets to correct the imbalanced sampling among different road types. A generalized linear mixed model (GLMM) is employed to estimate AADT incorporating various independent variables, including factors of roadway design, socio-demographics, and land use. The model is examined with an AADT dataset from Seattle, WA. Results show that: (1) SMOTE helps to correct imbalanced sampling proportions and improve model performance significantly; (2) the number of lanes and the number of crosswalks are both positively associated with AADT; (3) road segments located in areas with a higher population density or more mixed land use have a higher AADT; (4) distance to the nearest arterial road is negatively correlated with AADT; and (5) AADT creates spatial spillover effects on neighboring road segments. The combination of SMOTE and GLMM improves the estimation accuracy on AADT, which contributes to better data for transportation planning and traffic monitoring, and to cost saving on data collection.
引用
收藏
页码:598 / 610
页数:13
相关论文
共 30 条
  • [1] [Anonymous], 1995, Survey Methods for Transport Planning
  • [2] REGRESSION-BASED TESTS FOR OVERDISPERSION IN THE POISSON MODEL
    CAMERON, AC
    TRIVEDI, PK
    [J]. JOURNAL OF ECONOMETRICS, 1990, 46 (03) : 347 - 364
  • [3] Mixed land-uses and commuting: Evidence from the American housing survey
    Cervero, R
    [J]. TRANSPORTATION RESEARCH PART A-POLICY AND PRACTICE, 1996, 30 (05) : 361 - 377
  • [4] Chawla NV, 2010, DATA MINING AND KNOWLEDGE DISCOVERY HANDBOOK, SECOND EDITION, P875, DOI 10.1007/978-0-387-09823-4_45
  • [5] SMOTE: Synthetic minority over-sampling technique
    Chawla, Nitesh V.
    Bowyer, Kevin W.
    Hall, Lawrence O.
    Kegelmeyer, W. Philip
    [J]. 2002, American Association for Artificial Intelligence (16)
  • [6] Built environment determinants of bicycle volume: A longitudinal analysis
    Chen, Peng
    Zhou, Jiangping
    Sun, Feiyang
    [J]. JOURNAL OF TRANSPORT AND LAND USE, 2017, 10 (01) : 655 - 674
  • [7] Principle of Demographic Gravitation to Estimate Annual Average Daily Traffic: Comparison of Statistical and Neural Network Models
    Duddu, Venkata Ramana
    Pulugurtha, Srinivas S.
    [J]. JOURNAL OF TRANSPORTATION ENGINEERING, 2013, 139 (06) : 585 - 595
  • [8] Eom JK, 2006, TRANSPORT RES REC, P20
  • [9] Federal Highway Administration, 2012, TRAFF MON GUID
  • [10] Estimating annual average daily traffic and transport emissions for a national road network: A bottom-up methodology for both nationally-aggregated and spatially-disaggregated results
    Fu, Miao
    Kelly, J. Andrew
    Clinch, J. Peter
    [J]. JOURNAL OF TRANSPORT GEOGRAPHY, 2017, 58 : 186 - 195