Missing Traffic Data Imputation with a Linear Generative Model Based on Probabilistic Principal Component Analysis

被引:2
|
作者
Huang, Liping [1 ]
Li, Zhenghuan [1 ]
Luo, Ruikang [1 ]
Su, Rong [1 ]
机构
[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
关键词
missing data; urban traffic sensing; probabilistic; principal component analysis; PREDICTION;
D O I
10.3390/s23010204
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Even with the ubiquitous sensing data in intelligent transportation systems, such as the mobile sensing of vehicle trajectories, traffic estimation is still faced with the data missing problem due to the detector faults or limited number of probe vehicles as mobile sensors. Such data missing issue poses an obstacle for many further explorations, e.g., the link-based traffic status modeling. Although many studies have focused on tackling this kind of problem, existing studies mainly focus on the situation in which data are missing at random and ignore the distinction between links of missing data. In the practical scenario, traffic speed data are always missing not at random (MNAR). The distinction for recovering missing data on different links has not been studied yet. In this paper, we propose a general linear model based on probabilistic principal component analysis (PPCA) for solving MNAR traffic speed data imputation. Furthermore, we propose a metric, i.e., Pearson score (p-score), for distinguishing links and investigate how the model performs on links with different p-score values. Experimental results show that the new model outperforms the typically used PPCA model, and missing data on links with higher p-score values can be better recovered.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Probabilistic principal component analysis for metabolomic data
    Gift Nyamundanda
    Lorraine Brennan
    Isobel Claire Gormley
    BMC Bioinformatics, 11
  • [22] An Infinitesimal Probabilistic Model for Principal Component Analysis of Manifold Valued Data
    Sommer, Stefan
    SANKHYA-SERIES A-MATHEMATICAL STATISTICS AND PROBABILITY, 2019, 81 (01): : 37 - 62
  • [23] An Infinitesimal Probabilistic Model for Principal Component Analysis of Manifold Valued Data
    Stefan Sommer
    Sankhya A, 2019, 81 (1): : 37 - 62
  • [24] Missing value imputation for the analysis of incomplete traffic accident data
    Deb, Rupam
    Liew, Alan Wee -Chung
    INFORMATION SCIENCES, 2016, 339 : 274 - 289
  • [25] Mixture semisupervised probabilistic principal component regression model with missing inputs
    Sedghi, Shabnam
    Sadeghian, Anahita
    Huang, Biao
    COMPUTERS & CHEMICAL ENGINEERING, 2017, 103 : 176 - 187
  • [26] A Missing Traffic Data Imputation Method Based on a Diffusion Convolutional Neural Network-Generative Adversarial Network
    Zhang, Chenchen
    Zhou, Lei
    Xiao, Xuemei
    Xu, Dongwei
    SENSORS, 2023, 23 (23)
  • [27] DEGAIN: Generative-Adversarial-Network-Based Missing Data Imputation
    Shahbazian, Reza
    Trubitsyna, Irina
    INFORMATION, 2022, 13 (12)
  • [28] A systematic review of generative adversarial imputation network in missing data imputation
    Yuqing Zhang
    Runtong Zhang
    Butian Zhao
    Neural Computing and Applications, 2023, 35 : 19685 - 19705
  • [29] A systematic review of generative adversarial imputation network in missing data imputation
    Zhang, Yuqing
    Zhang, Runtong
    Zhao, Butian
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (27): : 19685 - 19705
  • [30] Robust Principal Component Analysis of Data with Missing Values
    Karkkainen, Tommi
    Saarela, Mirka
    MACHINE LEARNING AND DATA MINING IN PATTERN RECOGNITION, MLDM 2015, 2015, 9166 : 140 - 154