A Diffusion Model for Traffic Data Imputation

被引:0
作者
Lu, Bo [1 ,2 ]
Miao, Qinghai [1 ]
Liu, Yahui [3 ,4 ]
Tamir, Tariku Sinshaw [3 ,5 ]
Zhao, Hongxia [3 ]
Zhang, Xiqiao [6 ]
Lv, Yisheng [3 ]
Wang, Fei-Yue [3 ]
机构
[1] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
[2] Baidu Inc, Beijing 100190, Peoples R China
[3] Chinese Acad Sci, Inst Automat, State Key Lab Multimodal Artificial Intelligence S, Beijing 100190, Peoples R China
[4] Meituan, Beijing 100050, Peoples R China
[5] Guangdong Univ Technol, Guangzhou 510520, Peoples R China
[6] Harbin Inst Technol, Sch Transportat Sci & Technol, Dept Transportat Engn, Harbin 150090, Peoples R China
基金
中国国家自然科学基金;
关键词
Training; Image synthesis; Electricity; Time series analysis; Predictive models; Feature extraction; Imputation; Data models; Intelligent transportation systems; Data imputation; diffusion model; implicit feature; time series; traffic data; PREDICTION;
D O I
10.1109/JAS.2024.124611
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Imputation of missing data has long been an important topic and an essential application for intelligent transportation systems (ITS) in the real world. As a state-of-the-art generative model, the diffusion model has proven highly successful in image generation, speech generation, time series modelling etc. and now opens a new avenue for traffic data imputation. In this paper, we propose a conditional diffusion model, called the implicit-explicit diffusion model, for traffic data imputation. This model exploits both the implicit and explicit feature of the data simultaneously. More specifically, we design two types of feature extraction modules, one to capture the implicit dependencies hidden in the raw data at multiple time scales and the other to obtain the long-term temporal dependencies of the time series. This approach not only inherits the advantages of the diffusion model for estimating missing data, but also takes into account the multi-scale correlation inherent in traffic data. To illustrate the performance of the model, extensive experiments are conducted on three real-world time series datasets using different missing rates. The experimental results demonstrate that the model improves imputation accuracy and generalization capability.
引用
收藏
页码:606 / 617
页数:12
相关论文
共 64 条
  • [1] Alcaraz JML, 2022, Arxiv, DOI [arXiv:2208.09399, 10.48550/arXiv.2208.09399]
  • [2] Bai SJ, 2018, Arxiv, DOI [arXiv:1803.01271, 10.48550/arXiv.1803.01271]
  • [3] Graph Regularized Nonnegative Matrix Factorization for Data Representation
    Cai, Deng
    He, Xiaofei
    Han, Jiawei
    Huang, Thomas S.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (08) : 1548 - 1560
  • [4] Cao W, 2018, Arxiv, DOI arXiv:1805.10572
  • [5] CDP-GAN: Near-Infrared and Visible Image Fusion Via Color Distribution Preserved GAN
    Chen, Jun
    Wu, Kangle
    Yu, Yang
    Luo, Linbo
    [J]. IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2022, 9 (09) : 1698 - 1701
  • [6] XGBoost: A Scalable Tree Boosting System
    Chen, Tianqi
    Guestrin, Carlos
    [J]. KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 785 - 794
  • [7] Low-Rank Autoregressive Tensor Completion for Spatiotemporal Traffic Data Imputation
    Chen, Xinyu
    Lei, Mengying
    Saunier, Nicolas
    Sun, Lijun
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (08) : 12301 - 12310
  • [8] Spatial-temporal traffic speed patterns discovery and incomplete data recovery via SVD-combined tensor decomposition
    Chen, Xinyu
    He, Zhaocheng
    Wang, Jiawei
    [J]. TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2018, 86 : 59 - 77
  • [9] Traffic Flow Imputation Using Parallel Data and Generative Adversarial Networks
    Chen, Yuanyuan
    Lv, Yisheng
    Wang, Fei-Yue
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (04) : 1624 - 1630
  • [10] Choi S.., 2023, IEEE Trans. Intelligent Vehicles, V9, P151