An Expressway ETC Missing Data Restoration Model Considering Multi-Attribute Features

被引:1
|
作者
Zou, Fumin [1 ]
Zhou, Zhaoyi [1 ]
Cai, Qiqin [1 ,2 ]
Guo, Feng [1 ,3 ]
Zhang, Xinyi [1 ]
机构
[1] Fujian Univ Technol, Fujian Key Lab Automot Elect & Elect Drive, Fuzhou 350118, Peoples R China
[2] Huaqiao Univ, Coll Mech Engn & Automat, Xiamen 361021, Peoples R China
[3] Fuzhou Univ, Coll Comp & Data Sci, Fuzhou 350108, Peoples R China
关键词
ETC data; data restoration; missing transactions; expressway; data mining; TRAFFIC FLOW; IMPUTATION; DEMAND; MATRIX;
D O I
10.3390/s23218745
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Electronic toll collection (ETC) data mining has become one of the hotspots in the research of intelligent expressway extension applications. Ensuring the integrity of ETC data stands as a critical measure in upholding data quality. ETC data are typical structured data, and although deep learning holds great potential in the ETC data restoration field, its applications in structured data are still in the early stages. To address these issues, we propose an expressway ETC missing transaction data restoration model considering multi-attribute features (MAF). Initially, we employ an entity embedding neural network (EENN) to automatically learn the representation of categorical features in multi-dimensional space, subsequently obtaining embedding vectors from networks that have been adequately trained. Then, we use long short-term memory (LSTM) neural networks to extract the changing patterns of vehicle speeds across several continuous sections. Ultimately, we merge the processed features with other features as input, using a three-layer multilayer perceptron (MLP) to complete the ETC data restoration. To validate the effectiveness of the proposed method, we conducted extensive tests using real ETC datasets and compared it with methods commonly used for structured data restoration. The experimental results demonstrate that the proposed method significantly outperforms others in restoration accuracy on two different datasets. Specifically, our sample data size reached around 400,000 entries. Compared to the currently best method, our method improved the restoration accuracy by 19.06% on non-holiday ETC datasets. The MAE and RMSE values reached optimal levels of 12.394 and 23.815, respectively. The fitting degree of the model to the dataset also reached its peak (R2 = 0.993). Meanwhile, the restoration stability of our method on holiday datasets increased by 5.82%. An ablation experiment showed that the EENN and LSTM modules contributed 7.60% and 9% to the restoration accuracy, as well as 4.68% and 7.29% to the restoration stability. This study indicates that the proposed method not only significantly improves the quality of ETC data but also meets the timeliness requirements of big data mining analysis.
引用
收藏
页数:26
相关论文
共 50 条
  • [31] Multi-attribute Aware Data Scheduling for Multipath TCP
    Luo, Jiacheng
    Su, Xin
    Liu, Bei
    Zeng, Jie
    2018 18TH INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES (ISCIT), 2018, : 270 - 274
  • [32] Efficient Similarity Join and Search on Multi-Attribute Data
    Li, Guoliang
    He, Jian
    Deng, Dong
    Li, Jian
    SIGMOD'15: PROCEEDINGS OF THE 2015 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2015, : 1137 - 1151
  • [33] Group decision making with multi-attribute interval data
    Yue, Zhongliang
    INFORMATION FUSION, 2013, 14 (04) : 551 - 561
  • [34] A framework for efficient multi-attribute movement data analysis
    Valdes, Fabio
    Gueting, Ralf Hartmut
    VLDB JOURNAL, 2019, 28 (04): : 427 - 449
  • [35] Multi-attribute group decision-making considering opinion dynamics
    Li, Yupeng
    Liu, Meng
    Cao, Jin
    Wang, Xiaolin
    Zhang, Na
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 184
  • [36] Multi-attribute group decision-making considering opinion dynamics
    Li, Yupeng
    Liu, Meng
    Cao, Jin
    Wang, Xiaolin
    Zhang, Na
    Expert Systems with Applications, 2021, 184
  • [37] Efficient Summarization Framework for Multi-Attribute Uncertain Data
    Xu, Jie
    Kalashnikov, Dmitri, V
    Mehrotra, Sharad
    SIGMOD'14: PROCEEDINGS OF THE 2014 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2014, : 421 - 432
  • [38] Positioning method of expressway ETC gantry by multi-source traffic data
    Guo, Feng
    Zou, Fumin
    Luo, Sijie
    Chen, Haobin
    Yu, Xiang
    Zhang, Cheng
    Liao, Lyuchao
    IET INTELLIGENT TRANSPORT SYSTEMS, 2024, 18 (03) : 540 - 554
  • [39] An explainable multi-attribute decision model based on argumentation
    Zhong, Qiaoting
    Fan, Xiuyi
    Luo, Xudong
    Toni, Francesca
    EXPERT SYSTEMS WITH APPLICATIONS, 2019, 117 : 42 - 61
  • [40] A descriptive multi-attribute utility model for everyday decisions
    Weiss, Jie W.
    Weiss, David J.
    Edwards, Ward
    THEORY AND DECISION, 2010, 68 (1-2) : 101 - 114