An Expressway ETC Missing Data Restoration Model Considering Multi-Attribute Features

被引:1
|
作者
Zou, Fumin [1 ]
Zhou, Zhaoyi [1 ]
Cai, Qiqin [1 ,2 ]
Guo, Feng [1 ,3 ]
Zhang, Xinyi [1 ]
机构
[1] Fujian Univ Technol, Fujian Key Lab Automot Elect & Elect Drive, Fuzhou 350118, Peoples R China
[2] Huaqiao Univ, Coll Mech Engn & Automat, Xiamen 361021, Peoples R China
[3] Fuzhou Univ, Coll Comp & Data Sci, Fuzhou 350108, Peoples R China
关键词
ETC data; data restoration; missing transactions; expressway; data mining; TRAFFIC FLOW; IMPUTATION; DEMAND; MATRIX;
D O I
10.3390/s23218745
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Electronic toll collection (ETC) data mining has become one of the hotspots in the research of intelligent expressway extension applications. Ensuring the integrity of ETC data stands as a critical measure in upholding data quality. ETC data are typical structured data, and although deep learning holds great potential in the ETC data restoration field, its applications in structured data are still in the early stages. To address these issues, we propose an expressway ETC missing transaction data restoration model considering multi-attribute features (MAF). Initially, we employ an entity embedding neural network (EENN) to automatically learn the representation of categorical features in multi-dimensional space, subsequently obtaining embedding vectors from networks that have been adequately trained. Then, we use long short-term memory (LSTM) neural networks to extract the changing patterns of vehicle speeds across several continuous sections. Ultimately, we merge the processed features with other features as input, using a three-layer multilayer perceptron (MLP) to complete the ETC data restoration. To validate the effectiveness of the proposed method, we conducted extensive tests using real ETC datasets and compared it with methods commonly used for structured data restoration. The experimental results demonstrate that the proposed method significantly outperforms others in restoration accuracy on two different datasets. Specifically, our sample data size reached around 400,000 entries. Compared to the currently best method, our method improved the restoration accuracy by 19.06% on non-holiday ETC datasets. The MAE and RMSE values reached optimal levels of 12.394 and 23.815, respectively. The fitting degree of the model to the dataset also reached its peak (R2 = 0.993). Meanwhile, the restoration stability of our method on holiday datasets increased by 5.82%. An ablation experiment showed that the EENN and LSTM modules contributed 7.60% and 9% to the restoration accuracy, as well as 4.68% and 7.29% to the restoration stability. This study indicates that the proposed method not only significantly improves the quality of ETC data but also meets the timeliness requirements of big data mining analysis.
引用
收藏
页数:26
相关论文
共 50 条
  • [1] A multi-attribute quantum group consensus model considering psychological
    Xu, Yizhao
    Yan, Shuli
    Li, Yangxue
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 144
  • [2] Research on Multi-attribute Data Completion Method Considering Data Distribution Characteristics
    Yong, Wang
    Hao, Li
    2019 11TH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS (IHMSC 2019), VOL 2, 2019, : 138 - 141
  • [3] MULTI-ATTRIBUTE DATA VISUALIZATION ANALYSIS MODEL FOR MULTIMEDIA
    Wu, Jun
    Chen, YaoXin
    2019 6TH INTERNATIONAL CONFERENCE ON BEHAVIORAL, ECONOMIC AND SOCIO-CULTURAL COMPUTING (BESC 2019), 2019,
  • [4] Power Grid Enterprise Intelligent Risk Identification Model Considering Multi-Attribute and Low correlation Data
    Zhou, Long
    Cai, Lingjia
    Jiang, Lin
    Chen, Lingna
    IEEE ACCESS, 2019, 7 : 111324 - 111331
  • [5] Similarity measure for multi-attribute data
    Li, CJ
    Prabhakaran, B
    Zheng, SQ
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 1149 - 1152
  • [6] Missing Value Imputation for Multi-attribute Sensor Data Streams via Message Propagation
    Li, Xiao
    Li, Huan
    Lu, Hua
    Jensen, Christian S.
    Pandey, Varun
    Markl, Volker
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2023, 17 (03): : 345 - 358
  • [7] Multi-Attribute Compressive Data Gathering
    Chen, Guangshuo
    Liu, Xiao-Yang
    Kong, Linghe
    Lu, Jia-Liang
    Wu, Min-You
    2014 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2014, : 2178 - 2183
  • [8] Crowdsourced Selection on Multi-Attribute Data
    Weng, Xueping
    Li, Guoliang
    Hu, Huiqi
    Feng, Jianhua
    CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, : 307 - 316
  • [9] Dealing with missing data in rough set analysis of multi-attribute and multi-criteria decision problems
    Greco, S
    Matarazzo, B
    Slowinski, R
    DECISION MAKING: RECENT DEVELOPMENTS AND WORLDWIDE APPLICATIONS, 2000, 45 : 295 - 316
  • [10] Efficient Multi-Attribute Auctions Considering Supply Disruption
    Xiang, Jie
    Zhang, Juliang
    Cheng, T. C. E.
    Maria Sallan, Jose
    Hua, Guowei
    ASIA-PACIFIC JOURNAL OF OPERATIONAL RESEARCH, 2019, 36 (03)