Enhancing data efficiency for autonomous vehicles: Using data sketches for detecting driving anomalies

被引:1
|
作者
Indah, Debbie Aisiana [1 ]
Mwakalonge, Judith [1 ]
Comert, Gurcan [2 ]
Siuhi, Saidi [1 ]
机构
[1] South Carolina State Univ, Dept Engn, 300 Coll Ave, Orangeburg, SC 29117 USA
[2] Benedict Coll, Dept Comp Sci & Engn, 1600 Harden St, Columbia, SC 29204 USA
来源
MACHINE LEARNING WITH APPLICATIONS | 2024年 / 15卷
基金
美国国家科学基金会;
关键词
Autonomous vehicles; Data sketches; Reservoir sampling sketches; Big data; Driving anomaly detection; BEHAVIOR; MODEL;
D O I
10.1016/j.mlwa.2024.100530
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Machine learning models for near collision detection in autonomous vehicles promise enhanced predictive power. However, training on these large datasets presents storage and computational challenges, particularly when operated on conventional computing systems. This paper addresses the problem of training anomaly detection models from large-scale vehicle trajectory datasets and adopts a reservoir sampling-based data sketching technique. Predetermined subset sizes ranging from 0.4% to 100% of the original data are utilized, A single-pass reservoir sampling algorithm is then applied to construct these data subsets efficiently. Subsequently, a Support Vector Machine (SVM) model is trained on these subsets, and its performance is assessed by various metrics, including accuracy, precision, recall, and F1-score. Experimental outcomes on the HighD dataset, a comprehensive real-world collection of vehicle trajectories, confirm that our approach can achieve robust near-collision detection. With a full dataset, our model achieved an F1-score of 0.9998 for class 0 and 0.9984 for class 1. When the data was reduced to as low as 0.4% of the original size, the F1-score for class 0 remained at 0.9998 and 0.7143 for class 1. This demonstrates a capability to maintain a relatively high performance even with a 99.6% reduction in data size. Moreover, precision and recall values ranged from 71.3% to 0.999 across varying sketch sizes.
引用
收藏
页数:9
相关论文
共 50 条
  • [41] Sensor Data Transplantation for Redundant Hardware Switchover in Micro Autonomous Vehicles
    Lemieux-Mack, Cailani
    Leach, Kevin
    Angstadt, Kevin
    PROCEEDINGS 15TH ACM/IEEE INTERNATIONAL CONFERENCE ON CYBER-PHYSICAL SYSTEMS, ICCPS 2024, 2024, : 135 - 146
  • [42] Distributed Data-Sharing Consensus in Cooperative Perception of Autonomous Vehicles
    Qiu, Chenxi
    Yadav, Sourabh
    Squicciarini, Anna
    Yang, Qing
    Fu, Song
    Zhao, Juanjuan
    Xu, Chengzhong
    2022 IEEE 42ND INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2022), 2022, : 1212 - 1222
  • [43] Uncertainty-aware Sensor Data Anomaly Detection for Autonomous Vehicles
    Chen, Shixiang
    Min, Haigen
    Fang, Yukun
    Wu, Xia
    Li, Baolu
    Zhao, Xiangmo
    2024 35TH IEEE INTELLIGENT VEHICLES SYMPOSIUM, IEEE IV 2024, 2024, : 478 - 483
  • [44] The Michigan Tech autonomous winter driving data set: year two
    Bos, Jeremy P.
    Kurup, Akhil
    Chopp, Derek
    Jeffries, Zach
    AUTONOMOUS SYSTEMS: SENSORS, PROCESSING, AND SECURITY FOR VEHICLES AND INFRASTRUCTURE 2021, 2021, 11748
  • [45] Vulnerable Road User Trajectory Prediction for Autonomous Driving Using a Data-Driven Integrated Approach
    Chen, Hao
    Liu, Yinhua
    Hu, Chuan
    Zhang, Xi
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (07) : 7306 - 7317
  • [46] USING DATA FROM REDDIT, PUBLIC DELIBERATION, AND SURVEYS TO MEASURE PUBLIC OPINION ABOUT AUTONOMOUS VEHICLES
    Chen, Kaiping
    Tomblin, David
    PUBLIC OPINION QUARTERLY, 2021, 85 : 289 - 322
  • [47] QoS-Based Secure Data Communication for Software-Defined Autonomous Vehicles Using Blockchain
    Garg, Deepanshu
    Bali, Rasmeet Singh
    JOURNAL OF APPLIED SECURITY RESEARCH, 2024, 19 (03) : 494 - 516
  • [48] How do long combination vehicles perform in real traffic? A study using Naturalistic Driving Data
    Behera, Abhijeet
    Kharrazi, Sogol
    Frisk, Erik
    ACCIDENT ANALYSIS AND PREVENTION, 2024, 207
  • [49] Modelling Road Congestion Using a Fuzzy System and Real-World Data for Connected and Autonomous Vehicles
    Abberley, Luke
    Crockett, Keeley
    Cheng, Jianquan
    2019 WIRELESS DAYS (WD), 2019,
  • [50] Efficient and Privacy-preserving Roadmap Data Update for Autonomous Vehicles
    Wang, Haoyu
    Hadian, Mohammad
    Liang, Xiaohui
    2018 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2018,