Enhancing data efficiency for autonomous vehicles: Using data sketches for detecting driving anomalies

被引:1
|
作者
Indah, Debbie Aisiana [1 ]
Mwakalonge, Judith [1 ]
Comert, Gurcan [2 ]
Siuhi, Saidi [1 ]
机构
[1] South Carolina State Univ, Dept Engn, 300 Coll Ave, Orangeburg, SC 29117 USA
[2] Benedict Coll, Dept Comp Sci & Engn, 1600 Harden St, Columbia, SC 29204 USA
来源
MACHINE LEARNING WITH APPLICATIONS | 2024年 / 15卷
基金
美国国家科学基金会;
关键词
Autonomous vehicles; Data sketches; Reservoir sampling sketches; Big data; Driving anomaly detection; BEHAVIOR; MODEL;
D O I
10.1016/j.mlwa.2024.100530
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Machine learning models for near collision detection in autonomous vehicles promise enhanced predictive power. However, training on these large datasets presents storage and computational challenges, particularly when operated on conventional computing systems. This paper addresses the problem of training anomaly detection models from large-scale vehicle trajectory datasets and adopts a reservoir sampling-based data sketching technique. Predetermined subset sizes ranging from 0.4% to 100% of the original data are utilized, A single-pass reservoir sampling algorithm is then applied to construct these data subsets efficiently. Subsequently, a Support Vector Machine (SVM) model is trained on these subsets, and its performance is assessed by various metrics, including accuracy, precision, recall, and F1-score. Experimental outcomes on the HighD dataset, a comprehensive real-world collection of vehicle trajectories, confirm that our approach can achieve robust near-collision detection. With a full dataset, our model achieved an F1-score of 0.9998 for class 0 and 0.9984 for class 1. When the data was reduced to as low as 0.4% of the original size, the F1-score for class 0 remained at 0.9998 and 0.7143 for class 1. This demonstrates a capability to maintain a relatively high performance even with a 99.6% reduction in data size. Moreover, precision and recall values ranged from 71.3% to 0.999 across varying sketch sizes.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] Lateral conflict resolution data derived from Argoverse-2: Analysing safety and efficiency impacts of autonomous vehicles at intersections
    Li, Guopeng
    Jiao, Yiru
    Calvert, Simeon C.
    van Lint, J. W. C.
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2024, 167
  • [22] Detecting anomalies from big network traffic data using an adaptive detection approach
    Zhang, Ji
    Li, Hongzhou
    Gao, Qigang
    Wang, Hai
    Luo, Yonglong
    INFORMATION SCIENCES, 2015, 318 : 91 - 110
  • [23] Generating Edge Cases for Testing Autonomous Vehicles Using Real-World Data
    Karunakaran, Dhanoop
    Perez, Julie Stephany Berrio
    Worrall, Stewart
    SENSORS, 2024, 24 (01)
  • [24] Real-Time Traffic State Measurement Using Autonomous Vehicles Open Data
    Wang, Zhaohan
    Keo, Profita
    Saberi, Meead
    IEEE OPEN JOURNAL OF INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 4 : 602 - 610
  • [26] Sentiment Analysis of Autonomous Vehicles After Extreme Events Using Social Media Data
    Chen, Xu
    Zeng, Haohan
    Xu, Heng
    Di, Xuan
    2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2021, : 1211 - 1216
  • [27] MAVERIC: A Data-Driven Approach to Personalized Autonomous Driving
    Schrum, Mariah L.
    Sumner, Emily
    Gombolay, Matthew C.
    Best, Andrew
    IEEE TRANSACTIONS ON ROBOTICS, 2024, 40 : 1952 - 1965
  • [28] Autonomous Vehicles: The Cybersecurity Vulnerabilities and Countermeasures for Big Data Communication
    Algarni, Abdullah
    Thayananthan, Vijey
    SYMMETRY-BASEL, 2022, 14 (12):
  • [29] Secure Data Sharing for Autonomous Vehicles in Mobile Blockchain Networks
    Zuo, Yiping
    Dai, Chen
    Guo, Jiajia
    Guo, Zhengxin
    Xiao, Fu
    Jin, Shi
    IEEE NETWORK, 2025, 39 (02): : 166 - 175
  • [30] Quantum Edge Computing for Data Analysis in Connected Autonomous Vehicles
    M Peixoto, Maycon Leone
    2024 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS, ISCC 2024, 2024,