Enhancing data efficiency for autonomous vehicles: Using data sketches for detecting driving anomalies

Cited by: 1
Authors
Indah, Debbie Aisiana [1]
Mwakalonge, Judith [1]
Comert, Gurcan [2]
Siuhi, Saidi [1]
Affiliations
[1] South Carolina State Univ, Dept Engn, 300 Coll Ave, Orangeburg, SC 29117 USA
[2] Benedict Coll, Dept Comp Sci & Engn, 1600 Harden St, Columbia, SC 29204 USA
Source
MACHINE LEARNING WITH APPLICATIONS | 2024 / Vol. 15
Funding
National Science Foundation (USA)
Keywords
Autonomous vehicles; Data sketches; Reservoir sampling sketches; Big data; Driving anomaly detection; BEHAVIOR; MODEL;
DOI
10.1016/j.mlwa.2024.100530
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Machine learning models for near-collision detection in autonomous vehicles promise enhanced predictive power. However, training them on large datasets presents storage and computational challenges, particularly on conventional computing systems. This paper addresses the problem of training anomaly detection models from large-scale vehicle trajectory datasets and adopts a reservoir sampling-based data sketching technique. Predetermined subset sizes ranging from 0.4% to 100% of the original data are used, and a single-pass reservoir sampling algorithm is applied to construct these subsets efficiently. A Support Vector Machine (SVM) model is then trained on each subset, and its performance is assessed with accuracy, precision, recall, and F1-score. Experimental results on the HighD dataset, a comprehensive real-world collection of vehicle trajectories, confirm that the approach can achieve robust near-collision detection. With the full dataset, the model achieved an F1-score of 0.9998 for class 0 and 0.9984 for class 1. When the data was reduced to 0.4% of the original size, the F1-score remained at 0.9998 for class 0 and was 0.7143 for class 1. This demonstrates that relatively high performance can be maintained even with a 99.6% reduction in data size. Moreover, precision and recall values ranged from 0.713 to 0.999 across the sketch sizes.
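The single-pass reservoir sampling step mentioned in the abstract can be illustrated with a minimal sketch. It uses the classic "Algorithm R" variant, which may differ from the variant the authors implemented; the function name `reservoir_sample`, the `seed` parameter, and the synthetic 100,000-row stream are hypothetical and are not taken from the paper or from the HighD dataset.

```python
import random
from typing import Iterable, List, TypeVar

T = TypeVar("T")


def reservoir_sample(stream: Iterable[T], k: int, seed: int = 0) -> List[T]:
    """Return a uniform random sample of up to k items in one pass (Algorithm R).

    Assumption: illustrative only; the paper's exact variant is not specified here.
    """
    rng = random.Random(seed)
    reservoir: List[T] = []
    for i, item in enumerate(stream):
        if i < k:
            # Fill the reservoir with the first k items.
            reservoir.append(item)
        else:
            # Pick a slot in [0, i]; the new item is kept with probability k / (i + 1).
            j = rng.randint(0, i)
            if j < k:
                reservoir[j] = item
    return reservoir


# Hypothetical stream of 100,000 row indices; a 0.4% sketch keeps 400 of them.
rows = range(100_000)
sketch = reservoir_sample(rows, k=400)
print(len(sketch))
```

Because only the `k` retained items are held in memory, the sketch can be built without loading the full stream, which is the property that makes this kind of sampling attractive for large trajectory datasets. The resulting subset would then be passed to whatever classifier is being trained.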
Pages: 9
Related papers
50 in total
  • [1] Analysis of Driving Data for Autonomous Vehicle Applications
    O'Brien, Marie
    Neubauer, Kai
    Van Brummelen, Jessica
    Najjaran, Homayoun
    2017 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2017, : 3677 - 3682
  • [2] Toward Interpretability in Fault Diagnosis for Autonomous Vehicles: Interpretation of Sensor Data Anomalies
    Fang, Yukun
    Min, Haigen
    Wu, Xia
    Lei, Xiaoping
    Chen, Shixiang
    Teixeira, Rui
    Zhao, Xiangmo
    IEEE SENSORS JOURNAL, 2023, 23 (05) : 5014 - 5027
  • [3] Analysing driving efficiency of mandatory lane change decision for autonomous vehicles
    Cao, Peng
    Xu, Zhandong
    Fan, Qiaochu
    Liu, Xiaobo
    IET INTELLIGENT TRANSPORT SYSTEMS, 2019, 13 (03) : 506 - 514
  • [4] Improving the Reliability of Autonomous Vehicles in a Branded Service System Using Big Data
    Makarova, Irina
    Buyvol, Polina
    Gabsalikhova, Larisa
    Pashkevich, Anton
    Tsybunov, Eduard
    Boyko, Aleksey
    2020 21ST INTERNATIONAL CONFERENCE ON RESEARCH AND EDUCATION IN MECHATRONICS (REM), 2020,
  • [5] A Path Towards Understanding Factors Affecting Crash Severity in Autonomous Vehicles Using Current Naturalistic Driving Data
    van Wyk, Franco
    Khojandi, Anahita
    Masoud, Neda
    INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 2, 2020, 1038 : 106 - 120
  • [6] Proximity based automatic data annotation for autonomous driving
    Sun, Chen
    Vianney, Jean M. Uwabeza
    Li, Ying
    Chen, Long
    Li, Li
    Wang, Fei-Yue
    Khajepour, Amir
    Cao, Dongpu
    IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2020, 7 (02) : 395 - 404
  • [7] Modelling Ethical Algorithms in Autonomous Vehicles Using Crash Data
    Robinson, Pamela
    Sun, Landy
    Furey, Heidi
    Jenkins, Ryan
    Phillips, Christopher R. M.
    Powers, Thomas M.
    Ritterson, Ryan S.
    Xie, Yuanchang
    Casagrande, Rocco
    Evans, Nicholas G.
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (07) : 7775 - 7784
  • [8] An in-depth evaluation of deep learning-enabled adaptive approaches for detecting obstacles using sensor-fused data in autonomous vehicles
    Thakur, Abhishek
    Mishra, Sudhansu Kumar
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
  • [9] Proposal of big data route selection methods for autonomous vehicles
    Reddig, Klaudia
    Dikunow, Blazej
    Krzykowska, Karolina
    INTERNET TECHNOLOGY LETTERS, 2018, 1 (05):
  • [10] Data Encryption and Fragmentation in Autonomous Vehicles using Raspberry Pi 3
    Murad, Sahand
    Khan, Asiya
    Shiaeles, Stavros
    Masala, Giovanni
    2019 IEEE WORLD CONGRESS ON SERVICES (IEEE SERVICES 2019), 2019, : 212 - 216