A Clustering-Based Data Reduction for the Large Automotive Datasets

Cited by: 0
Authors
Siwek, Patryk [1]
Skruch, Pawel [2]
Dlugosz, Marek [2]
Affiliations
[1] Aptiv Serv Poland SA, Krakow, Poland
[2] AGH Univ Sci & Technol, Krakow, Poland
Source
2023 27TH INTERNATIONAL CONFERENCE ON METHODS AND MODELS IN AUTOMATION AND ROBOTICS, MMAR | 2023
Keywords
large dataset; automotive; reduction; clustering; perception
DOI
10.1109/MMAR58394.2023.10242489
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Large datasets used in automotive consist of a set of recorded sequences that represent possible road scenarios. Such scenarios are mainly utilized as test scenarios to verify developed driver assistance systems. Another application of the dataset is the training and verification of machine learning-based algorithms. As the number of possible road scenarios is, in fact, infinite, the process of selecting representative and meaningful sequences is a difficult and challenging task. This article presents an approach based on various clustering techniques for data reduction for large datasets that are used in the automotive industry to evaluate environmental perception algorithms. The approach is supported by the results obtained on representative datasets.
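The abstract does not say which clustering techniques or sequence features the authors use, so the following is only a minimal sketch of the general idea of clustering-based reduction, not the paper's method. It assumes each recorded sequence has already been summarized as a fixed-length numeric feature vector (the two features here, mean ego speed and object count, are hypothetical), runs a plain k-means, and keeps the one sequence closest to each cluster centroid. The names `kmeans` and `select_representatives` are illustrative.

```python
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def assign(points, centroids):
    """Return, for every point, the index of its nearest centroid."""
    return [
        min(range(len(centroids)), key=lambda j: sq_dist(p, centroids[j]))
        for p in points
    ]


def kmeans(points, k, n_iter=50, seed=0):
    """Plain Lloyd's k-means; returns (centroids, labels)."""
    rng = random.Random(seed)
    # Initialise the centroids from k distinct input points.
    centroids = [list(p) for p in rng.sample(points, k)]
    for _ in range(n_iter):
        labels = assign(points, centroids)
        new_centroids = []
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                new_centroids.append(
                    [sum(col) / len(members) for col in zip(*members)]
                )
            else:
                # Keep the previous centroid if a cluster becomes empty.
                new_centroids.append(centroids[j])
        if new_centroids == centroids:
            break
        centroids = new_centroids
    return centroids, assign(points, centroids)


def select_representatives(features, k, seed=0):
    """Keep the index of the sequence nearest to each cluster centroid."""
    centroids, labels = kmeans(features, k, seed=seed)
    chosen = []
    for j in range(k):
        members = [i for i, lab in enumerate(labels) if lab == j]
        if members:
            chosen.append(
                min(members, key=lambda i: sq_dist(features[i], centroids[j]))
            )
    return sorted(chosen)


# Hypothetical per-sequence features: [mean ego speed, number of objects].
sequences = [
    [10.0, 2.0], [11.0, 3.0], [9.5, 2.5],      # slow, sparse scenes
    [30.0, 20.0], [31.0, 19.0], [29.0, 21.0],  # fast, dense scenes
]
kept = select_representatives(sequences, k=2)
print(kept)  # indices of the sequences retained in the reduced dataset
```

Choosing an actual recorded sequence (a medoid-like pick) rather than the centroid itself keeps the reduced dataset made of real recordings. In practice the features would likely need scaling, and the number of clusters would have to be chosen for the dataset at hand.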
Pages: 234-239
Number of pages: 6