Assessment of the real-time pattern recognition capability of machine learning algorithms

被引:1
作者
Polytarchos, Elias [1 ]
Bardaki, Cleopatra [2 ]
Pramatari, Katerina [1 ]
机构
[1] Athens Univ Econ & Business, Dept Management Sci & Technol, 28 Oktovriou 76, Athens, Greece
[2] Harokopio Univ, Dept Informat & Telemat, Tavros, Greece
关键词
clustering algorithms; data streams; IoT; machine learning; pattern recognition;
D O I
10.1002/sam.11701
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Nowadays data streams from different sources, like blockchain-based and traditional financial transactions, social networks, and interconnected Internet of Things (IoT) devices, are becoming increasingly large in volume and the need to recognize patterns in real time from these streams, while adapting to their velocity and veracity, is emerging. Established machine learning algorithms used for pattern recognition methods have not been designed taking under account the volume, velocity, diversity, and accuracy of the data streams. This research contributes with an approach for assessing the pattern recognition capabilities of established machine learning algorithms when handling volatile data in real time and proposes a system that adapts the algorithms to the requirements of data streams, as well as assesses their pattern recognition capabilities based on established criteria. The system was applied for assessing five machine learning algorithms with input from a data stream from Bluetooth beacons tracking consumers in a retail store. This research can support future data scientists and analysts who need to reveal data patterns in big, volatile data streams in real time in order to support effective decision-making in the respective application domain. Copyright (c) 2024 John Wiley & Sons, Ltd.
引用
收藏
页数:15
相关论文
共 63 条
  • [1] Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
  • [2] Aggarwal C., 2003, A Framework for Clustering Evolving Data Streams
  • [3] Alamuri M, 2014, IEEE IJCNN, P1907, DOI 10.1109/IJCNN.2014.6889941
  • [4] Alpaydin E., 2020, INTRO MACHINE LEARNI
  • [5] Amit Y, 2008, J MACH LEARN RES, V9, P1399
  • [6] Adaptive and self-confident on-line learning algorithms
    Auer, P
    Cesa-Bianchi, N
    Gentile, C
    [J]. JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2002, 64 (01) : 48 - 75
  • [7] Berkhin P., 1998, A survey of clustering data mining techniques, grouping multidimensional data
  • [8] Bernardo Jose M., 2000, BAYESIAN THEORY, V1st
  • [9] Bishop ChristopherM., 2011, Information science and statistics
  • [10] Density-Based Clustering over an Evolving Data Stream with Noise
    Cao, Feng
    Ester, Martin
    Qian, Weining
    Zhou, Aoying
    [J]. PROCEEDINGS OF THE SIXTH SIAM INTERNATIONAL CONFERENCE ON DATA MINING, 2006, : 328 - +