Adaptive Learning With Extreme Verification Latency in Non-Stationary Environments

被引:0
|
作者
Idrees, Mobin M. M. [1 ]
Stahl, Frederic [2 ]
Badii, Atta [1 ]
机构
[1] Univ Reading, Dept Comp Sci, Reading RG6 6AH, England
[2] German Res Ctr Artificial Intelligence GmbH DFKI, Marine Percept, D-26129 Oldenburg, Germany
关键词
Data mining; Classification algorithms; Predictive models; Prediction algorithms; Training; Labeling; Benchmark testing; Concept drift; data stream mining; extreme verification latency; non-stationary environment; semi-supervised learning; CONCEPT DRIFTING DATA; DATA STREAMS; MICRO-CLUSTERS; CLASSIFICATION; ALGORITHMS; FRAMEWORK; SYSTEMS; MODEL;
D O I
10.1109/ACCESS.2022.3225225
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Existing Data Stream Mining algorithms assume the availability of labelled and balanced data streams. However, in many real-world applications such as Robotics, Weather Monitoring, Fraud-Detection systems, Cyber Security, and Human Activity Recognition, a vast amount of high-speed data is generated by Internet of Things sensors and real-time data on the Internet are unlabelled. Furthermore, the prediction models need to learn in Non-Stationary Environments due to evolving concepts. Manual labelling of these data streams is not practical due to the need for domain expertise and the time-resource-prohibitive nature of the required effort. To deal with such scenarios, existing approaches are self-Learning or Cluster-Guided Classification (CGC) which predict the pseudo-labels, which further update the prediction models. Previous studies have yet to establish a clear and conclusive view as to when, and why one pseudo-labelling approach should be preferable to another and what causes an approach to fail. In this research, we propose a novel approach, "Predictor for Streaming Data with Scarce Labels " (PSDSL), which is capable of intelligently switching between self-learning, CGC and micro-clustering strategies, based on the problem it is applied to, i.e., the different characteristics of the data streams. In PSDSL a novel approach called Envelope-Clustering has been introduced to resolve the conflict during the cluster labelling which suggested a confidence measure approach to ensure the quality and correctness of labels assigned to the clusters. The auto parameter tuning mechanism of PSDSL eliminates the human dependency and determines the best value of number of centroids from initial labelled data. The predictive performance of the PSDSL is evaluated on non-stationary datasets, synthetic data-streams, and real-world datasets. The approach has shown promising results on randomised datasets as well as on synthetic data-streams, as compared with state-of-the-art approaches. This is the first large-scale study on an adaptive extreme verification approach that supports automatic parameter tuning and intelligent switching of pseudo-labelling strategy, thus reducing the dependency of machine learning on human input.
引用
收藏
页码:127345 / 127364
页数:20
相关论文
共 50 条
  • [1] Stream-Based Active Learning with Verification Latency in Non-stationary Environments
    Castellani, Andrea
    Schmitt, Sebastian
    Hammer, Barbara
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT IV, 2022, 13532 : 260 - 272
  • [2] Density-based Core Support Extraction for Non-stationary Environments with Extreme Verification Latency
    Ferreira, Raul S.
    da Silva, Bruno M. A.
    Teixeira, Wendell W.
    Zimbrao, Geraldo
    Alvim, Leandro
    2018 7TH BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS), 2018, : 181 - 187
  • [3] Adaptive deep reinforcement learning for non-stationary environments
    Jin ZHU
    Yutong WEI
    Yu KANG
    Xiaofeng JIANG
    Geir E.DULLERUD
    Science China(Information Sciences), 2022, 65 (10) : 225 - 241
  • [4] Adaptive deep reinforcement learning for non-stationary environments
    Jin Zhu
    Yutong Wei
    Yu Kang
    Xiaofeng Jiang
    Geir E. Dullerud
    Science China Information Sciences, 2022, 65
  • [5] Adaptive deep reinforcement learning for non-stationary environments
    Zhu, Jin
    Wei, Yutong
    Kang, Yu
    Jiang, Xiaofeng
    Dullerud, Geir E.
    SCIENCE CHINA-INFORMATION SCIENCES, 2022, 65 (10)
  • [6] Adaptive and on-line learning in non-stationary environments
    Lughofer, Edwin
    Sayed-Mouchaweh, Moamar
    EVOLVING SYSTEMS, 2015, 6 (02) : 75 - 77
  • [7] AMANDA: Semi-supervised density-based adaptive model for non-stationary data with extreme verification latency
    Ferreira, Raul S.
    Zimbrao, Geraldo
    Alvim, Leandro G. M.
    INFORMATION SCIENCES, 2019, 488 : 219 - 237
  • [8] Adaptive beamforming in non-stationary environments
    Cox, H
    THIRTY-SIXTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS - CONFERENCE RECORD, VOLS 1 AND 2, CONFERENCE RECORD, 2002, : 431 - 438
  • [9] Social Learning in non-stationary environments
    Boursier, Etienne
    Perchet, Vianney
    Scarsini, Marco
    INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 167, 2022, 167
  • [10] Adaptive Learning with Covariate Shift-Detection for Non-Stationary Environments
    Raza, Haider
    Prasad, Girijesh
    Li, Yuhua
    2014 14TH UK WORKSHOP ON COMPUTATIONAL INTELLIGENCE (UKCI), 2014, : 73 - 80