Online optimization for variable selection in data streams

被引:7
|
作者
Anagnostopoulos, Christoforos [1 ]
Tasoulis, Dimitris [1 ]
Hand, David J. [1 ]
Adams, Niall M.
机构
[1] Univ London Imperial Coll Sci Technol & Med, Inst Math Sci, London SW7 2PG, England
来源
ECAI 2008, PROCEEDINGS | 2008年 / 178卷
基金
英国工程与自然科学研究理事会;
关键词
D O I
10.3233/978-1-58603-891-5-132
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Variable selection for regression is a classical statistical problem, motivated by concerns that too many covariates invite overfitting. Existing approaches notably include a class of convex optimisation techniques, such as the Lasso algorithm. Such techniques are invariably reliant on assumptions that are unrealistic in streaming contexts, namely that the data is available off-line and the correlation structure is static. In this paper, we relax both these constraints, proposing for the first time an online implementation of the Lasso algorithm with exponential forgetting. We also optimise the model dimension and the speed of forgetting in an online manner, resulting in a fully automatic scheme. In simulations our scheme improves on recursive least squares in dynamic environments, while also featuring model discovery and changepoint detection capabilities.
引用
收藏
页码:132 / +
页数:2
相关论文
共 50 条
  • [31] Novelty Detection and Online Learning for Chunk Data Streams
    Wang, Yi
    Ding, Yi
    He, Xiangjian
    Fan, Xin
    Lin, Chi
    Li, Fengqi
    Wang, Tianzhu
    Luo, Zhongxuan
    Luo, Jiebo
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (07) : 2400 - 2412
  • [32] On the Online Classification of Data Streams Using Weak Estimators
    Tavasoli, Hanane
    Oommen, B. John
    Yazidi, Anis
    TRENDS IN APPLIED KNOWLEDGE-BASED SYSTEMS AND DATA SCIENCE, 2016, 9799 : 68 - 79
  • [33] An Online Approach to Determine Correlation between Data Streams
    Lal, Devesh Kumar
    Suman, Ugrasen
    COMPANION OF THE ACM/SPEC INTERNATIONAL CONFERENCE ON PERFORMANCE ENGINEERING, ICPE 2021, 2021, : 1 - 7
  • [34] WIP: Towards Optimal Online Approximation of Data Streams
    Sitbon, Phillip
    Bulusu, Nirupama
    Feng, Wu-chi
    2011 INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING IN SENSOR SYSTEMS AND WORKSHOPS (DCOSS), 2011,
  • [35] Online Clustering for Topic Detection in Social Data Streams
    Comito, Carmela
    Pizzuti, Clara
    Procopio, Nicola
    2016 IEEE 28TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2016), 2016, : 362 - 369
  • [36] Online Sparse Representation Clustering for Evolving Data Streams
    Chen, Jie
    Yang, Shengxiang
    Fahy, Conor
    Wang, Zhu
    Guo, Yinan
    Chen, Yingke
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (01) : 525 - 539
  • [37] Online Distance Measurement for Tree Data Event Streams
    Kuehn, Eileen
    Streit, Achim
    2016 IEEE 14TH INTL CONF ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, 14TH INTL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING, 2ND INTL CONF ON BIG DATA INTELLIGENCE AND COMPUTING AND CYBER SCIENCE AND TECHNOLOGY CONGRESS (DASC/PICOM/DATACOM/CYBERSC, 2016, : 681 - 688
  • [38] Online Meta-Forest for Regression Data Streams
    Shaker, Ammar
    Gartner, Christoph
    He, Xiao
    Yu, Shujian
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [39] On the online classification of data streams using weak estimators
    Yazidi, Anis (anis.yazidi@hioa.no), 1600, Springer Verlag (9799):
  • [40] Online Sparse Representation Clustering for Evolving Data Streams
    Chen, Jie
    Yang, Shengxiang
    Fahy, Conor
    Wang, Zhu
    Guo, Yinan
    Chen, Yingke
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (01) : 525 - 539