Robust Sparse Online Learning for Data Streams with Streaming Features

被引:0
作者
Chen, Zhong [1 ]
He, Yi [2 ]
Wu, Di [3 ]
Zhan, Huixin [4 ]
Sheng, Victor [5 ]
Zhang, Kun [6 ]
机构
[1] Southern Illinois Univ, Carbondale, IL 62901 USA
[2] Old Dominion Univ, Norfolk, VA USA
[3] Southwest Univ, El Paso, TX 79925 USA
[4] Cedars Sinai Med Ctr, Los Angeles, CA 90048 USA
[5] Texas Tech Univ, Lubbock, TX 79409 USA
[6] Xavier Univ, New Orleans, LA 70125 USA
来源
PROCEEDINGS OF THE 2024 SIAM INTERNATIONAL CONFERENCE ON DATA MINING, SDM | 2024年
关键词
online learning; sparse learning; streaming feature selection; open feature spaces; l(1,2) mixed norm;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sparse online learning has received extensive attention during the past few years. Most of existing algorithms that utilize (l(1)-noun regularization or l(1)-ball projection assume that the feature space is fixed or changes by following explicit constraints. However, this assumption does not always hold in many real applications. Motivated by this observation, we propose a new online learning algorithm tailored for data streams described by open feature spaces, where new features can be occurred, and old features may be vanished over various time spans. Our algorithm named RSOL provides a strategy to adapt quickly to such feature dynamics by encouraging sparse model representation with an and (2-mixed regularizer. We leverage the proximal operator of the (1,2-mixed noun and show that our RSOL algorithm enjoys a closed-form solution at each iteration. A sub-linear regret bound of our proposed algorithm is guaranteed with a solid theoretical analysis. Empirical results benchmarked on nine streaming datasets validate the effectiveness of the proposed RSOL method over three state-of-the-art algorithms.
引用
收藏
页码:181 / 189
页数:9
相关论文
共 45 条
[1]   Oracle-Based Robust Optimization via Online Learning [J].
Ben-Tal, Aharon ;
Hazan, Elad ;
Koren, Tomer ;
Mannor, Shie .
OPERATIONS RESEARCH, 2015, 63 (03) :628-638
[2]  
Beyazit E, 2019, AAAI CONF ARTIF INTE, P3232
[3]  
Capponi A, 2019, IEEE COMMUN SURV TUT, V21, P2419, DOI [10.1109/COMST.2019.2914030, 10.1109/isscs.2019.8801767]
[4]  
Chen Zhong, 2022, 2022 IEEE International Conference on Big Data (Big Data), P495, DOI 10.1109/BigData55660.2022.10021084
[5]  
Chen Z., 2023, Mach. Learn., V11, P1
[6]   Projection Dual Averaging Based Second-order Online Learning [J].
Chen, Zhong ;
Zhan, Huixin ;
Sheng, Victor ;
Edwards, Andrea ;
Zhang, Kun .
2022 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2022, :51-60
[7]   An effective cost-sensitive sparse online learning framework for imbalanced streaming data classification and its application to online anomaly detection [J].
Chen, Zhong ;
Sheng, Victor ;
Edwards, Andrea ;
Zhang, Kun .
KNOWLEDGE AND INFORMATION SYSTEMS, 2023, 65 (01) :59-87
[8]   Adaptive robust local online density estimation for streaming data [J].
Chen, Zhong ;
Fang, Zhide ;
Sheng, Victor ;
Zhao, Jiabin ;
Fan, Wei ;
Edwards, Andrea ;
Zhang, Kun .
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2021, 12 (06) :1803-1824
[9]  
Chen Z, 2018, IEEE INT CONF BIG DA, P201, DOI 10.1109/BigData.2018.8621923
[10]  
Chen Zhong, 2017, SIAM Rev Soc Ind Appl Math, V2017, P759, DOI 10.1137/1.9781611974973.85