Online Passive-Aggressive Active Learning for Trapezoidal Data Streams

被引:7
作者
Liu, Yanfang [1 ,2 ]
Fan, Xiaocong [3 ]
Li, Wenbin [1 ]
Gao, Yang [1 ]
机构
[1] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing 210023, Peoples R China
[2] Longyan Univ, Coll Math & Informat Engn, Longyan 364012, Peoples R China
[3] Calif State Univ, Coll Sci Technol Engn & Math, San Marcos, CA 92096 USA
基金
中国国家自然科学基金;
关键词
Multiclass classification; online active learning; online learning; passive-aggressive (PA) learning; trapezoidal data streams; PERCEPTRON; MODEL;
D O I
10.1109/TNNLS.2022.3178880
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The idea of combining the active query strategy and the passive-aggressive (PA) update strategy in online learning can be credited to the PA active (PAA) algorithm, which has proven to be effective in learning linear classifiers from datasets with a fixed feature space. We propose a novel family of online active learning algorithms, named PAA learning for trapezoidal data streams (PAA(TS)) and multiclass PAA(TS) (MPAA(TS)) (and their variants), for binary and multiclass online classification tasks on trapezoidal data streams where the feature space may expand over time. Under the context of an ever-changing feature space, we provide the theoretical analysis of the mistake hounds for both PAA(TS) and MPAA(TS). Our experiments on a wide variety of benchmark datasets have confirm that the combination of the instance-regulated active query strategy and the PA update strategy is much more effective in learning from trapezoidal data streams. We have also compared PAA(TS) with online learning with streaming features (OLSF)-the state-of-the-art approach in learning linear classifiers from trapezoidal data streams. PAA(TS) could achieve much better classification accuracy, especially for large-scale real-world data streams.
引用
收藏
页码:6725 / 6739
页数:15
相关论文
共 36 条
[1]   Scale Invariant Learning from Trapezoidal Data Streams [J].
Alagurajah, Jeevithan ;
Yuan, Xu ;
Wu, Xindong .
PROCEEDINGS OF THE 35TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING (SAC'20), 2020, :505-508
[2]  
[Anonymous], 2009, ICML, DOI DOI 10.1145/1553374.1553462
[3]  
Beyazit E, 2019, AAAI CONF ARTIF INTE, P3232
[4]   Learning Simplified Decision Boundaries from Trapezoidal Data Streams [J].
Beyazit, Ege ;
Hosseini, Matin ;
Maida, Anthony ;
Wu, Xindong .
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT I, 2018, 11139 :508-517
[5]   Second-order perceptron algorithm [J].
Cesa-Bianchi, N ;
Conconi, A ;
Gentile, C .
SIAM JOURNAL ON COMPUTING, 2005, 34 (03) :640-668
[6]  
Cesa-Bianchi N, 2006, J MACH LEARN RES, V7, P1205
[7]  
Crammer K, 2006, J MACH LEARN RES, V7, P551
[8]  
Crammer K, 2004, ADV NEUR IN, V16, P1229
[9]   Adaptive regularization of weight vectors [J].
Crammer, Koby ;
Kulesza, Alex ;
Dredze, Mark .
MACHINE LEARNING, 2013, 91 (02) :155-187
[10]  
Dasgupta S, 2009, J MACH LEARN RES, V10, P281