Transfer Learning for Semi-supervised Classification of Non-stationary Data Streams

被引:2
作者
Wen, Yimin [1 ]
Zhou, Qi [2 ]
Xue, Yun [2 ]
Feng, Chao [1 ]
机构
[1] Guilin Univ Elect Technol, Guangxi Key Lab Image & Graph Intelligent Proc, Guilin, Peoples R China
[2] Henan City Univ, Sch Municipal & Surveying Engn, Yiyang, Peoples R China
来源
NEURAL INFORMATION PROCESSING, ICONIP 2020, PT V | 2021年 / 1333卷
基金
中国国家自然科学基金;
关键词
Concept drift; Semi-supervised learning; Transfer learning; CONCEPT DRIFT; ENSEMBLE; CLASSIFIERS;
D O I
10.1007/978-3-030-63823-8_54
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the scenario of data stream classification, the occurrence of recurring concept drift and the scarcity of labeled data are very common, which make the semi-supervised classification of data streams quite challenging. To deal with these issues, a new classification algorithm for partially labeled streaming data with recurring concept drift is proposed. CAPLRD maintains a pool of concept-specific classifiers and utilizes historical classifiers to label unlabeled data, in which the unlabeled data are labeled by a weighted-majority vote strategy, and concept drifts are detected by automatically monitoring the threshold of classification accuracy on different data chunks. The experimental results illustrate that the transfer learning from historical concept-specific classifiers can improve labeling accuracy significantly, the detection of concept drifts and classification accuracy effectively.
引用
收藏
页码:468 / 477
页数:10
相关论文
共 18 条
[1]  
Bertini Jr J., 2012, J. Braz. Comput. Soc., V18, P299
[2]  
Bifet A, 2010, J MACH LEARN RES, V11, P1601
[3]   Reacting to Different Types of Concept Drift: The Accuracy Updated Ensemble Algorithm [J].
Brzezinski, Dariusz ;
Stefanowski, Jerzy .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 25 (01) :81-94
[4]  
Ditzler G, 2011, 2011 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), P2741, DOI 10.1109/IJCNN.2011.6033578
[5]  
Domingos P., 2000, Proceedings. KDD-2000. Sixth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, P71, DOI 10.1145/347090.347107
[6]   COMPOSE: A Semisupervised Learning Framework for Initially Labeled Nonstationary Streaming Data [J].
Dyer, Karl B. ;
Capo, Robert ;
Polikar, Robi .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 25 (01) :12-26
[7]  
Gao J, 2008, P 14 ACM SIGKDD INT, P283, DOI [DOI 10.1145/1401890.1401928, 10.1145/1401890.1401928]
[8]   An ensemble of cluster-based classifiers for semi-supervised classification of non-stationary data streams [J].
Hosseini, Mohammad Javad ;
Gholipour, Ameneh ;
Beigy, Hamid .
KNOWLEDGE AND INFORMATION SYSTEMS, 2016, 46 (03) :567-597
[9]   Tracking recurring contexts using ensemble classifiers: an application to email filtering [J].
Katakis, Ioannis ;
Tsoumakas, Grigorios ;
Vlahavas, Ioannis .
KNOWLEDGE AND INFORMATION SYSTEMS, 2010, 22 (03) :371-391
[10]   Mining Recurring Concept Drifts with Limited Labeled Streaming Data [J].
Li, Peipei ;
Wu, Xindong ;
Hu, Xuegang .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2012, 3 (02)