A Survey on Multi-Label Data Stream Classification

被引:45
作者
Zheng, Xiulin [1 ,2 ]
Li, Peipei [1 ,2 ]
Chu, Zhe [1 ,2 ]
Hu, Xuegang [1 ,2 ,3 ]
机构
[1] Hefei Univ Technol, Key Lab Knowledge Engn Big Data, Minist Educ, Hefei 230601, Anhui, Peoples R China
[2] Hefei Univ Technol, Sch Comp Sci & Informat Engn, Hefei 230601, Anhui, Peoples R China
[3] Anhui Prov Key Lab Ind Safety & Emergency Technol, Hefei 230601, Anhui, Peoples R China
关键词
Data stream mining; multi-label data; multi-label classification; EXTREME LEARNING-MACHINE; CONCEPT-DRIFTING DATA; CLASS IMBALANCE; ENSEMBLE; CLASSIFIERS; SELECTION;
D O I
10.1109/ACCESS.2019.2962059
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Nowadays, many real-world applications of our daily life generate massive volume of streaming data at a higher speed than ever before, to name a few, Web clicking data streams, sensor network data and credit transaction streams. Contrary to traditional data mining using static datasets, there are several challenges for data stream mining, for instance, finite memory, one-pass and timely reaction. In this survey, we provide a comprehensive review of existing multi-label streams mining algorithms and categorize these methods based on different perspectives, which mainly focus on the multi-label data stream classification. We first briefly summarize existing multi-label and data stream classification algorithms and discuss their merits and demerits. Secondly, we identify mining constraints on classification for multi-label streaming data, and present a comprehensive study in algorithms for multi-label data stream classification. Finally, several challenges and open issues in multi-label data stream classification are discussed, which are worthwhile to be pursued by the researchers in the future.
引用
收藏
页码:1249 / 1275
页数:27
相关论文
共 137 条
  • [41] Domingos P., 2000, Proceedings. KDD-2000. Sixth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, P71, DOI 10.1145/347090.347107
  • [42] Elisseeff A, 2002, ADV NEUR IN, V14, P681
  • [43] Online ensemble learning: An empirical study
    Fern, A
    Givan, R
    [J]. MACHINE LEARNING, 2003, 53 (1-2) : 71 - 109
  • [44] Multilabel classification via calibrated label ranking
    Fuernkranz, Johannes
    Huellermeier, Eyke
    Mencia, Eneldo Loza
    Brinker, Klaus
    [J]. MACHINE LEARNING, 2008, 73 (02) : 133 - 153
  • [45] Gaber MM, 2005, SIGMOD REC, V34, P18, DOI 10.1145/1083784.1083789
  • [46] Gaber MM, 2007, DATA STREAMS, P39, DOI [DOI 10.1007/978, DOI 10.1007/978-0-387-47534-9_]
  • [47] Gama J, 2004, LECT NOTES ARTIF INT, V3171, P286
  • [48] Gama J., ISSUES EVALUATION ST
  • [49] A Survey on Concept Drift Adaptation
    Gama, Joao
    Zliobaite, Indre
    Bifet, Albert
    Pechenizkiy, Mykola
    Bouchachia, Abdelhamid
    [J]. ACM COMPUTING SURVEYS, 2014, 46 (04)
  • [50] Gama J, 2013, INFORM-J COMPUT INFO, V37, P21