A Survey on Multi-Label Data Stream Classification

被引:45
作者
Zheng, Xiulin [1 ,2 ]
Li, Peipei [1 ,2 ]
Chu, Zhe [1 ,2 ]
Hu, Xuegang [1 ,2 ,3 ]
机构
[1] Hefei Univ Technol, Key Lab Knowledge Engn Big Data, Minist Educ, Hefei 230601, Anhui, Peoples R China
[2] Hefei Univ Technol, Sch Comp Sci & Informat Engn, Hefei 230601, Anhui, Peoples R China
[3] Anhui Prov Key Lab Ind Safety & Emergency Technol, Hefei 230601, Anhui, Peoples R China
关键词
Data stream mining; multi-label data; multi-label classification; EXTREME LEARNING-MACHINE; CONCEPT-DRIFTING DATA; CLASS IMBALANCE; ENSEMBLE; CLASSIFIERS; SELECTION;
D O I
10.1109/ACCESS.2019.2962059
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Nowadays, many real-world applications of our daily life generate massive volume of streaming data at a higher speed than ever before, to name a few, Web clicking data streams, sensor network data and credit transaction streams. Contrary to traditional data mining using static datasets, there are several challenges for data stream mining, for instance, finite memory, one-pass and timely reaction. In this survey, we provide a comprehensive review of existing multi-label streams mining algorithms and categorize these methods based on different perspectives, which mainly focus on the multi-label data stream classification. We first briefly summarize existing multi-label and data stream classification algorithms and discuss their merits and demerits. Secondly, we identify mining constraints on classification for multi-label streaming data, and present a comprehensive study in algorithms for multi-label data stream classification. Finally, several challenges and open issues in multi-label data stream classification are discussed, which are worthwhile to be pursued by the researchers in the future.
引用
收藏
页码:1249 / 1275
页数:27
相关论文
共 137 条
  • [1] Abdulsalam H, 2008, LECT NOTES COMPUT SC, V5181, P643, DOI 10.1007/978-3-540-85654-2_54
  • [2] Aggarwal C. C., 2012, Proceedings of the 2012 SIAM International Conference on Data Mining, P624
  • [3] Aggarwal CC, 2005, SIAM PROC S, P56
  • [4] Aggarwal CC, 2010, SCIENTIFIC DATA MINING AND KNOWLEDGE DISCOVERY: PRINCIPLES AND FOUNDATIONS, P377, DOI 10.1007/978-3-642-02788-8_14
  • [5] An Efficient Semi-Supervised Multi-label Classifier Capable of Handling Missing Labels
    Akbarnejad, Amirhossein Hosseini
    Baghshah, Mahdieh Soleymani
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2019, 31 (02) : 229 - 242
  • [6] Recurring and Novel Class Detection Using Class-Based Ensemble for Evolving Data Stream
    Al-Khateeb, Tahseen
    Masud, Mohammad M.
    Al-Naami, Khaled M.
    Seker, Sadi Evren
    Mustafa, Ahmad M.
    Khan, Latifur
    Trabelsi, Zouheir
    Aggarwal, Charu
    Han, Jiawei
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (10) : 2752 - 2764
  • [7] PruDent: A Pruned and Confident Stacking Approach for Multi-Label Classification
    Alali, Abdulaziz
    Kubat, Miroslav
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, 27 (09) : 2480 - 2493
  • [8] Alam MA, 2013, 2013 INTERNATIONAL CONFERENCE ON INFORMATICS, ELECTRONICS & VISION (ICIEV)
  • [9] ALattas AM, 2018, 2018 21ST SAUDI COMPUTER SOCIETY NATIONAL COMPUTER CONFERENCE (NCC)
  • [10] Alippi C., 2010, P INT JOINT C NEUR N, P1, DOI DOI 10.1109/IJCNN.2010.5596899