Hierarchical classification of data streams: a systematic literature review

被引:9
|
作者
Tieppo, Eduardo [1 ,2 ]
dos Santos, Roger Robson [2 ]
Barddal, Jean Paul [2 ]
Nievola, Julio Cesar [2 ]
机构
[1] Inst Fed Parana IFPR, Campus Pinhais, Pinhais, Brazil
[2] Pontificia Univ Catolica Parana PUCPR, Posgrad Informat PPGIa, Curitiba, Parana, Brazil
关键词
Data stream mining; Hierarchical classification; Systematic literature review; Machine learning; ACTIVITY RECOGNITION; OBJECT RECOGNITION; CLASSIFIERS; MACHINE; REPRESENTATION; PERFORMANCE; ALGORITHM; AGREEMENT; QUALITY; DRIFT;
D O I
10.1007/s10462-021-10087-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The classification task usually works with flat and batch learners, assuming problems as stationary and without relations between class labels. Nevertheless, several real-world problems do not assume these premises, i.e., data have labels organized hierarchically and are made available in streaming fashion, meaning that their behavior can drift over time. Existing studies on hierarchical classification do not consider data streams as input of their process, and thus, data is assumed as stationary and handled through batch learners. The same can be said about works on streaming data, as the hierarchical classification is overlooked. Studies concerning each area individually are promising, yet, do not tackle their intersection. This study analyzes the main characteristics of the state-of-the-art works on hierarchical classification for streaming data concerning five aspects: (i) problems tackled, (ii) datasets, (iii) algorithms, (iv) evaluation metrics, and (v) research gaps in the area. We performed a systematic literature review of primary studies and retrieved 3,722 papers, of which 42 were identified as relevant and used to answer the aforementioned research questions. We found that the problems handled by hierarchical classification of data streams include mainly classification of images, human activities, texts, and audio; the datasets are mostly created or synthetic data; the algorithms and evaluation metrics are well-known techniques or based on those; and research gaps are related to dynamic context, data complexity, and computational resources constraints. We also provide implications for future research and experiments to consider common characteristics shared amongst hierarchical classification and data stream classification.
引用
收藏
页码:3243 / 3282
页数:40
相关论文
共 50 条
  • [41] Assistive Technology Classification for Students With Disabilities in Higher Education: A Systematic Literature Review
    Putri, Nuril Kusumawardani Soeprapto
    Yuhana, Umi Laili
    Siahaan, Daniel Oranova
    Rahayu, Wenny
    Pardede, Eric
    IEEE ACCESS, 2025, 13 : 28135 - 28149
  • [42] Classification and. Advantages Parallel Computing in Process Computation : A Systematic Literature Review
    Fernando, Erick
    Murad, Dina Fitria
    Wijanarko, Bambang Dwi
    2018 4TH INTERNATIONAL CONFERENCE ON COMPUTING, ENGINEERING, AND DESIGN (ICCED 2018), 2018, : 143 - 147
  • [43] Data fusion for ITS: A systematic literature review
    Ounoughi, Chahinez
    Ben Yahia, Sadok
    INFORMATION FUSION, 2023, 89 : 267 - 291
  • [44] Gamification of student peer review in education: A systematic literature review
    Indriasari, Theresia Devi
    Luxton-Reilly, Andrew
    Denny, Paul
    EDUCATION AND INFORMATION TECHNOLOGIES, 2020, 25 (06) : 5205 - 5234
  • [45] A Systematic Literature Review on Virtual Machine Consolidation
    Dias, Alexandre H. T.
    Correia, Luiz H. A.
    Malheiros, Neumar
    ACM COMPUTING SURVEYS, 2021, 54 (08)
  • [46] Predicting customer churn: A systematic literature review
    De, Soumi
    Prabu, P.
    JOURNAL OF DISCRETE MATHEMATICAL SCIENCES & CRYPTOGRAPHY, 2022, 25 (07) : 1965 - 1985
  • [47] Enterprise maturity models: a systematic literature review
    Sarmento dos Santos-Neto, Joao Batista
    Cabral Seixas Costa, Ana Paula
    ENTERPRISE INFORMATION SYSTEMS, 2019, 13 (05) : 719 - 769
  • [48] Big data and dynamic capabilities: a bibliometric analysis and systematic literature review
    Rialti, Riccardo
    Marzi, Giacomo
    Ciappei, Cristiano
    Busso, Donatella
    MANAGEMENT DECISION, 2019, 57 (08) : 2052 - 2068
  • [49] Artificial intelligence in information systems research: A systematic literature review and research agenda
    Collins, Christopher
    Dennehy, Denis
    Conboy, Kieran
    Mikalef, Patrick
    INTERNATIONAL JOURNAL OF INFORMATION MANAGEMENT, 2021, 60
  • [50] Big Data Analytics in Association Rule Mining: A Systematic Literature Review
    Shahin, Mahtab
    Peious, Sijo Arakkal
    Sharma, Rahul
    Kaushik, Minakshi
    Ben Yahia, Sadok
    Shah, Syed Attique
    Draheim, Dirk
    2021 THE 3RD INTERNATIONAL CONFERENCE ON BIG DATA ENGINEERING AND TECHNOLOGY, BDET 2021, 2021, : 40 - 49