Ensemble multi-label classification using closed frequent labelsets and label taxonomies

被引：0

作者：

Ferrandin, Mauri ^{[1
]}

Cerri, Ricardo ^{[2
]}

机构：

[1] Univ Fed Santa Catarina, Dept Control Automat & Comp Sci, Rua Joao Pessoa 2750, BR-89036256 Blumenau, SC, Brazil

[2] Univ Sao Paulo, Inst Ciencias Matemat & Computacao, Ave Trabalhador Sao Carlense,400 Ctr, BR-13566590 Sao Carlos, SP, Brazil

来源：

APPLIED SOFT COMPUTING | 2025年 / 171卷

关键词：

Multi-label classification; Ensemble multi-label classification; Problem transformation;

D O I：

10.1016/j.asoc.2025.112853

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Ensembles are computational models that combine the strengths of multiple algorithms or models to enhance predictive accuracy, robustness, and generalization across various applications in machine learning and data analysis. They can mitigate the risk of overfitting and improve model stability, reducing the impact of individual algorithmic biases. These are valuable tools for achieving superior performance in complex and dynamic real-world scenarios. Despite constant advances in this research area, recent studies have shown that state-of-the-art ensembles for multi-label classification are still based on classical ensemble methods from 2016. This study proposes three new ensemble algorithms, called the ensemble of flat-to-hierarchical (EF2H) versions, developed using the F2H multi-label classification model. The F2H algorithm transforms the multi- label problem into a hierarchical multi-label problem to generate predictions. Experiments were conducted with 32 multi-label datasets, and the results were compared with those of the state-of-the-art algorithms in this field. The results demonstrate that the EF2H versions are highly competitive algorithms, outperforming the well-known ensemble of classifier chains (ECC) and achieving predictive performance equivalent to that of the random forest of decision trees with binary relevance (RFDTBR) and random forest of predictive clustering trees (RFPCT) algorithms.

引用

页数：12

共 38 条

[1]

Blockeel H., 1998, Machine Learning. Proceedings of the Fifteenth International Conference (ICML'98), P55

[2] Comprehensive comparative study of multi-label classification methods [J].

Bogatinovski, Jasmin ;

Todorovski, Ljupco ;

Dzeroski, Saso ;

Kocev, Dragi .

EXPERT SYSTEMS WITH APPLICATIONS, 2022, 203

[3] Learning multi-label scene classification [J].

Boutell, MR ;

Luo, JB ;

Shen, XP ;

Brown, CM .

PATTERN RECOGNITION, 2004, 37 (09) :1757-1771

[4] Tips, guidelines and tools for managing multi-label datasets: The mldr.datasets R package and the Cometa data repository [J].

Charte, Francisco ;

Rivera, Antonio J. ;

Charte, David ;

del Jesus, Mara J. ;

Herrera, Francisco .

NEUROCOMPUTING, 2018, 289 :68-85

[5]

Charte F, 2013, LECT NOTES COMPUT SC, V8073, P150, DOI 10.1007/978-3-642-40846-5_16

[6] Combining instance-based learning and logistic regression for multilabel classification [J].

Cheng, Weiwei ;

Huellermeier, Eyke .

MACHINE LEARNING, 2009, 76 (2-3) :211-225

[7]

Clare A., 2001, PROC EUROPEAN C PRIN, P42, DOI 10.1007/ 3-540-44794-6_4

[8]

Demsar J, 2006, J MACH LEARN RES, V7, P1

[9] Ensemble methods in machine learning [J].

Dietterich, TG .

MULTIPLE CLASSIFIER SYSTEMS, 2000, 1857 :1-15

[10] Combination of classification and regression in decision tree for multi-labeling image annotation and retrieval [J].

Fakhari, Ali ;

Moghadam, Amir Masoud Eftekhari .

APPLIED SOFT COMPUTING, 2013, 13 (02) :1292-1302

← 1 2 3 4 →