Evolving multi-label classification rules by exploiting high-order label correlations

被引：14

作者：

Nazmi, Shabnam ^{[1
]}

Yan, Xuyang ^{[2
]}

Homaifar, Abdollah ^{[1
]}

Doucette, Emily ^{[3
]}

机构：

[1] North Carolina A&T State Univ, Dept Elect & Comp Engn, 1601 E Market St, Greensboro, NC 27411 USA

[2] North Carolina A&T State Univ, Elect Engn, 1601 E Market St, Greensboro, NC USA

[3] Air Force Res Lab, Munit Directorate, 101 West Eglin Blvd, Eglin AFB, FL USA

来源：

NEUROCOMPUTING | 2020年 / 417卷

关键词：

Multi-label classification; High-order label correlations; Label powerset; Learning classifier systems; Genetic algorithms; FEATURE-SELECTION; NEURAL-NETWORKS; K-LABELSETS; SYSTEMS; DISTANCE; MACHINE; KNN;

D O I：

10.1016/j.neucom.2020.07.055

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In multi-label classification tasks, each problem instance is associated with multiple classes simultaneously. In such settings, the correlation between labels contain valuable information that can be used to obtain more accurate classification models. The correlation between labels can be exploited in different levels such as capturing the pair-wise correlation or exploiting the higher-order correlations. Even though the high-order approach is more capable of modeling the correlation, it is computationally more demanding and has scalability issues. This paper aims at exploiting the high-order label correlations locally using supervised learning classifier systems (UCS). For this purpose, the label powerset (LP) strategy is employed and a prediction aggregation is utilized that improves the prediction capability of the LP method in the presence of unseen labelsets. Exact match ratio and Hamming loss measures are considered to evaluate the rule performance and the expected fitness value of individual classification rules is investigated using both metrics. Also, a computational complexity analysis is provided for training the proposed algorithm. The experimental results of the proposed method are compared with other well-known LP-based methods on multiple benchmark datasets and confirm the competitive performance of this method. (C) 2020 Elsevier B.V. All rights reserved.

引用

页码：176 / 186

页数：11

共 54 条

[1] Applying multi-label techniques in emotion identification of short texts [J].

Almeida, Alex M. G. ;

Cerri, Ricardo ;

Paraiso, Emerson Cabrera ;

Mantovani, Rafael Gomes ;

Barbon Junior, Sylvio .

NEUROCOMPUTING, 2018, 320 :35-46

[2]

[Anonymous], 2001, EUR C PRINC DAT MIN

[3]

[Anonymous], 2008, P ECML PKDD 2008 WOR

[4]

[Anonymous], 2010, ICML 10 27 INT C MAC

[5] Accuracy-based Learning Classifier Systems:: Models, analysis and applications to classification tasks [J].

Bernadó-Mansilla, E ;

Garrell-Guiu, JM .

EVOLUTIONARY COMPUTATION, 2003, 11 (03) :209-238

[6] Learning multi-label scene classification [J].

Boutell, MR ;

Luo, JB ;

Shen, XP ;

Brown, CM .

PATTERN RECOGNITION, 2004, 37 (09) :1757-1771

[7] Feature Selection for Multi-label Classification Using Neighborhood Preservation [J].

Cai, Zhiling ;

Zhu, William .

IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2018, 5 (01) :320-330

[8] Implications of the curse of dimensionality for supervised learning classifier systems: theoretical and empirical analyses [J].

Debie, Essam ;

Shafi, Kamran .

PATTERN ANALYSIS AND APPLICATIONS, 2019, 22 (02) :519-536

[9]

Demsar J, 2006, J MACH LEARN RES, V7, P1

[10]

Diplaris S, 2005, LECT NOTES COMPUT SC, V3746, P448

← 1 2 3 4 5 6 →