Applying semi-supervised learning in hierarchical multi-label classification

被引:13
作者
Santos, Araken [1 ]
Canuto, Anne [2 ]
机构
[1] Fed Rural Univ Semiarido, Exact Technol & Human Sci Dept, BR-59515000 Angicos, RN, Brazil
[2] Fed Univ RN, Dept Informat & Appl Math DIMAp, Natal, RN, Brazil
关键词
Multi-label classification; Hierarchical classification; Semi-supervised learning; DECISION TREES;
D O I
10.1016/j.eswa.2014.03.052
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In classification problems with hierarchical structures of labels, the target function must assign labels that are hierarchically organized and it can be used either for single-label (one label per instance) or multi-label classification problems (more than one label per instance). In parallel to these developments, the idea of semi-supervised learning has emerged as a solution to the problems found in a standard supervised learning procedure (used in most classification algorithms). It combines labelled and unlabelled data during the training phase. Some semi-supervised methods have been proposed for single-label classification methods. However, very little effort has been done in the context of multi-label hierarchical classification. Therefore, this paper proposes a new method for supervised hierarchical multi-label classification, called HMC-RAkEL. Additionally, we propose the use of semi-supervised learning, self-training, in hierarchical multi-label classification, leading to three new methods, called HMC-SSBR, HMC-SSLP and HMC-SSRAkEL. In order to validate the feasibility of these methods, an empirical analysis will be conducted, comparing the proposed methods with their corresponding supervised versions. The main aim of this analysis is to observe whether the semi-supervised methods proposed in this paper have similar performance of the corresponding supervised versions. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:6075 / 6085
页数:11
相关论文
共 33 条
[1]  
Alves RT, 2008, LECT N BIOINFORMAT, V5167, P1
[2]  
[Anonymous], 2006, BOOK REV IEEE T NEUR
[3]  
[Anonymous], 2008, ISMIR
[4]   Hierarchical multi-label prediction of gene function [J].
Barutcuoglu, Z ;
Schapire, RE ;
Troyanskaya, OG .
BIOINFORMATICS, 2006, 22 (07) :830-836
[5]   Multi-dimensional classification with Bayesian networks [J].
Bielza, C. ;
Li, G. ;
Larranaga, P. .
INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2011, 52 (06) :705-727
[6]  
Blockeel H, 2006, LECT NOTES ARTIF INT, V4213, P18
[7]  
Blum A., 1998, Proceedings of the Eleventh Annual Conference on Computational Learning Theory, P92, DOI 10.1145/279943.279962
[8]  
Cerri R., 2014, COMPUTATIONAL INTELL, P1467
[9]  
Cerri R., 2009, AN 5 WORKSH ALG APL, P75
[10]  
Cerri R, 2009, LECT N BIOINFORMAT, V5676, P109, DOI 10.1007/978-3-642-03223-3_10