Multi-label Feature Selection Method Combining Unbiased Hilbert-Schmidt Independence Criterion with Controlled Genetic Algorithm

被引:6
作者
Liu, Chang [1 ]
Ma, Quan [1 ]
Xu, Jianhua [1 ]
机构
[1] Nanjing Normal Univ, Sch Comp Sci & Technol, Nanjing 210023, Jiangsu, Peoples R China
来源
NEURAL INFORMATION PROCESSING (ICONIP 2018), PT IV | 2018年 / 11304卷
关键词
Multi-label learning; Feature selection; Hilbert-Schmidt independence criterion; Sequential forward selection; Genetic algorithm; DEPENDENCE;
D O I
10.1007/978-3-030-04212-7_1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In multi-label learning, some redundant and irrelevant features increase computational cost and even degrade classification performance, which are widely dealt with via feature selection procedure. Unbiased Hilbert-Schmidt independence criterion (HSIC) is a kernel-based dependence measure between feature and label data, which has been combined with greedy search techniques (e.g., sequential forward selection) to search for a locally optimal feature subset. Alternatively, it is possible to achieve a globally optimal solution using genetic algorithm (GA), but usually the final solution prefers to select about a half of original features. In this paper, we propose a new GA variant to control the number of selected features (simply CGA). Then CGA is integrated with HSIC to formulate a novel multi-label feature selection technique (CGAHSIC) for a given size of feature subset. The effectiveness of our proposed CGAHSIC is validated through comparing with four existing algorithms, on four benchmark data sets, according to four indicative multi-label classification evaluation metrics (Hamming loss, accuracy, F1 and subset accuracy).
引用
收藏
页码:3 / 14
页数:12
相关论文
共 30 条
[1]  
[Anonymous], 2014, IEEE T KNOWLEDGE DAT
[2]  
[Anonymous], 2013, IBEROAMERICAN C PATT, DOI DOI 10.1007/978-3-642-41827-3.66
[3]  
[Anonymous], 2012, 12 IND C DAT MIN ICD
[4]  
[Anonymous], 2016, Multilabel Classification, DOI [10.1007/978-3-319-41111-8, DOI 10.1007/978-3-319-41111-8]
[5]   Document transformation for multi-label feature selection in text categorization [J].
Chen, Weizhu ;
Yan, Jun ;
Zhang, Benyu ;
Chen, Zheng ;
Yang, Qiang .
ICDM 2007: PROCEEDINGS OF THE SEVENTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, 2007, :451-+
[6]  
Demsar J, 2006, J MACH LEARN RES, V7, P1
[7]  
Duda R.O., 2001, Pattern Classification
[8]  
Gretton A., 2005, Algorithmic learning theory, P63, DOI DOI 10.1007/11564089_7
[9]  
Holland J.H., 1992, ADAPTATION NATURE AR, DOI 10.7551/mitpress/1090.001.0001
[10]  
Jungjit S., 2015, Proceedings of the 23rd European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning, P285