Feature selection for multi-label classification based on neighborhood rough sets

被引:0
作者
Duan, Jie [1 ]
Hu, Qinghua [1 ]
Zhang, Lingjun [1 ]
Qian, Yuhua [2 ]
Li, Deyu [2 ]
机构
[1] School of Computer Science and Technology, Tianjin University, Tianjin
[2] School of Computer and Information Technology, Shanxi University, Taiyuan
来源
Jisuanji Yanjiu yu Fazhan/Computer Research and Development | 2015年 / 52卷 / 01期
关键词
Dependency; Feature selection; Multi-label classification; Neighborhood rough sets;
D O I
10.7544/issn1000-1239.2015.20140544
中图分类号
学科分类号
摘要
Multi-label classification is a kind of complex decision making tasks, where one object may be assigned with more than one decision label. This kind of tasks widely exist in text categorization, image recognition, gene function analysis. Multi-label classification is usually described with high-dimensional vectors, and some of the features are superfluous and irrelevant. A great number of feature selection algorithms have been developed for single-label classification to conquer the curse of dimensionality. However, as to multi-label classification, fewer researches have been reported for designing feature selection algorithms. In this work, we introduce rough sets to multi-label classification for constructing a feature selection algorithm. We redefine the lower approximation and dependency, and discuss the properties of the model. After that, we design a neighborhood rough sets based feature selection algorithm for multi-label classification. Experimental results show the effectiveness of the proposed algorithm. ©, 2015, Science Press. All right reserved.
引用
收藏
页码:56 / 65
页数:9
相关论文
共 20 条
[1]  
Li Y., Huang S., Zhou Z., Regularized semi-supervised multi-label learning, Journal of Computer Research and Development, 49, 6, pp. 1272-1278, (2012)
[2]  
Zheng W., Wang C., Liu Z., Et al., A multi-label classification algorithm based on random walk mode, Chinese Journal of Computers, 33, 8, pp. 1418-1426, (2010)
[3]  
Sun L., Ji S., Ye J., Multi-Label Dimensionality Reduction, pp. 20-22, (2013)
[4]  
Fisher R.A., The use of multiple measurements in taxonomic problems, Annals of Eugenics, 7, 2, pp. 179-188, (1936)
[5]  
Wold H., Estimation of principal components and related models by iterative least squares, Multivariate Analysis, 1, pp. 391-420, (1966)
[6]  
Zhang Y., Zhou Z., Multi-label dimensionality reduction via dependence maximization, ACM Trans on Knowledge Discovery from Data (TKDD), 4, 3, (2010)
[7]  
Zhang M., Pena J.M., Robles V., Feature selection for multi-label naive Bayes classification, Information Sciences, 179, 19, pp. 3218-3229, (2009)
[8]  
Ge L., Li G., You M., Embedded feature selection for multi-label learning, Journal of Nanjing University (Natural Sciences), 45, 5, pp. 671-676, (2009)
[9]  
Zhang Z., Li S., Li Z., Et al., Multi-label feature selection algorithm based on information entropy, Journal of Computer Research and Development, 50, 6, pp. 1177-1184, (2013)
[10]  
Wang G., Rough Set Theory and Knowledge Acquisition, pp. 147-156, (2001)