Multilabel neural networks with applications to functional genomics and text categorization

被引:893
作者
Zhang, Min-Ling [1 ]
Zhou, Zhi-Hua [1 ]
机构
[1] Nanjing Univ, Natl Lab Novel Software Technol, Nanjing 210093, Peoples R China
基金
中国国家自然科学基金;
关键词
machine learning; data mining; multilabel learning; neural networks; backpropagation; functional genomics; text categorization;
D O I
10.1109/TKDE.2006.162
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In multilabel learning, each instance in the training set is associated with a set of labels and the task is to output a label set whose size is unknown a priori for each unseen instance. In this paper, this problem is addressed in the way that a neural network algorithm named BP-MLL, i.e., Backpropagation for Multilabel Learning, is proposed. It is derived from the popular Backpropogation algorithm through employing a novel error function capturing the characteristics of multilabel learning, i.e., the labels belonging to an instance should be ranked higher than those not belonging to that instance. Applications to two real-world multilabel learning problems, i.e., functional genomics and text categorization, show that the performance of BP-MLL is superior to that of some well-established multilabel learning algorithms.
引用
收藏
页码:1338 / 1351
页数:14
相关论文
共 34 条
[1]  
[Anonymous], P 26 ANN INT ACM SIG
[2]  
[Anonymous], 1997, Proceedings of the fourteenth international conference on machine learning, DOI DOI 10.1016/J.ESWA.2008.05.026
[3]  
[Anonymous], 2004, P 21 INT C MACH LEAR
[4]  
[Anonymous], P WORK NOT AM ASS AR
[5]  
Bishop C. M., 1996, Neural networks for pattern recognition
[6]   Learning multi-label scene classification [J].
Boutell, MR ;
Luo, JB ;
Shen, XP ;
Brown, CM .
PATTERN RECOGNITION, 2004, 37 (09) :1757-1771
[7]  
Clare Amanda, 2003, Machine learning and data mining for yeast functional genomics
[8]  
Clare R. D., 2001, Lecture Notes in ComputerScience, V2168, P42, DOI [DOI 10.1007/3-540-44794-6_4, 10.1007/3-540-44794-64.11.W., DOI 10.1007/3-540-44794-64.11.W]
[9]  
De Comité F, 2003, LECT NOTES ARTIF INT, V2734, P35
[10]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38