COST-SENSITIVE MULTI-VIEW LEARNING MACHINE

Cited by: 0
Authors
Wang, Zhe [1, 2]
Lu, Mingzhe [1]
Niu, Zengxin [1]
Xue, Xiangyang [3]
Gao, Daqi [1]
Affiliations
[1] E China Univ Sci & Technol, Dept Comp Sci & Engn, Shanghai 200237, Peoples R China
[2] Shanghai Key Lab Intelligent Informat Proc, Shanghai 200433, Peoples R China
[3] Fudan Univ, Sch Comp Sci, Shanghai 200433, Peoples R China
Keywords
Multi-view learning; cost sensitivity; view-dependent cost; discriminant scatter; pattern classification; support vector machines; kernel; classification; matrix; design
DOI
10.1142/S0218001414510045
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Multi-view learning aims to learn effectively from data represented by multiple independent sets of attributes, where each set is taken as one view of the original data. In real-world applications, the views are typically acquired at unequal cost. In web-page classification, for example, it is cheaper to obtain the words on the page itself (view one) than the words contained in the anchor texts of inbound hyperlinks (view two). However, almost all existing multi-view learning methods ignore the cost of acquiring the views and the cost of evaluating them. In this paper, we argue that different views adopt different representations and therefore incur different acquisition costs. We thus develop a new view-dependent cost, distinct from the existing class-dependent and example-dependent costs. To this end, we generalize the multi-view learning framework with the cost-sensitive technique and propose a Cost-sensitive Multi-View Learning Machine, CMVLM for short. In the implementation, we measure both the acquisition cost and the discriminant scatter of each view. We then eliminate the useless views using a predefined threshold and train the final classifier on the reserved views. Experimental results on a broad range of data sets, including benchmark UCI, image, and bioinformatics data sets, show that the proposed algorithm effectively reduces the total cost while achieving competitive or even better classification performance. The contributions of this paper are: (1) proposing, for the first time, a view-dependent cost; (2) establishing a cost-sensitive multi-view learning framework; (3) developing a wrapper technique that applies to most multiple-kernel-based classifiers.
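The view-elimination step described in the abstract can be illustrated with a minimal sketch. The abstract does not give the paper's formulas, so everything specific here is an assumption: the discriminant scatter is taken to be a Fisher-style ratio of between-class to within-class scatter, the per-view score is that ratio divided by the acquisition cost, and the names `discriminant_scatter` and `select_views` are hypothetical. It is not the authors' CMVLM implementation.

```python
import numpy as np


def discriminant_scatter(X, y):
    """Fisher-style ratio trace(S_b) / trace(S_w) for one view.

    Assumed measure; the paper's exact definition is not in the abstract.
    """
    overall_mean = X.mean(axis=0)
    between, within = 0.0, 0.0
    for label in np.unique(y):
        Xc = X[y == label]
        class_mean = Xc.mean(axis=0)
        between += len(Xc) * np.sum((class_mean - overall_mean) ** 2)
        within += np.sum((Xc - class_mean) ** 2)
    return between / (within + 1e-12)  # small constant avoids division by zero


def select_views(views, y, costs, threshold):
    """Keep the views whose scatter-per-unit-cost score meets the threshold."""
    scores = [discriminant_scatter(X, y) / c for X, c in zip(views, costs)]
    kept = [i for i, s in enumerate(scores) if s >= threshold]
    return kept, scores


# Toy data: view 0 is cheap and separates the classes, view 1 is costly and does not.
y = np.array([0, 0, 1, 1])
views = [
    np.array([[0.0], [0.2], [1.0], [1.2]]),
    np.array([[0.0], [1.0], [0.0], [1.0]]),
]
costs = [1.0, 5.0]  # hypothetical acquisition costs

kept, scores = select_views(views, y, costs, threshold=1.0)
# Reserved views, concatenated here as a stand-in for a multiple-kernel classifier input.
X_reserved = np.hstack([views[i] for i in kept])
```

In this toy setting only the cheap, discriminative view survives the threshold. A multiple-kernel classifier would instead build one kernel per reserved view rather than concatenating features.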
Pages: 24