Learning similarity with cosine similarity ensemble

被引:298
作者
Xia, Peipei [1 ,2 ]
Zhang, Li [1 ,2 ,3 ]
Li, Fanzhang [1 ,2 ]
机构
[1] Soochow Univ, Sch Comp Sci & Technol, Suzhou 215006, Jiangsu, Peoples R China
[2] Soochow Univ, Prov Key Lab Comp Informat Proc Technol, Suzhou 215006, Jiangsu, Peoples R China
[3] Collaborat Innovat Ctr Novel Software Technol & I, Nanjing 210000, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
Similarity learning; Cosine similarity; Ensemble learning; Selective ensemble; Machine learning; NEURAL-NETWORKS;
D O I
10.1016/j.ins.2015.02.024
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
There is no doubt that similarity is a fundamental notion in the field of machine learning and pattern recognition. How to represent and measure similarity appropriately is a pursuit of many researchers. Many tasks, such as classification and clustering, can be accomplished perfectly when a similarity metric is well-defined. Cosine similarity is a widely used metric that is both simple and effective. This paper proposes a cosine similarity ensemble (CSE) method for learning similarity. In CSE, diversity is guaranteed by using multiple cosine similarity learners, each of which makes use of a different initial point to define the pattern vectors used in its similarity measures. The CSE method is not limited to measuring similarity using only pattern vectors that start at the origin. In addition, the thresholds of these separate cosine similarity learners are adaptively determined. The idea of using a selective ensemble is also implemented in CSE, and the proposed CSE method outperforms other compared methods on various data sets. (C) 2015 Elsevier Inc. All rights reserved.
引用
收藏
页码:39 / 52
页数:14
相关论文
共 35 条
[21]  
Maurer A, 2008, J MACH LEARN RES, V9, P1049
[22]  
Melacci S., 2008, ARTIFICIAL NEURAL NE
[23]  
Nene S. A., 1996, Tech. Rep. CUCS-005-96
[24]  
Nguyen H.V., 2011, COMP VIS ACCV 2010
[25]  
Phillips PJ, 1999, ADV NEUR IN, V11, P803
[26]  
Samaria F. S., 1994, Proceedings of the Second IEEE Workshop on Applications of Computer Vision (Cat. No.94TH06742), P138, DOI 10.1109/ACV.1994.341300
[27]  
Sarwar B., 2001, P 10 INT C WORLD WID, P285, DOI DOI 10.1145/371920.372071
[28]   Learning Similarity With Multikernel Method [J].
Tang, Yi ;
Li, Luoqing ;
Li, Xuelong .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2011, 41 (01) :131-138
[29]   Optimal linear combination of neural networks for improving classification performance [J].
Ueda, N .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2000, 22 (02) :207-215
[30]   On the Euclidean distance of images [J].
Wang, LW ;
Zhang, Y ;
Feng, JF .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2005, 27 (08) :1334-1339