A new study on distance metrics as similarity measurement

被引:14
作者
Yu, Jie [1 ]
Amores, Jaume
Sebe, Nicu
Tian, Qi
机构
[1] Univ Texas, Dept Comp Sci, San Antonio, TX 78285 USA
[2] INRIA, IMEDIA Res Grp, Rocqncourt, France
[3] Univ Amsterdam, Fac Sci, NL-1012 WX Amsterdam, Netherlands
来源
2006 IEEE International Conference on Multimedia and Expo - ICME 2006, Vols 1-5, Proceedings | 2006年
关键词
D O I
10.1109/ICME.2006.262443
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Distance metric is widely used in similarity estimation. In this paper we find that the most popular Euclidean and Manhattan distance may not be suitable for all data distributions. A general guideline to establish the relation between a distribution model and its corresponding similarity estimation is proposed. Based on Maximum Likelihood theory, we propose new distance metrics, such as harmonic distance and geometric distance. Because the feature elements may be from heterogeneous sources and usually have different influence on similarity estimation, it is inappropriate to model the distribution as isotropic. We propose a novel boosted distance metric that not only finds the best distance metric that fits the distribution of the underlying elements but also selects the most important feature elements with respect to similarity. The boosted distance metric is tested on fifteen benchmark data sets from the UCI repository and two image retrieval applications. In all the experiments, robust results are obtained based on the proposed methods.
引用
收藏
页码:533 / 536
页数:4
相关论文
共 8 条
[1]  
AMORES J, 2006, PATTERN RECOGNITION
[2]  
FUKUNAGA K, 1983, IEEE T PAMI, V5
[3]   Improved boosting algorithms using confidence-rated predictions [J].
Schapire, RE ;
Singer, Y .
MACHINE LEARNING, 1999, 37 (03) :297-336
[4]   Toward improved ranking metrics [J].
Sebe, N ;
Lew, MS ;
Huijsmans, DP .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2000, 22 (10) :1132-1143
[5]  
SMITH J, 1994, IEEE INT C IM P
[6]  
TANG L, 1994, P NSF ARPA WORKSH PE
[7]  
Zakai M., 1964, IEEE T INFORM TH JAN, P94
[8]  
ZHOU S, 1999, IEEE INT C IM P