Dissimilarity learning for nominal data

被引:37
作者
Cheng, V
Li, CH [1 ]
Kwok, JT
Li, CK
机构
[1] Hong Kong Baptist Univ, Dept Comp Sci, Kowloon, Hong Kong, Peoples R China
[2] Hong Kong Univ Sci & Technol, Dept Comp Sci, Hong Kong, Hong Kong, Peoples R China
[3] Hong Kong Polytech Univ, Dept Elect & Informat Engn, Kowloon, Hong Kong, Peoples R China
关键词
nominal attributes; pattern classification; dissimilarities; distance measure; classifiers;
D O I
10.1016/j.patcog.2003.12.015
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Defining a good distance (dissimilarity) measure between patterns is of crucial importance in many classification and clustering algorithms. While a lot of work has been performed on continuous attributes, nominal attributes are more difficult to handle. A popular approach is to use the value difference metric (VDM) to define a real-valued distance measure on nominal values. However, VDM treats the attributes separately and ignores any possible interactions among attributes. In this paper, we propose the use of adaptive dissimilarity matrices for measuring the dissimilarities between nominal values. These matrices are learned via optimizing an error function on the training samples. Experimental results show that this approach leads to better classification performance. Moreover, it also allows easier interpretation of (dis)similarity between different nominal values. (C) 2004 Pattern Recognition Society. Published by Elsevier Ltd. All rights reserved.
引用
收藏
页码:1471 / 1477
页数:7
相关论文
共 19 条
[1]  
[Anonymous], 1992, The Tenth National Conference on Artificial Intelligence
[2]  
[Anonymous], 1961, Adaptive Control Processes: a Guided Tour, DOI DOI 10.1515/9781400874668
[3]  
Blake C.L., 1998, UCI repository of machine learning databases
[4]   A WEIGHTED NEAREST NEIGHBOR ALGORITHM FOR LEARNING WITH SYMBOLIC FEATURES [J].
COST, S ;
SALZBERG, S .
MACHINE LEARNING, 1993, 10 (01) :57-78
[5]   TRADING MIPS AND MEMORY FOR KNOWLEDGE ENGINEERING [J].
CREECY, RH ;
MASAND, BM ;
SMITH, SJ ;
WALTZ, DL .
COMMUNICATIONS OF THE ACM, 1992, 35 (08) :48-64
[6]  
CRISTIANINI N, 2002, ADV NEURAL INFORMATI, V14
[7]  
Esposito F, 2000, ST CLASS DAT ANAL, P139
[8]   METRIC AND EUCLIDEAN PROPERTIES OF DISSIMILARITY COEFFICIENTS [J].
GOWER, JC ;
LEGENDRE, P .
JOURNAL OF CLASSIFICATION, 1986, 3 (01) :5-48
[9]  
Hart, 2006, PATTERN CLASSIFICATI
[10]   Discriminant adaptive nearest neighbor classification [J].
Hastie, T ;
Tibshirani, R .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1996, 18 (06) :607-616