Generalized vs set median strings for histogram-based distances: Algorithms and classification results in the image domain

被引:0
作者
Solnon, Christine [1 ]
Jolion, Jean-Michel [1 ]
机构
[1] Univ Lyon 1, CNRS, INSA Lyon Nautibus, LIRIS,UMR 5205, 43 Bd 11 Novembre, F-69622 Villeurbanne, France
来源
GRAPH-BASED REPRESENTATIONS IN PATTERN RECOGNITION, PROCEEDINGS | 2007年 / 4538卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We compare different statistical characterizations of a set of strings, for three different histogram-based distances. Given a distance, a set of strings may be characterized by its generalized median, i.e., the string -over the set of all possible strings- that minimizes the sum of distances to every string of the set, or by its set median, i.e., the string of the set that minimizes the sum of distances to every other string of the set. For the first two histogram-based distances, we show that the generalized median string can be computed efficiently; for the third one, which biased histograms with individual substitution costs, we conjecture that this is a NP-hard problem, and we introduce two different heuristic algorithms for approximating it. We experimentally compare the relevance of the three histogram-based distances, and the different statistical characterizations of sets of strings, for classifying images that are represented by strings.
引用
收藏
页码:404 / +
页数:2
相关论文
共 10 条
[1]  
Cormen T. H., 1990, INTRO ALGORITHMS
[2]   Topology of strings: Median string is NP-complete [J].
de la Higuera, C ;
Casacuberta, F .
THEORETICAL COMPUTER SCIENCE, 2000, 230 (1-2) :39-48
[3]  
Jiang X., 2004, DATA MINING TIME SER, P173, DOI 10.1142/9789812565402_0008
[4]   STRING-TO-STRING CORRECTION PROBLEM [J].
WAGNER, RA ;
FISCHER, MJ .
JOURNAL OF THE ACM, 1974, 21 (01) :168-173
[5]   SIMPLIcity: Semantics-sensitive integrated matching for picture libraries [J].
Wang, JZ ;
Li, J ;
Wiederhold, G .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2001, 23 (09) :947-963
[6]  
[No title captured]
[7]  
[No title captured]
[8]  
[No title captured]
[9]  
[No title captured]
[10]  
[No title captured]