Combination of molecular similarity measures using data fusion

被引:106
作者
Ginn, CMR
Willett, P
Bradshaw, J
机构
[1] Univ Sheffield, Sheffield S10 2TN, S Yorkshire, England
[2] Glaxo Wellcome Res & Dev Ltd, Stevenage SG1 2NY, Herts, England
关键词
data fusion; database searching; molecular similarity; similarity measure;
D O I
10.1023/A:1008752200506
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Many different measures of structural similarity have been suggested for matching chemical structures, each such measure focusing upon some particular type of molecular characteristic. The multi-faceted nature of biological activity suggests that an appropriate similarity measure should encompass many different types of characteristic, and this article discusses the use of data fusion methods to combine the results of searches based on multiple similarity measures. Experiments with several different types of dataset and activity suggest that data fusion provides a simple, but effective, approach to the combination of individual similarity measures. The best results were generally obtained with a fusion rule that sums the rank positions achieved by each molecule in searches using individual measures.
引用
收藏
页码:1 / 16
页数:16
相关论文
共 28 条
[1]  
ARABNIA HR, 1998, P INT C MULT MULT IN
[2]   SIMILARITY SEARCHING IN FILES OF 3-DIMENSIONAL CHEMICAL STRUCTURES - COMPARISON OF FRAGMENT-BASED MEASURES OF SHAPE SIMILARITY [J].
BATH, PA ;
POIRRETTE, AR ;
WILLETT, P ;
ALLEN, FH .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1994, 34 (01) :141-147
[3]   COMBINING THE EVIDENCE OF MULTIPLE QUERY REPRESENTATIONS FOR INFORMATION-RETRIEVAL [J].
BELKIN, NJ ;
KANTOR, P ;
FOX, EA ;
SHAW, JA .
INFORMATION PROCESSING & MANAGEMENT, 1995, 31 (03) :431-448
[4]  
CLERC T, 1973, TOP CURR CHEM, V39, P91
[5]  
DEAN PM, 1975, MOL SIMILARITY DRUG
[6]  
DEAN PM, 1998, DESIGNING BIOACTIVE, P199
[7]  
DOWNS GM, 1995, REV COMP CH, V7, P1
[8]  
DRAYTON SK, INTERNET J CHEM
[9]   Similarity searching in files of three-dimensional chemical structures: Evaluation of the EVA descriptor and combination of rankings using data fusion [J].
Ginn, CMR ;
Turner, DB ;
Willett, P ;
Ferguson, AM ;
Heritage, TW .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1997, 37 (01) :23-37
[10]  
GINN CMR, 1998, THESIS U SHEFFIELD