Combining Multiple K-Means Clusterings of Chemical Structures Using Cluster-Based Similarity Partitioning Algorithm

被引:0
作者
Saeedi, Faisal [1 ,2 ]
Salim, Naomie [1 ]
Abdo, Ammar [3 ]
Hentabli, Hamza [1 ]
机构
[1] Univ Teknol Malaysia, Fac Comp Sci & Informat Syst, Johor Baharu, Johor, Malaysia
[2] Dept Informat Technol, Sanhan Community Coll, Sanaa, Yemen
[3] Hodeidah Univ, Dept Comp Sci, Hodeidah, Yemen
来源
ADVANCED MACHINE LEARNING TECHNOLOGIES AND APPLICATIONS | 2012年 / 322卷
关键词
2D Fingerprint; Compound Selection; Consensus Clustering; K-Means; Molecular Datasets; Ward's Method; DATA FUSION; COMBINATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Consensus clustering methods have been used in many areas to improve the quality of individual clusterings. In this paper, graph-based consensus clustering, Cluster-based Similarity Partitioning Algorithm (CSPA), was used to improve the quality of chemical structures clustering by enhancing the ability to separate active from inactive molecules in each cluster and improve the robustness and stability of individual clusterings. The clustering was evaluated using Quality Partition Index (QPI) measure and the results were compared with the Ward's clustering method. The chemical dataset MDL Drug Data Report (MDDR) database was used for experiments. The results obtained by combining multiple K-means clusterings showed that graph-based consensus clustering, CSPA, can improve the quality of individual chemical structure clusterings.
引用
收藏
页码:304 / +
页数:3
相关论文
共 17 条
[1]   Ligand expansion in ligand-based virtual screening using relevance feedback [J].
Abdo, Ammar ;
Saeed, Faisal ;
Hamza, Hentabli ;
Ahmed, Ali ;
Salim, Naomie .
JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2012, 26 (03) :279-287
[2]   New Fragment Weighting Scheme for the Bayesian Inference Network in Ligand-Based Virtual Screening [J].
Abdo, Ammar ;
Salim, Naomie .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2011, 51 (01) :25-32
[3]   Ligand-Based Virtual Screening Using Bayesian Networks [J].
Abdo, Ammar ;
Chen, Beining ;
Mueller, Christoph ;
Salim, Naomie ;
Willett, Peter .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2010, 50 (06) :1012-1020
[4]   METHOD FOR AUTOMATIC CLASSIFICATION OF CHEMICAL STRUCTURES [J].
ADAMSON, GW ;
BUSH, JA .
INFORMATION STORAGE AND RETRIEVAL, 1973, 9 (10) :561-568
[5]   CLUSTERING OF CHEMICAL STRUCTURES ON THE BASIS OF 2-DIMENSIONAL SIMILARITY MEASURES [J].
BARNARD, JM ;
DOWNS, GM .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1992, 32 (06) :644-649
[6]   Use of structure Activity data to compare structure-based clustering methods and descriptors for use in compound selection [J].
Brown, RD ;
Martin, YC .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1996, 36 (03) :572-584
[7]   Combination Rules for Group Fusion in Similarity-Based Virtual Screening [J].
Chen, Beining ;
Mueller, Christoph ;
Willett, Peter .
MOLECULAR INFORMATICS, 2010, 29 (6-7) :533-541
[8]  
Chu C.-W., 2012, BIOORGANIC MED CHEM
[9]   Combining multiple clusterings using evidence accumulation [J].
Fred, ALN ;
Jain, AK .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2005, 27 (06) :835-850
[10]   A fast and high quality multilevel scheme for partitioning irregular graphs [J].
Karypis, G ;
Kumar, V .
SIAM JOURNAL ON SCIENTIFIC COMPUTING, 1998, 20 (01) :359-392