Toward automated biochemotype annotation for large compound libraries

被引:3
作者
Chen, Xian
Liang, Yizeng
Xu, Jun
机构
[1] Cent S Univ, Dept Chem, Res Ctr Modernizat Chinese Trad Med, Changsha 410083, Peoples R China
[2] Discovery Partners Int Inc, San Diego, CA 92121 USA
关键词
bioactivity prediction; biochemome; biochemotype annotation; virtual library profiling;
D O I
10.1007/s11030-006-9047-z
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Combinatorial chemistry allows scientists to probe large synthetically accessible chemical space. However, identifying the sub-space which is selectively associated with an interested biological target, is crucial to drug discovery and life sciences. This paper describes a process to automatically annotate biochemotypes of compounds in a library and thus to identify bioactiviiy related chemotypes (biochemotypes) from a large library of compounds. The process consists of two steps: (1) predicting all possible bioactivities for each compound in a library, and (2) deriving possible biochemotypes based on predictions. The Prediction of Activity Spectra for Substances program (PASS) was used in the first step. In second step, structural similarity and scaffold-hopping technologies are employed. These technologies are used to derive biochemotypes from bioactivity predictions and the corresponding annotated biochemotypes from MDL Drug Data Report (MDDR) database. About a one million (982,889) commercially available compound library (CACL) has been tested using this process. This paper demonstrates the feasibility of automatically annotating biochemotypes for large libraries of compounds. Nevertheless, some issues need to be considered in order to improve the process. First, the prediction accuracy of PASS program has no significant correlation with the number of compounds in a training set. Larger training sets do not necessarily increase the maximal error of prediction (MEP), nor do they increase the hit structural diversity. Smaller training sets do not necessarily decrease MEP, nor do they decrease the hit structural diversity. Second, the success of systematic bioactivity prediction relies on modeling, training data, and the definition of bioactivities (biochemotype ontology). Unfortunately, the biochemotype ontology was not well developed in the PASS program. Consequently, "ill-defined" bioactivities can reduce the quality of predictions. This paper suggests the ways in which the systematic bioactivities prediction program should be improved.
引用
收藏
页码:495 / 509
页数:15
相关论文
共 19 条
[1]   Chemogenomics: Bridging a drug discovery gap [J].
Bleicher, KH .
CURRENT MEDICINAL CHEMISTRY, 2002, 9 (23) :2077-2084
[2]   Identification of two rat genes orthologous to the human interleukin-8 receptors [J].
Dunstan, CAN ;
Salafranca, MN ;
Adhikari, S ;
Xia, YY ;
Feng, LL ;
Harrison, JK .
JOURNAL OF BIOLOGICAL CHEMISTRY, 1996, 271 (51) :32770-32776
[3]   Computer aided prediction of biological activity spectra: Evaluating versus known and predicting of new activities for thiazole derivatives [J].
Geronikaki, A ;
Lagunin, A ;
Poroikov, V ;
Filimonov, D ;
Hadjipavlou-Litina, D ;
Vicini, P .
SAR AND QSAR IN ENVIRONMENTAL RESEARCH, 2002, 13 (3-4) :457-471
[4]   LIGAND: database of chemical compounds and reactions in biological pathways [J].
Goto, S ;
Okuno, Y ;
Hattori, M ;
Nishioka, T ;
Kanehisa, M .
NUCLEIC ACIDS RESEARCH, 2002, 30 (01) :402-404
[5]   CLONING OF THE CDNA AND GENE FOR A HUMAN D2-DOPAMINE RECEPTOR [J].
GRANDY, DK ;
MARCHIONNI, MA ;
MAKAM, H ;
STOFKO, RE ;
ALFANO, M ;
FROTHINGHAM, L ;
FISCHER, JB ;
BURKEHOWIE, KJ ;
BUNZOW, JR ;
SERVER, AC ;
CIVELLI, O .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1989, 86 (24) :9762-9766
[6]   Neighborhood behavior. Fuzzy molecular descriptors and their influence on the relationship between structural similarity and property similarity [J].
Horvath, D ;
Mao, B .
QSAR & COMBINATORIAL SCIENCE, 2003, 22 (05) :498-509
[7]   Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings [J].
Lipinski, CA ;
Lombardo, F ;
Dominy, BW ;
Feeney, PJ .
ADVANCED DRUG DELIVERY REVIEWS, 1997, 23 (1-3) :3-25
[8]   Towards a pharmacological genetics [J].
Mitchison, Tim J. .
CHEMISTRY & BIOLOGY, 1994, 1 (01) :3-6
[9]  
Owens J, 2004, DRUG DISCOV TODAY, V9, P299
[10]   PASS biological activity spectrum predictions in the Enhanced open NCI Database Browser [J].
Poroikov, VV ;
Filimonov, DA ;
Ihlenfeldt, WD ;
Gloriozova, TA ;
Lagunin, AA ;
Borodina, YV ;
Stepanchikova, AV ;
Nicklaus, MC .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2003, 43 (01) :228-236