Prediction of New Bioactive Molecules using a Bayesian Belief Network

被引:20
作者
Abdo, Ammar [1 ,2 ,4 ]
Leclere, Valerie [3 ]
Jacques, Philippe [3 ]
Salim, Naomie [5 ]
Pupin, Maude [1 ,2 ]
机构
[1] Univ Lille 1, LIFL UMR CNRS 8022, F-59655 Villeneuve Dascq, France
[2] INRIA Lille Nord Europe, F-59655 Villeneuve Dascq, France
[3] Univ Lille 1 Sci & Technol, ProBioGEM, UPRES EA 1026, Polytech Lille, F-59655 Villeneuve Dascq, France
[4] Hodeidah Univ, Dept Comp Sci, Hodeidah, Yemen
[5] Univ Telcnol Malaysia, Fac Comp Sci & Informat Syst, Skudai 81310, Malaysia
关键词
QUANTITATIVE STRUCTURE-ACTIVITY; FEATURE-SELECTION; CLASSIFICATION; FINGERPRINT; INHIBITORS;
D O I
10.1021/ci4004909
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Natural products and synthetic compounds are a valuable source of new small molecules leading to novel drugs to cure diseases. However identifying new biologically active small molecules is still a challenge. In this paper, we introduce a new activity prediction approach using Bayesian belief network for classification (BBNC). The roots of the network are the fragments composing a compound. The leaves are, on one side, the activities to predict and, on another side, the unknown compound. The activities are represented by sets of known compounds, and sets of inactive compounds are also used. We calculated a similarity between an unknown compound and each activity class. The more similar activity is assigned to the unknown compound. We applied this new approach on eight well-known data sets extracted from the literature and compared its performance to three classical machine learning algorithms. Experiments showed that BBNC provides interesting prediction rates (from 79% accuracy for high diverse data sets to 99% for low diverse ones) with a short time calculation. Experiments also showed that BBNC is particularly effective for homogeneous data sets but has been found to perform less well with structurally heterogeneous sets. However, it is important to stress that we believe that using several approaches whenever possible for activity prediction can often give a broader understanding of the data than using only one approach alone. Thus, BBNC is a useful addition to the computational chemist's toolbox.
引用
收藏
页码:30 / 36
页数:7
相关论文
共 42 条
[1]   A new fingerprint to predict nonribosomal peptides activity [J].
Abdo, Ammar ;
Caboche, Segolene ;
Leclere, Valerie ;
Jacques, Philippe ;
Pupin, Maude .
JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2012, 26 (10) :1187-1194
[2]   Ligand expansion in ligand-based virtual screening using relevance feedback [J].
Abdo, Ammar ;
Saeed, Faisal ;
Hamza, Hentabli ;
Ahmed, Ali ;
Salim, Naomie .
JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2012, 26 (03) :279-287
[3]   Implementing Relevance Feedback in Ligand-Based Virtual Screening Using Bayesian Inference Network [J].
Abdo, Ammar ;
Salim, Naomie ;
Ahmed, Ali .
JOURNAL OF BIOMOLECULAR SCREENING, 2011, 16 (09) :1081-1088
[4]   New Fragment Weighting Scheme for the Bayesian Inference Network in Ligand-Based Virtual Screening [J].
Abdo, Ammar ;
Salim, Naomie .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2011, 51 (01) :25-32
[5]   Ligand-Based Virtual Screening Using Bayesian Networks [J].
Abdo, Ammar ;
Chen, Beining ;
Mueller, Christoph ;
Salim, Naomie ;
Willett, Peter .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2010, 50 (06) :1012-1020
[6]   Similarity-Based Virtual Screening with a Bayesian Inference Network [J].
Abdo, Ammar ;
Salim, Naomie .
CHEMMEDCHEM, 2009, 4 (02) :210-218
[7]   Designing libraries with CNS activity [J].
Ajay ;
Bemis, GW ;
Murcko, MA .
JOURNAL OF MEDICINAL CHEMISTRY, 1999, 42 (24) :4942-4951
[8]  
[Anonymous], 1990, M 196 1988 LOS ANG C
[9]  
[Anonymous], 2008, PIP PIL
[10]   Property-based design of GPCR-targeted library [J].
Balakin, KV ;
Tkachenko, SE ;
Lang, SA ;
Okun, I ;
Ivashchenko, AA ;
Savchuk, NP .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2002, 42 (06) :1332-1342