Using Local Models to Improve (Q) SAR Predictivity

被引:10
作者
Buchwald, Fabian [1 ]
Girschick, Tobias [1 ]
Seeland, Madeleine [1 ]
Kramer, Stefan [1 ]
机构
[1] Tech Univ Munich, Inst Informat I12, D-85748 Garching, Germany
关键词
Cheminformatics; Local models; Machine learning; Structure-activity relationships; Structural clustering; QSAR MODELS; SIMILARITY; REGRESSION; MOLECULES;
D O I
10.1002/minf.201000154
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
We present a novel (Q) SAR approach that detects groups of structures for local (Q) SAR modeling. The algorithm combines clustering and classification or regression for making predictions on chemical structure data. A clustering procedure producing clusters with shared structural scaffolds is applied as a preprocessing step, before a (local) model is learned for each relevant cluster. Instead of using only one global model (classical approach), we use weighted local models for predictions of query compounds de-pendent on cluster memberships. The approach is evaluated and compared against standard statistical (Q) SAR algorithms on various datasets. The results show that in many cases the application of local models significantly improves the predictive power of the derived (Q) SAR models compared to the classical approach, to models that are induced by a fingerprint-based or a hierarchical clustering approach and to locally weighted learning.
引用
收藏
页码:205 / 218
页数:14
相关论文
共 39 条