QSAR workbench: automating QSAR modeling to drive compound design

Cited by: 31
Authors
Cox, Richard [1]
Green, Darren V. S. [2]
Luscombe, Christopher N. [2]
Malcolm, Noj [1]
Pickett, Stephen D. [2]
Affiliations
[1] Accelrys Ltd, Cambridge CB4 0WN, England
[2] GlaxoSmithKline Med Res Ctr, Stevenage SG1 2NY, Herts, England
Keywords
QSAR; Workflow; Pipeline Pilot; EVOLVING INTERPRETABLE STRUCTURE; MULTIOBJECTIVE OPTIMIZATION; ELECTROTOPOLOGICAL STATE; VALIDATION; APPLICABILITY; PREDICTIVITY; DOMAIN
DOI
10.1007/s10822-013-9648-4
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification codes
071010; 081704
Abstract
We describe the QSAR Workbench, a system for building and analyzing QSAR models. The system is built around the Pipeline Pilot workflow tool and provides access to a variety of model-building algorithms for both continuous and categorical data. Traditionally, models are built one at a time, and fully exploring the model space of algorithms and descriptor subsets is a time-consuming task. The QSAR Workbench provides a framework that allows multiple models to be built over a number of modeling algorithms, descriptor combinations and data splits (training and test sets). Methods to analyze and compare models are provided, enabling the user to select the most appropriate model. The Workbench provides a consistent set of routines for data preparation and chemistry normalization that are also applied when making predictions. The Workbench provides a large degree of automation, with the ability to publish preconfigured model-building workflows for a variety of problem domains, whilst giving experienced users full access to the underlying parameterization if required. Methods are provided for publishing selected models as web services, thus providing integration with the chemistry desktop. We describe the design and implementation of the QSAR Workbench and demonstrate its utility through application to two public domain datasets.
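The idea of building many models over algorithms, descriptor combinations and data splits, then comparing them, can be illustrated with a minimal sketch. This is not the QSAR Workbench or any Pipeline Pilot component: the two toy learners (`fit_mean`, `fit_1nn`), the descriptor subsets, the synthetic data and the use of test-set RMSE for ranking are all assumptions made purely for illustration.

```python
import itertools
import math
import random

# Hypothetical toy "algorithms": each fits on training data and returns a predictor.
def fit_mean(X, y):
    mean = sum(y) / len(y)
    return lambda x: mean

def fit_1nn(X, y):
    def predict(x):
        # Index of the nearest training row by squared Euclidean distance.
        i = min(range(len(X)),
                key=lambda j: sum((a - b) ** 2 for a, b in zip(X[j], x)))
        return y[i]
    return predict

ALGORITHMS = {"mean": fit_mean, "1nn": fit_1nn}
# Hypothetical descriptor subsets, given as column indices.
DESCRIPTOR_SETS = {"d0": (0,), "d1": (1,), "d0+d1": (0, 1)}

def select(rows, cols):
    """Keep only the chosen descriptor columns."""
    return [tuple(r[c] for c in cols) for r in rows]

def rmse(pred, true):
    return math.sqrt(sum((p - t) ** 2 for p, t in zip(pred, true)) / len(true))

def build_grid(X, y, seeds=(0, 1, 2), test_fraction=0.25):
    """Fit one model per (algorithm, descriptor set, split) and rank by test RMSE."""
    results = []
    for (alg, fit), (dname, cols), seed in itertools.product(
            ALGORITHMS.items(), DESCRIPTOR_SETS.items(), seeds):
        idx = list(range(len(X)))
        random.Random(seed).shuffle(idx)      # each seed defines one train/test split
        n_test = max(1, int(len(idx) * test_fraction))
        test, train = idx[:n_test], idx[n_test:]
        Xs = select(X, cols)
        model = fit([Xs[i] for i in train], [y[i] for i in train])
        score = rmse([model(Xs[i]) for i in test], [y[i] for i in test])
        results.append({"algorithm": alg, "descriptors": dname,
                        "seed": seed, "rmse": score})
    return sorted(results, key=lambda r: r["rmse"])

# Synthetic data (an assumption): the response depends only on descriptor 0.
rng = random.Random(42)
X = [(rng.random(), rng.random()) for _ in range(40)]
y = [3.0 * a for a, _ in X]
results = build_grid(X, y)
for row in results[:3]:
    print(row)
```

Each row of `results` records one model, so the table can be filtered or grouped (for example, by algorithm across splits) before a model is chosen.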
Pages: 321-336
Page count: 16
Related papers
38 in total
[1]   The CAESAR project for in silico models for the REACH legislation [J].
Benfenati, Emilio .
CHEMISTRY CENTRAL JOURNAL, 2010, 4
[2]  
Bi J., 2003, P 20 INT C MACH LEAR, P43
[3]   Evolving interpretable structure-activity relationships. 1. Reduced graph queries [J].
Birchall, Kristian ;
Gillet, Valerie J. ;
Harper, Gavin ;
Pickett, Stephen D. .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2008, 48 (08) :1543-1557
[4]   Evolving interpretable structure-activity relationship models. 2. Using multiobjective optimization to derive multiple models [J].
Birchall, Kristian ;
Gillet, Valerie J. ;
Harper, Gavin ;
Pickett, Stephen D. .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2008, 48 (08) :1558-1570
[5]   SmcHD1, containing a structural-maintenance-of-chromosomes hinge domain, has a critical role in X inactivation [J].
Blewitt, Marnie E. ;
Gendrel, Anne-Valerie ;
Pang, Zhenyi ;
Sparrow, Duncan B. ;
Whitelaw, Nadia ;
Craig, Jeffrey M. ;
Apedaile, Anwyn ;
Hilton, Douglas J. ;
Dunwoodie, Sally L. ;
Brockdorff, Neil ;
Kay, Graham F. ;
Whitelaw, Emma .
NATURE GENETICS, 2008, 40 (05) :663-669
[6]   Receiver operating characteristics curves and related decision measures: A tutorial [J].
Brown, CD ;
Davis, HT .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2006, 80 (01) :24-38
[7]  
Brown R., QUANTIFYING MODEL ER
[8]   Contemporary QSAR classifiers compared [J].
Bruce, Craig L. ;
Melville, James L. ;
Pickett, Stephen D. ;
Hirst, Jonathan D. .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2007, 47 (01) :219-227
[9]   Automated QSPR through competitive workflow [J].
Cartmell, J ;
Enoch, S ;
Krstajic, D ;
Leahy, DE .
JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2005, 19 (11) :821-833
[10]   Global QSAR models of skin sensitisers for regulatory purposes [J].
Chaudhry, Qasim ;
Piclin, Nadege ;
Cotterill, Jane ;
Pintore, Marco ;
Price, Nick R. ;
Chretien, Jacques R. ;
Roncaglioni, Alessandra .
CHEMISTRY CENTRAL JOURNAL, 2010, 4