Reducing the Concepts of Data Science and Machine Learning to Tools for the Bench Chemist

被引:1
作者
Lewis, Richard A. [1 ]
Ertl, Peter [1 ]
Schneider, Nadine [1 ]
Stiefl, Nikolaus [1 ]
机构
[1] Novartis Inst BioMed Res, Comp Aided Drug Design, Global Discovery Chem, CH-4056 Basel, Switzerland
关键词
Data science; Machine learning;
D O I
10.2533/chimia.2019.1001
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Machine Learning and Data Science have enjoyed a renaissance due to the availability of increased computational power and larger data sets. Many questions can be now asked and answered, that previously were beyond our scope. This does not translate instantly into new tools that can be used by those not skilled in the field, as many of the issues and traps still exist. In this paper, we look at some of the new tools that we have created, and some of the difficulties that still need to be taken care of during the transition from a project run by an expert, to a tool for the bench chemist.
引用
收藏
页码:1001 / 1005
页数:5
相关论文
共 20 条
[1]   The ChEMBL bioactivity database: an update [J].
Bento, A. Patricia ;
Gaulton, Anna ;
Hersey, Anne ;
Bellis, Louisa J. ;
Chambers, Jon ;
Davies, Mark ;
Krueger, Felix A. ;
Light, Yvonne ;
Mak, Lora ;
McGlinchey, Shaun ;
Nowotka, Michal ;
Papadatos, George ;
Santos, Rita ;
Overington, John P. .
NUCLEIC ACIDS RESEARCH, 2014, 42 (D1) :D1083-D1090
[2]   Lessons learned from the fate of AstraZeneca's drug pipeline: a five-dimensional framework [J].
Cook, David ;
Brown, Dearg ;
Alexander, Robert ;
March, Ruth ;
Morgan, Paul ;
Satterthwaite, Gemma ;
Pangalos, Menelas N. .
NATURE REVIEWS DRUG DISCOVERY, 2014, 13 (06) :419-431
[3]   How not to develop a quantitative structure-activity or structure-property relationship (QSAR/QSPR) [J].
Dearden, J. C. ;
Cronin, M. T. D. ;
Kaiser, K. L. E. .
SAR AND QSAR IN ENVIRONMENTAL RESEARCH, 2009, 20 (3-4) :241-266
[4]   50 Years of Data Science [J].
Donoho, David .
JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2017, 26 (04) :745-766
[5]  
Ertl P., 2017, ARXIV171707449
[6]  
Ertl P., 2004, DRUG DISCOV TODAY, V2, P201
[7]   Estimation of synthetic accessibility score of drug-like molecules based on molecular complexity and fragment contributions [J].
Ertl, Peter ;
Schuffenhauer, Ansgar .
JOURNAL OF CHEMINFORMATICS, 2009, 1
[8]   The Experimental Uncertainty of Heterogeneous Public Ki Data [J].
Kramer, Christian ;
Kalliokoski, Tuomo ;
Gedeck, Peter ;
Vulpetti, Anna .
JOURNAL OF MEDICINAL CHEMISTRY, 2012, 55 (11) :5165-5173
[9]   Computational chemistry at novartis [J].
Lewis, R ;
Ertl, P ;
Jacoby, E ;
Tintelnot-Blomley, M ;
Gedeck, P ;
Wolf, RM ;
Peitsch, MC .
CHIMIA, 2005, 59 (7-8) :545-549
[10]   A general method for exploiting QSAR models in lead optimization [J].
Lewis, RA .
JOURNAL OF MEDICINAL CHEMISTRY, 2005, 48 (05) :1638-1648