Comprehensible Models for Predicting Molecular Interaction with Heart-Regulating Genes

被引:2
作者
Sonstrod, Cecilia [1 ]
Johansson, Ulf [1 ]
Norinder, Ulf [3 ]
Bostrom, Henrik [2 ]
机构
[1] Univ Boras, Sch Business & Informat, Boras, Sweden
[2] Univ Skovde, Sch Humanit & informat, Skovde, Sweden
[3] AstraZeneca R&D, Sodertalje, Sweden
来源
SEVENTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS | 2008年
关键词
D O I
10.1109/ICMLA.2008.130
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
When using machine learning for in silico modeling, the goal is normally to obtain highly accurate predictive models. Often, however, models should also bring insights into interesting relationships in the domain. It is then desirable that machine learning techniques have the ability to obtain small and transparent models, where the user can control the tradeoff between accuracy, comprehensibility and coverage. In this study, three different decision list algorithms are evaluated on a data set concerning the interaction of molecules with a human gene that regulates heart functioning (hERG). The results show that decision list algorithms can obtain predictive performance not far from the state-of-the-art method random forests, but also that algorithms focusing on accuracy alone may produce complex decision lists that are very hard to interpret. The experiments also show that by sacrificing accuracy only to a limited degree, comprehensibility (measured as both model size and classification complexity) can be improved remarkably.
引用
收藏
页码:559 / +
页数:3
相关论文
共 33 条
[1]  
ASUNCION CJ, 2007, UCI MACHINE LEARNING
[2]   An empirical comparison of voting classification algorithms: Bagging, boosting, and variants [J].
Bauer, E ;
Kohavi, R .
MACHINE LEARNING, 1999, 36 (1-2) :105-139
[3]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[4]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[5]  
Bridgland-Taylor M. H., 2006, Journal of Pharmacological and Toxicological Methods, V54, P189, DOI 10.1016/j.vascn.2006.02.003
[6]   Drugs, hERG and sudden death [J].
Brown, AM .
CELL CALCIUM, 2004, 35 (06) :543-547
[7]  
Clark P., 1989, Machine Learning, V3, P261, DOI 10.1023/A:1022641700528
[8]  
Cohen W. W., 1995, P 12 INT C MACH LEAR, P115, DOI DOI 10.1016/B978-1-55860-377-6.50023-2
[9]   Prediction of hERG potassium channel affinity by the CODESSA approach [J].
Coi, A ;
Massarelli, I ;
Murgia, L ;
Saraceno, M ;
Calderone, V ;
Bianucci, AM .
BIOORGANIC & MEDICINAL CHEMISTRY, 2006, 14 (09) :3153-3159
[10]  
*COMP AB, RUL DISC SYST V 2 6