Active learning strategies with COMBINE analysis: new tricks for an old dog

被引:6
|
作者
Fusani, Lucia [1 ]
Cortes Cabrera, Alvaro [2 ]
机构
[1] Mol Design UK GSK Med Res Ctr, Gunnels Wood Rd, Stevenage SG1 2NY, Herts, England
[2] Galchimia SA, Data Sci & Computat Chem, Severo Ochoa 2, Tres Cantos 28760, Spain
关键词
COMBINE; QSAR; HIV; Taxanes; Protease; BRD4; Active learning; Regression; HIV-1 PROTEASE INHIBITORS; RANDOM FOREST; BINDING; MICROTUBULES; EPOTHILONES; PREDICTION; TOOL; SET;
D O I
10.1007/s10822-018-0181-3
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The COMBINE method was designed to study congeneric series of compounds including structural information of ligand-protein complexes. Although very successful, the method has not received the same level of attention than other alternatives to study Quantitative Structure Active Relationships (QSAR) mainly because lack of ways to measure the uncertainty of the predictions and the need for large datasets. Active learning, a semi-supervised learning approach that makes use of uncertainty to enhance models' performance while reducing the size of the training sets, has been used in this work to address both problems. We propose two estimators of uncertainty: the pool of regressors and the distance to the training set. The performance of the methods has been evaluated by testing the resulting active learning workflows in 3 diverse datasets: HIV-1 protease inhibitors, Taxol-derivatives and BRD4 inhibitors. The proposed strategies were successful in 80% of the cases for the taxol-derivatives and BRD4 inhibitors, while outperformed random selection in the case of the HIV-1 protease inhibitors time-split. Our results suggest that AL-COMBINE might be an effective way of producing consistently superior QSAR models with a limited number of samples.
引用
收藏
页码:287 / 294
页数:8
相关论文
共 50 条
  • [1] Active learning strategies with COMBINE analysis: new tricks for an old dog
    Lucia Fusani
    Alvaro Cortes Cabrera
    Journal of Computer-Aided Molecular Design, 2019, 33 : 287 - 294
  • [2] Help Me Learn! Architecture and Strategies to Combine Recommendations and Active Learning in Manufacturing
    Zajec, Patrik
    Rozanec, Joze M.
    Trajkova, Elena
    Novalija, Inna
    Kenda, Klemen
    Fortuna, Blaz
    Mladenic, Dunja
    INFORMATION, 2021, 12 (11)
  • [3] Citrullination: new tricks for an old mod
    Christophorou, Maria A.
    Sharma, Priyanka
    Zhang, Xuesen
    PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2023, 378 (1890)
  • [4] Old dog, new tricks? Exploring the potential functionalities of ChatGPT in supporting educational methods in social psychiatry
    Smith, Alexander
    Hachen, Stefanie
    Schleifer, Roman
    Bhugra, Dinesh
    Buadze, Anna
    Liebrenz, Michael
    INTERNATIONAL JOURNAL OF SOCIAL PSYCHIATRY, 2023, 69 (08) : 1882 - 1889
  • [5] HOW TO TEACH AN OLD SEQUENCER NEW TRICKS
    Perkel, Jeffrey M.
    NATURE, 2018, 559 (7715) : 643 - 645
  • [6] Empirical investigation of active learning strategies
    Pereira-Santos, Davi
    Cavalcante Prudencio, Ricardo Bastos
    de Carvalho, Andre C. P. L. F.
    NEUROCOMPUTING, 2019, 326 : 15 - 27
  • [7] Thrombospondins in physiology and disease: New tricks for old dogs
    Murphy-Ullrich, Joanne E.
    Iozzo, Renato V.
    MATRIX BIOLOGY, 2012, 31 (03) : 152 - 154
  • [8] Analysis of Creative Thinking Skills Development under Active Learning Strategies
    Forte-Celaya, Jacqueline
    Ibarra, Luis
    Glasserman-Morales, Leonardo David
    EDUCATION SCIENCES, 2021, 11 (10):
  • [9] Active Output Selection Strategies for Multiple Learning Regression Models
    Prochaska, Adrian
    Pillas, Julien
    Baeker, Bernard
    PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS (ICPRAM), 2021, : 150 - 157
  • [10] Exploring active learning strategies for predictive models in mechanics of materials
    Chen, Yingbin
    Deierling, Phillip
    Xiao, Shaoping
    APPLIED PHYSICS A-MATERIALS SCIENCE & PROCESSING, 2024, 130 (08):