Active learning strategies with COMBINE analysis: new tricks for an old dog

被引:6
|
作者
Fusani, Lucia [1 ]
Cortes Cabrera, Alvaro [2 ]
机构
[1] Mol Design UK GSK Med Res Ctr, Gunnels Wood Rd, Stevenage SG1 2NY, Herts, England
[2] Galchimia SA, Data Sci & Computat Chem, Severo Ochoa 2, Tres Cantos 28760, Spain
关键词
COMBINE; QSAR; HIV; Taxanes; Protease; BRD4; Active learning; Regression; HIV-1 PROTEASE INHIBITORS; RANDOM FOREST; BINDING; MICROTUBULES; EPOTHILONES; PREDICTION; TOOL; SET;
D O I
10.1007/s10822-018-0181-3
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The COMBINE method was designed to study congeneric series of compounds including structural information of ligand-protein complexes. Although very successful, the method has not received the same level of attention than other alternatives to study Quantitative Structure Active Relationships (QSAR) mainly because lack of ways to measure the uncertainty of the predictions and the need for large datasets. Active learning, a semi-supervised learning approach that makes use of uncertainty to enhance models' performance while reducing the size of the training sets, has been used in this work to address both problems. We propose two estimators of uncertainty: the pool of regressors and the distance to the training set. The performance of the methods has been evaluated by testing the resulting active learning workflows in 3 diverse datasets: HIV-1 protease inhibitors, Taxol-derivatives and BRD4 inhibitors. The proposed strategies were successful in 80% of the cases for the taxol-derivatives and BRD4 inhibitors, while outperformed random selection in the case of the HIV-1 protease inhibitors time-split. Our results suggest that AL-COMBINE might be an effective way of producing consistently superior QSAR models with a limited number of samples.
引用
收藏
页码:287 / 294
页数:8
相关论文
共 50 条
  • [21] Active Learning Strategies Based on Text Informativeness
    Li, Ruide
    Yamakata, Yoko
    Tajima, Keishi
    2022 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY, WI-IAT, 2022, : 32 - 39
  • [22] Evidential uncertainty sampling strategies for active learning
    Hoarau, Arthur
    Lemaire, Vincent
    Le Gall, Yolande
    Dubois, Jean-Christophe
    Martin, Arnaud
    MACHINE LEARNING, 2024, 113 (09) : 6453 - 6474
  • [23] Iterative active learning strategies for subgraph matching
    Ge, Yurun
    Yang, Dominic
    Bertozzi, Andrea L.
    PATTERN RECOGNITION, 2025, 158
  • [24] Strategies to mitigate student resistance to active learning
    Sneha Tharayil
    Maura Borrego
    Michael Prince
    Kevin A. Nguyen
    Prateek Shekhar
    Cynthia J. Finelli
    Cynthia Waters
    International Journal of STEM Education, 5
  • [25] Distributed Active Learning Strategies on Edge Computing
    Qian, Jia
    Hansen, Lars Kai
    Gochhayat, Sarada Prasad
    2019 6TH IEEE INTERNATIONAL CONFERENCE ON CYBER SECURITY AND CLOUD COMPUTING (IEEE CSCLOUD 2019) / 2019 5TH IEEE INTERNATIONAL CONFERENCE ON EDGE COMPUTING AND SCALABLE CLOUD (IEEE EDGECOM 2019), 2019, : 221 - 226
  • [26] Strategies to mitigate student resistance to active learning
    Tharayil, Sneha
    Borrego, Maura
    Prince, Michael
    Nguyen, Kevin A.
    Shekhar, Prateek
    Finelli, Cynthia J.
    Waters, Cynthia
    INTERNATIONAL JOURNAL OF STEM EDUCATION, 2018, 5
  • [27] Evaluation of active learning strategies for video indexing
    Ayache, Stephane
    Quenot, Georges
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2007, 22 (7-8) : 692 - 704
  • [28] A NEW SEMI-SUPERVISED APPROACH FOR HYPERSPECTRAL IMAGE CLASSIFICATION WITH DIFFERENT ACTIVE LEARNING STRATEGIES
    Dopido, Inmaculada
    Li, Jun
    Plaza, Antonio
    Bioucas-Dias, Jose M.
    2012 4TH WORKSHOP ON HYPERSPECTRAL IMAGE AND SIGNAL PROCESSING (WHISPERS), 2012,
  • [29] Active learning strategies and assessment in world geography classes
    Klein, P
    JOURNAL OF GEOGRAPHY, 2003, 102 (04) : 146 - 157
  • [30] Understanding Student Characteristics in the Development of Active Learning Strategies
    Mehta, Seema
    Schukow, Casey P.
    Takrani, Amar
    Ritchie, Raquel P.
    Wilkins, Carol A.
    Faner, Martha A.
    MEDICAL SCIENCE EDUCATOR, 2022, 32 (03) : 615 - 626