Active learning strategies with COMBINE analysis: new tricks for an old dog

Cited by: 6
Authors
Fusani, Lucia [1]
Cortes Cabrera, Alvaro [2]
Affiliations
[1] Mol Design UK GSK Med Res Ctr, Gunnels Wood Rd, Stevenage SG1 2NY, Herts, England
[2] Galchimia SA, Data Sci & Computat Chem, Severo Ochoa 2, Tres Cantos 28760, Spain
Keywords
COMBINE; QSAR; HIV; Taxanes; Protease; BRD4; Active learning; Regression; HIV-1 PROTEASE INHIBITORS; RANDOM FOREST; BINDING; MICROTUBULES; EPOTHILONES; PREDICTION; TOOL; SET;
DOI
10.1007/s10822-018-0181-3
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 ; 081704 ;
Abstract
The COMBINE method was designed to study congeneric series of compounds using structural information from ligand-protein complexes. Although very successful, the method has not received the same level of attention as other approaches to Quantitative Structure-Activity Relationships (QSAR), mainly because of the lack of ways to measure the uncertainty of its predictions and the need for large datasets. Active learning, a semi-supervised learning approach that makes use of uncertainty to enhance model performance while reducing the size of the training set, has been used in this work to address both problems. We propose two estimators of uncertainty: the pool of regressors and the distance to the training set. The performance of the methods has been evaluated by testing the resulting active learning workflows on three diverse datasets: HIV-1 protease inhibitors, Taxol derivatives and BRD4 inhibitors. The proposed strategies were successful in 80% of the cases for the Taxol derivatives and BRD4 inhibitors, and outperformed random selection in the case of the HIV-1 protease inhibitors time-split. Our results suggest that AL-COMBINE might be an effective way of producing consistently superior QSAR models with a limited number of samples.
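The two uncertainty estimators named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the ridge regression is only a stand-in for the COMBINE regression model, and the function names (`pool_uncertainty`, `distance_uncertainty`, `select_next`), the bootstrap resampling, the Euclidean metric and all parameter values are assumptions made for illustration.

```python
import numpy as np

def pool_uncertainty(X_train, y_train, X_pool, n_models=20, seed=0):
    """Spread (std. dev.) of predictions from a pool of bootstrap-fitted regressors.

    Assumption: ridge-regularised least squares stands in for the real model.
    """
    rng = np.random.default_rng(seed)
    n, d = X_train.shape
    preds = []
    for _ in range(n_models):
        idx = rng.integers(0, n, size=n)  # bootstrap resample of the training set
        Xb, yb = X_train[idx], y_train[idx]
        w = np.linalg.solve(Xb.T @ Xb + 1e-3 * np.eye(d), Xb.T @ yb)
        preds.append(X_pool @ w)
    return np.std(preds, axis=0)

def distance_uncertainty(X_train, X_pool):
    """Euclidean distance from each pool sample to its nearest training sample."""
    diff = X_pool[:, None, :] - X_train[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=2)).min(axis=1)

def select_next(X_train, y_train, X_pool, strategy="pool"):
    """Index of the pool sample with the highest estimated uncertainty."""
    if strategy == "pool":
        scores = pool_uncertainty(X_train, y_train, X_pool)
    else:
        scores = distance_uncertainty(X_train, X_pool)
    return int(np.argmax(scores))
```

In an active learning loop, the sample returned by `select_next` would be measured, moved from the pool into the training set, and the model refitted; random selection serves as the baseline for comparison.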
Pages: 287 - 294
Number of pages: 8
Related papers
50 in total
  • [31] Optimization of Active Learning Strategies for Causal Network Structure
    Zhang, Mengxin
    Zhang, Xiaojun
    MATHEMATICS, 2024, 12 (06)
  • [32] Graph-Based Query Strategies for Active Learning
    Wu, Wei
    Ostendorf, Mari
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (02) : 260 - 269
  • [33] Evaluation of multiple active learning strategies in a pharmacology course
    Sumanasekera, Wasana
    Turner, Chase
    Ly, Kaven
    Hoang, Philip
    Jent, Travis
    Sumanasekera, Thimira
    CURRENTS IN PHARMACY TEACHING AND LEARNING, 2020, 12 (01) : 88 - 94
  • [34] Active Learning Strategies for Semi-Supervised DBSCAN
    Li, Jundong
    Sander, Joerg
    Campello, Ricardo
    Zimek, Arthur
    ADVANCES IN ARTIFICIAL INTELLIGENCE, CANADIAN AI 2014, 2014, 8436 : 179 - 190
  • [35] Classification of Skin Lesion through Active Learning Strategies
    Batista, Lucas G.
    Bugatti, Pedro H.
    Saito, Priscila T. M.
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2022, 226
  • [36] Active Learning Methods and Technology: Strategies for Design Education
    Coorey, Jillian
    INTERNATIONAL JOURNAL OF ART & DESIGN EDUCATION, 2016, 35 (03) : 337 - 347
  • [37] Understanding Student Characteristics in the Development of Active Learning Strategies
    Seema Mehta
    Casey P. Schukow
    Amar Takrani
    Raquel P. Ritchie
    Carol A. Wilkins
    Martha A. Faner
    Medical Science Educator, 2022, 32 : 615 - 626
  • [38] Different Strategies of Active Learning in Introductory Astronomy Course
    Pramudya, Yudhiakto
    PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON ADVANCES IN EDUCATION TECHNOLOGY, 2015, 11 : 62 - 64
  • [39] IMPLEMENTING ACTIVE LEARNING STRATEGIES IN A STRUCTURAL ANALYSIS AND DESIGN COURSE: PEDAGOGY DEVELOPMENT AND LESSONS LEARNED
    Salman, A.
    Ahmed, M.
    11TH INTERNATIONAL CONFERENCE OF EDUCATION, RESEARCH AND INNOVATION (ICERI2018), 2018, : 7138 - 7144
  • [40] Optimizing Annotation Effort Using Active Learning Strategies: A Sentiment Analysis Case Study in Persian
    Asli, Seyed Arad Ashrafi
    Sabeti, Behnam
    Majdabadi, Zahra
    Golazizian, Preni
    Fahmi, Reza
    Momenzadeh, Omid
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 2855 - 2861