A Value-Based Approach for Training of Classifiers with High-Throughput Small Molecule Screening Data

Cited by: 1
Authors
Khuri, Natalia [1]
Parsons, Sarah [1]
Affiliations
[1] Wake Forest Univ, Winston Salem, NC 27101 USA
Source
12TH ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS (ACM-BCB 2021) | 2021
Keywords
DRUG; INHIBITORS; CLASSIFICATION; DISCOVERY;
DOI
10.1145/3459930.3469514
Chinese Library Classification
TP39 [Computer applications];
Subject classification code
081203 ; 0835 ;
Abstract
In many practical applications of machine learning, models are built using experimental data that are noisy, biased, and of low quality. Binary classifiers trained with such data perform poorly in independent and prospective tests. This work builds upon techniques for estimating the value of training data and evaluates a batch-based data valuation. Comparative experiments with seven challenging benchmarks demonstrate that value-based training of classifiers can improve classification performance by 10% to 25% in independent tests. Additionally, between 97% and 100% of class labels can be detected among low-valued training samples. Finally, results show that simpler and faster learning methods, such as generalized linear models, perform as well as complex gradient boosting trees when the training data comprise only the high-valued samples extracted from high-throughput small molecule screens.
Pages: 10
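The abstract's idea of batch-based data valuation can be illustrated with a minimal sketch. This is not the authors' method: it assumes a leave-one-batch-out value (the validation accuracy lost when a batch is removed) and swaps in a toy nearest-centroid classifier for the generalized linear models and gradient boosting trees named in the abstract. All function names and the tiny dataset are hypothetical, chosen only for illustration.

```python
def centroid(rows):
    """Component-wise mean of a list of feature vectors."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]

def fit(X, y):
    """Toy nearest-centroid classifier: one centroid per class label."""
    return {c: centroid([x for x, t in zip(X, y) if t == c]) for c in set(y)}

def predict(model, x):
    """Return the label whose centroid is closest to x (squared distance)."""
    return min(model, key=lambda c: sum((a - b) ** 2 for a, b in zip(x, model[c])))

def accuracy(model, X, y):
    return sum(predict(model, x) == t for x, t in zip(X, y)) / len(y)

def batch_values(X, y, X_val, y_val, batch_size):
    """Leave-one-batch-out value of each consecutive batch.

    Assumption: value = validation accuracy with all data minus validation
    accuracy with the batch removed. Requires batch_size < len(X).
    """
    base = accuracy(fit(X, y), X_val, y_val)
    values = []
    for start in range(0, len(X), batch_size):
        keep = [i for i in range(len(X)) if not start <= i < start + batch_size]
        model = fit([X[i] for i in keep], [y[i] for i in keep])
        values.append(base - accuracy(model, X_val, y_val))
    return values

# Hypothetical 1-D data: the third batch (last two samples) has flipped labels.
X = [[0.0], [10.0], [1.0], [9.0], [0.0], [10.0]]
y = [0, 1, 0, 1, 1, 0]
X_val = [[0.0], [1.0], [9.0], [10.0]]
y_val = [0, 0, 1, 1]

values = batch_values(X, y, X_val, y_val, batch_size=2)
print(values)  # [1.0, 0.5, 0.0]

# Value-based training: keep only samples from batches with positive value.
keep = [i for i in range(len(X)) if values[i // 2] > 0]
clean_model = fit([X[i] for i in keep], [y[i] for i in keep])
```

In this toy case the mislabeled batch receives the lowest value, so a threshold on batch value both flags the suspect labels and yields a cleaner training set. Real screening data would call for a proper valuation estimator and a held-out test set.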