Active Learning Strategies for Phenotypic Profiling of High-Content Screens

被引:21
|
作者
Smith, Kevin [1 ]
Horvath, Peter [2 ,3 ]
机构
[1] Swiss Fed Inst Technol, Light Microscopy & Screening Ctr, Zurich, Switzerland
[2] Swiss Fed Inst Technol, Inst Biochem, Zurich, Switzerland
[3] Biol Res Ctr, Synthet & Syst Biol Unit, H-6726 Szeged, Hungary
关键词
High-content screening; machine learning; active learning; phenotypic discovery; multiparametric analysis;
D O I
10.1177/1087057114527313
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
High-content screening is a powerful method to discover new drugs and carry out basic biological research. Increasingly, high-content screens have come to rely on supervised machine learning (SML) to perform automatic phenotypic classification as an essential step of the analysis. However, this comes at a cost, namely, the labeled examples required to train the predictive model. Classification performance increases with the number of labeled examples, and because labeling examples demands time from an expert, the training process represents a significant time investment. Active learning strategies attempt to overcome this bottleneck by presenting the most relevant examples to the annotator, thereby achieving high accuracy while minimizing the cost of obtaining labeled data. In this article, we investigate the impact of active learning on single-cell-based phenotype recognition, using data from three large-scale RNA interference high-content screens representing diverse phenotypic profiling problems. We consider several combinations of active learning strategies and popular SML methods. Our results show that active learning significantly reduces the time cost and can be used to reveal the same phenotypic targets identified using SML. We also identify combinations of active learning strategies and SML methods which perform better than others on the phenotypic profiling problems we studied.
引用
收藏
页码:685 / 695
页数:11
相关论文
共 50 条
  • [21] Online phenotype discovery in high-content RNAi screens using gap statistics
    Yin, Zheng
    Zhou, Xiaobo
    Bakal, Chris
    Li, Fuhai
    Sun, Youxian
    Perrimon, Norbert
    Wong, Stephen T. C.
    COMPUTATIONAL MODELS FOR LIFE SCIENCES (CMLS 07), 2007, 952 : 86 - +
  • [22] Normalizing for individual cell population context in the analysis of high-content cellular screens
    Bettina Knapp
    Ilka Rebhan
    Anil Kumar
    Petr Matula
    Narsis A Kiani
    Marco Binder
    Holger Erfle
    Karl Rohr
    Roland Eils
    Ralf Bartenschlager
    Lars Kaderali
    BMC Bioinformatics, 12
  • [23] High-content imaging-based pooled CRISPR screens in mammalian cells
    Yan, Xiaowei
    Stuurman, Nico
    Ribeiro, Susana A.
    Tanenbaum, Marvin E.
    Horlbeck, Max A.
    Liem, Christina R.
    Jost, Marco
    Weissman, Jonathan S.
    Vale, Ronald D.
    JOURNAL OF CELL BIOLOGY, 2021, 220 (02):
  • [24] π-PhenoDrug: A Comprehensive Deep Learning-Based Pipeline for Phenotypic Drug Screening in High-Content Analysis
    Li, Xiao
    Ouyang, Qinxue
    Han, Mingfei
    Liu, Xiaoqing
    He, Fuchu
    Zhu, Yunping
    Leng, Ling
    Ma, Jie
    ADVANCED INTELLIGENT SYSTEMS, 2025,
  • [25] Image Analysis Methods in High-content Screening for Phenotypic Drug Discovery
    Elena, Vorontsova
    Anastasiya, Solovieva
    2017 INTERNATIONAL MULTI-CONFERENCE ON ENGINEERING, COMPUTER AND INFORMATION SCIENCES (SIBIRCON), 2017, : 575 - 575
  • [26] Machine Learning Improves the Precision and Robustness of High-Content Screens: Using Nonlinear Multiparametric Methods to Analyze Screening Results
    Horvath, Peter
    Wild, Thomas
    Kutay, Ulrike
    Csucs, Gabor
    JOURNAL OF BIOMOLECULAR SCREENING, 2011, 16 (09) : 1059 - 1067
  • [27] Workflow and Metrics for Image Quality Control in Large-Scale High-Content Screens
    Bray, Mark-Anthony
    Fraser, Adam N.
    Hasaka, Thomas P.
    Carpenter, Anne E.
    JOURNAL OF BIOMOLECULAR SCREENING, 2012, 17 (02) : 266 - 274
  • [28] Pattern Recognition in High-Content Cytomics Screens for Target Discovery - Case Studies in Endocytosis
    Cao, Lu
    Yan, Kuan
    Winkel, Leah
    de Graauw, Mado
    Verbeek, Fons J.
    PATTERN RECOGNITION IN BIOINFORMATICS, 2011, 7036 : 330 - +
  • [29] High-Content Chemical and RNAi Screens for Suppressors of Neurotoxicity in a Huntington's Disease Model
    Schulte, Joost
    Sepp, Katharine J.
    Wu, Chaohong
    Hong, Pengyu
    Littleton, J. Troy
    PLOS ONE, 2011, 6 (08):
  • [30] Phenotypic Profiling of High Throughput Imaging Screens with Generic Deep Convolutional Features
    Jackson, Philip T.
    Wang, Yinhai
    Knight, Sinead
    Chen, Hongming
    Dorval, Thierry
    Brown, Martin
    Bendtsen, Claus
    Obara, Boguslaw
    PROCEEDINGS OF MVA 2019 16TH INTERNATIONAL CONFERENCE ON MACHINE VISION APPLICATIONS (MVA), 2019,