Active Learning Strategies for Phenotypic Profiling of High-Content Screens

被引:21
|
作者
Smith, Kevin [1 ]
Horvath, Peter [2 ,3 ]
机构
[1] Swiss Fed Inst Technol, Light Microscopy & Screening Ctr, Zurich, Switzerland
[2] Swiss Fed Inst Technol, Inst Biochem, Zurich, Switzerland
[3] Biol Res Ctr, Synthet & Syst Biol Unit, H-6726 Szeged, Hungary
关键词
High-content screening; machine learning; active learning; phenotypic discovery; multiparametric analysis;
D O I
10.1177/1087057114527313
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
High-content screening is a powerful method to discover new drugs and carry out basic biological research. Increasingly, high-content screens have come to rely on supervised machine learning (SML) to perform automatic phenotypic classification as an essential step of the analysis. However, this comes at a cost, namely, the labeled examples required to train the predictive model. Classification performance increases with the number of labeled examples, and because labeling examples demands time from an expert, the training process represents a significant time investment. Active learning strategies attempt to overcome this bottleneck by presenting the most relevant examples to the annotator, thereby achieving high accuracy while minimizing the cost of obtaining labeled data. In this article, we investigate the impact of active learning on single-cell-based phenotype recognition, using data from three large-scale RNA interference high-content screens representing diverse phenotypic profiling problems. We consider several combinations of active learning strategies and popular SML methods. Our results show that active learning significantly reduces the time cost and can be used to reveal the same phenotypic targets identified using SML. We also identify combinations of active learning strategies and SML methods which perform better than others on the phenotypic profiling problems we studied.
引用
收藏
页码:685 / 695
页数:11
相关论文
共 50 条
  • [1] Prioritizing hits from phenotypic high-content screens
    Low, Jonathan
    Stancato, Louis
    Lee, Jonathan
    Sutherland, Jeffrey J.
    CURRENT OPINION IN DRUG DISCOVERY & DEVELOPMENT, 2008, 11 (03) : 338 - 345
  • [2] A statistical framework for high-content phenotypic profiling using cellular feature distributions
    Pearson, Yanthe E.
    Kremb, Stephan
    Butterfoss, Glenn L.
    Xie, Xin
    Fahs, Hala
    Gunsalus, Kristin C.
    COMMUNICATIONS BIOLOGY, 2022, 5 (01)
  • [3] High-Content Analysis to Leverage a Robust Phenotypic Profiling Approach to Vascular Modulation
    Isherwood, Beverley J.
    Walls, Rebecca E.
    Roberts, Mark E.
    Houslay, Thomas M.
    Brave, Sandra R.
    Barry, Simon T.
    Carragher, Neil O.
    JOURNAL OF BIOMOLECULAR SCREENING, 2013, 18 (10) : 1246 - 1259
  • [4] A statistical framework for high-content phenotypic profiling using cellular feature distributions
    Yanthe E. Pearson
    Stephan Kremb
    Glenn L. Butterfoss
    Xin Xie
    Hala Fahs
    Kristin C. Gunsalus
    Communications Biology, 5
  • [5] High-content Analysis in Monastrol Suppressor Screens
    Zhang, Z.
    Ge, Y.
    Zhang, D.
    Zhou, X.
    METHODS OF INFORMATION IN MEDICINE, 2011, 50 (03) : 265 - 272
  • [6] Improving drug discovery with high-content phenotypic screens by systematic selection of reporter cell lines
    Kang, Jungseog
    Hsu, Chien-Hsiang
    Wu, Qi
    Liu, Shanshan
    Coster, Adam D.
    Posner, Bruce A.
    Altschuler, Steven J.
    Wu, Lani F.
    NATURE BIOTECHNOLOGY, 2016, 34 (01) : 70 - 77
  • [7] Improving drug discovery with high-content phenotypic screens by systematic selection of reporter cell lines
    Jungseog Kang
    Chien-Hsiang Hsu
    Qi Wu
    Shanshan Liu
    Adam D Coster
    Bruce A Posner
    Steven J Altschuler
    Lani F Wu
    Nature Biotechnology, 2016, 34 : 70 - 77
  • [8] High-Content Phenotypic Profiling of Drug Response Signatures across Distinct Cancer Cells
    Caie, Peter D.
    Walls, Rebecca E.
    Ingleston-Orme, Alexandra
    Daya, Sandeep
    Houslay, Tom
    Eagle, Rob
    Roberts, Mark E.
    Carragher, Neil O.
    MOLECULAR CANCER THERAPEUTICS, 2010, 9 (06) : 1913 - 1926
  • [9] High-content phenotypic and pathway profiling to advance drug discovery in diseases of unmet need
    Hughes, Rebecca E.
    Elliott, Richard J. R.
    Dawson, John C.
    Carragher, Neil O.
    CELL CHEMICAL BIOLOGY, 2021, 28 (03): : 338 - 355
  • [10] HIGH-CONTENT SCREENS Giving Rho(d) directions
    Muellner, Markus K.
    Nijman, Sebastian M. B.
    NATURE CHEMICAL BIOLOGY, 2010, 6 (06) : 397 - 398