A Comprehensive Investigation of Active Learning Strategies for Conducting Anti-Cancer Drug Screening

Cited by: 1
Authors
Vasanthakumari, Priyanka [1]
Zhu, Yitan [1]
Brettin, Thomas [2]
Partin, Alexander [1]
Shukla, Maulik [1]
Xia, Fangfang [1]
Narykov, Oleksandr [1]
Weil, Michael Ryan [3]
Stevens, Rick L. [2, 4]
Affiliations
[1] Argonne Natl Lab, Div Data Sci & Learning, Lemont, IL 60439 USA
[2] Argonne Natl Lab, Comp Environm & Life Sci, Lemont, IL 60439 USA
[3] Frederick Natl Lab Canc Res, Canc Data Sci Initiat, Canc Res Technol Program, Rockville, MD 21701 USA
[4] Univ Chicago, Dept Comp Sci, Chicago, IL 60637 USA
Keywords
active learning; machine learning; drug response prediction; drug discovery; cancer; RESPONSES; NETWORKS
DOI
10.3390/cancers16030530
Chinese Library Classification (CLC) number
R73 [Oncology]
Discipline classification code
100214
Abstract
Simple Summary: Preclinical drug screening experiments for anti-cancer drug discovery typically involve testing candidate drugs against cancer cell lines. This process can be expensive and time-consuming because the experimental space, which spans all combinations of candidate cell lines and drugs, can be very large. Guiding drug screening experiments with active learning strategies could help identify promising candidates for successful experimentation. This study investigates various active learning strategies for selecting experiments to generate response data, with the goals of identifying effective treatments and improving the performance of drug response prediction models. We demonstrate that most active learning strategies are more efficient than random selection for identifying effective treatments.

Abstract: It is well known that cancers of the same histology type can respond differently to a treatment. Thus, computational drug response prediction is of paramount importance for both preclinical drug screening studies and clinical treatment design. To build drug response prediction models, treatment response data need to be generated through screening experiments and used as input to train the prediction models. In this study, we investigate various active learning strategies for selecting experiments to generate response data for the purposes of (1) improving the performance of drug response prediction models built on the data and (2) identifying effective treatments. Here, we focus on constructing drug-specific response prediction models for cancer cell lines. Various approaches have been designed and applied to select cell lines for screening, including random, greedy, uncertainty, diversity, combination of greedy and uncertainty, sampling-based hybrid, and iteration-based hybrid approaches.

All of these approaches are evaluated and compared using two criteria: (1) the number of identified hits, i.e., selected experiments validated to be responsive, and (2) the performance of the response prediction model trained on the data of selected experiments. The analysis was conducted for 57 drugs, and the results show a significant improvement in identifying hits using active learning approaches compared with the random and greedy sampling methods. Active learning approaches also show an improvement in response prediction performance for some of the drugs and analysis runs compared with the greedy sampling method.
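
To make the selection strategies named in the abstract more concrete, the following is a minimal sketch of a pool-based active learning loop for a single drug. It is not the authors' implementation: the bootstrap ridge ensemble, the synthetic features, the function names (`fit_ridge`, `ensemble_predict`, `select_batch`, `run_active_learning`), the batch sizes, and the convention that a lower response value means a more sensitive cell line are all assumptions made for illustration. Only the random, greedy, uncertainty, and combined greedy-plus-uncertainty strategies are sketched; the diversity and hybrid approaches are omitted.

```python
import numpy as np

def fit_ridge(X, y, lam=1.0):
    """Closed-form ridge regression; returns the weight vector."""
    d = X.shape[1]
    return np.linalg.solve(X.T @ X + lam * np.eye(d), X.T @ y)

def ensemble_predict(X_train, y_train, X_pool, n_models=10, rng=None):
    """Bootstrap ensemble: mean and std of predictions over the pool."""
    rng = np.random.default_rng() if rng is None else rng
    preds = []
    for _ in range(n_models):
        idx = rng.integers(0, len(X_train), size=len(X_train))
        preds.append(X_pool @ fit_ridge(X_train[idx], y_train[idx]))
    preds = np.asarray(preds)
    return preds.mean(axis=0), preds.std(axis=0)

def select_batch(strategy, mean, std, batch_size, rng):
    """Return pool positions to screen next.

    Assumption: lower predicted response = more sensitive cell line.
    """
    if strategy == "random":
        return rng.choice(len(mean), size=batch_size, replace=False)
    if strategy == "greedy":
        score = -mean                # most responsive predictions first
    elif strategy == "uncertainty":
        score = std                  # largest ensemble disagreement first
    elif strategy == "greedy_uncertainty":
        score = -mean + std          # simple unweighted combination
    else:
        raise ValueError(f"unknown strategy: {strategy}")
    return np.argsort(score)[-batch_size:]

def run_active_learning(X, y, strategy, hit_threshold,
                        n_init=20, batch_size=10, n_rounds=5, seed=0):
    """Iteratively pick cell lines to screen; count hits among the picks."""
    rng = np.random.default_rng(seed)
    labeled = rng.choice(len(X), size=n_init, replace=False).tolist()
    labeled_set = set(labeled)
    pool = [i for i in range(len(X)) if i not in labeled_set]
    for _ in range(n_rounds):
        mean, std = ensemble_predict(X[labeled], y[labeled], X[pool], rng=rng)
        picks = select_batch(strategy, mean, std, batch_size, rng)
        chosen = [pool[int(p)] for p in picks]
        labeled.extend(chosen)       # "screening" reveals y for these rows
        chosen_set = set(chosen)
        pool = [i for i in pool if i not in chosen_set]
    selected = labeled[n_init:]      # exclude the random initial set
    hits = int(np.sum(y[selected] < hit_threshold))
    return labeled, hits

# Synthetic stand-in for cell-line features and one drug's responses.
data_rng = np.random.default_rng(42)
X_demo = data_rng.normal(size=(300, 8))
y_demo = X_demo @ data_rng.normal(size=8) + 0.1 * data_rng.normal(size=300)
threshold = np.quantile(y_demo, 0.1)  # hypothetical hit cutoff: lowest 10%

for name in ("random", "greedy", "uncertainty", "greedy_uncertainty"):
    _, n_hits = run_active_learning(X_demo, y_demo, name, threshold)
    print(f"{name}: {n_hits} hits among selected experiments")
```

The two evaluation criteria in the abstract map onto this sketch as follows: the hit count is returned directly, while model performance could be assessed by refitting on the labeled set after each round and scoring on held-out cell lines, which is left out here to keep the example short.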
Pages: 18