A Comprehensive Investigation of Active Learning Strategies for Conducting Anti-Cancer Drug Screening

Cited by: 1

Authors:

Vasanthakumari, Priyanka [1]

Zhu, Yitan [1]

Brettin, Thomas [2]

Partin, Alexander [1]

Shukla, Maulik [1]

Xia, Fangfang [1]

Narykov, Oleksandr [1]

Weil, Michael Ryan [3]

Stevens, Rick L. [2, 4]

Affiliations:

[1] Argonne Natl Lab, Div Data Sci & Learning, Lemont, IL 60439 USA

[2] Argonne Natl Lab, Comp Environm & Life Sci, Lemont, IL 60439 USA

[3] Frederick Natl Lab Canc Res, Canc Data Sci Initiat, Canc Res Technol Program, Rockville, MD 21701 USA

[4] Univ Chicago, Dept Comp Sci, Chicago, IL 60637 USA

Source:

CANCERS | 2024 / Volume 16 / Issue 03

Keywords:

active learning; machine learning; drug response prediction; drug discovery; cancer; RESPONSES; NETWORKS;

DOI:

10.3390/cancers16030530

Chinese Library Classification (CLC) number:

R73 [Oncology];

Discipline classification code:

100214 ;

Simple Summary:

Preclinical drug screening experiments for anti-cancer drug discovery typically involve testing candidate drugs against cancer cell lines. This process can be expensive and time-consuming because the experimental space, which covers all combinations of candidate cell lines and drugs, can be very large. Guiding drug screening experiments with active learning strategies could help identify promising candidates for successful experimentation. This study investigates various active learning strategies for selecting experiments to generate response data, with the goals of identifying effective treatments and improving the performance of drug response prediction models. We demonstrate that most active learning strategies are more efficient than random selection for identifying effective treatments.

Abstract:

It is well known that cancers of the same histology type can respond differently to a treatment. Thus, computational drug response prediction is of paramount importance for both preclinical drug screening studies and clinical treatment design. To build drug response prediction models, treatment response data need to be generated through screening experiments and used as input to train the prediction models. In this study, we investigate various active learning strategies for selecting experiments to generate response data for the purposes of (1) improving the performance of drug response prediction models built on the data and (2) identifying effective treatments. Here, we focus on constructing drug-specific response prediction models for cancer cell lines. Various approaches have been designed and applied to select cell lines for screening, including random, greedy, uncertainty, diversity, combined greedy and uncertainty, sampling-based hybrid, and iteration-based hybrid approaches. All of these approaches are evaluated and compared using two criteria: (1) the number of identified hits, that is, selected experiments validated to be responsive, and (2) the performance of the response prediction model trained on the data of the selected experiments. The analysis was conducted for 57 drugs, and the results show a significant improvement in identifying hits using active learning approaches compared with the random and greedy sampling methods. Active learning approaches also show an improvement in response prediction performance for some of the drugs and analysis runs compared with the greedy sampling method.
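The selection loop that the abstract describes can be illustrated with a minimal sketch. This is not the authors' implementation: the bootstrap ridge ensemble, the function names, the batch sizes, and the convention that a lower response value means a more responsive cell line (so a hit is a value below `hit_threshold`) are all assumptions made for illustration. Only the random, greedy, and uncertainty strategies are shown; the diversity and hybrid strategies are omitted.

```python
import numpy as np


def fit_ridge(X, y, alpha=1.0):
    """Closed-form ridge regression with a bias column appended."""
    Xb = np.hstack([X, np.ones((X.shape[0], 1))])
    A = Xb.T @ Xb + alpha * np.eye(Xb.shape[1])
    return np.linalg.solve(A, Xb.T @ y)


def ensemble_predict(X_train, y_train, X_pool, n_models=10, rng=None):
    """Bootstrap ensemble (an assumed stand-in model): returns the mean
    and standard deviation of the predictions for each pool sample."""
    rng = np.random.default_rng() if rng is None else rng
    Xp = np.hstack([X_pool, np.ones((X_pool.shape[0], 1))])
    preds = []
    for _ in range(n_models):
        idx = rng.integers(0, len(y_train), size=len(y_train))
        preds.append(Xp @ fit_ridge(X_train[idx], y_train[idx]))
    preds = np.array(preds)
    return preds.mean(axis=0), preds.std(axis=0)


def select_batch(strategy, mean, std, batch_size, rng):
    """Return pool positions of the next experiments to run."""
    if strategy == "random":
        return rng.choice(len(mean), size=batch_size, replace=False)
    if strategy == "greedy":
        score = -mean  # assumption: lower predicted response = more responsive
    elif strategy == "uncertainty":
        score = std    # prefer samples the ensemble disagrees on most
    else:
        raise ValueError(f"unknown strategy: {strategy}")
    return np.argsort(score)[-batch_size:]


def run_active_learning(X, y, strategy, hit_threshold,
                        n_init=20, batch_size=10, n_rounds=5, seed=0):
    """Iteratively select cell lines for one drug; y holds the measured
    responses that a real screen would reveal only after selection."""
    rng = np.random.default_rng(seed)
    labeled = rng.choice(len(y), size=n_init, replace=False).tolist()
    labeled_set = set(labeled)
    pool = [i for i in range(len(y)) if i not in labeled_set]
    for _ in range(n_rounds):
        mean, std = ensemble_predict(X[labeled], y[labeled], X[pool], rng=rng)
        picked = select_batch(strategy, mean, std, batch_size, rng)
        chosen = [pool[p] for p in picked]
        labeled.extend(chosen)
        chosen_set = set(chosen)
        pool = [i for i in pool if i not in chosen_set]
    hits = int(np.sum(y[labeled] < hit_threshold))  # assumed hit definition
    return labeled, hits
```

Calling `run_active_learning(X, y, "greedy", hit_threshold=0.5)` on a feature matrix `X` of cell lines and a response vector `y` for a single drug returns the indices of the selected experiments and the number of hits among them, which corresponds to the first of the two evaluation criteria above. The second criterion would require scoring a model trained on the selected data against held-out cell lines.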

Pages: 18
