Prediction of Drug Efficiency by Transferring Gene Expression Data from Cell Lines to Cancer Patients

被引:4
作者
Borisov, Nicolas [1 ,2 ]
Tkachev, Victor [2 ]
Buzdin, Anton [1 ,2 ]
Muchnik, Ilya [3 ]
机构
[1] IM Sechenov First Moscow State Med Univ, Inst Personalized Med, 8-2 Trubetskaya St, Moscow 119991, Russia
[2] OmicsWay Corp, Dept R&D, 40S Lemon Ave, Walnut, CA 91789 USA
[3] Rutgers State Univ, Hill Ctr, Busch Campus, Piscataway, NJ 08855 USA
来源
BRAVERMAN READINGS IN MACHINE LEARNING: KEY IDEAS FROM INCEPTION TO CURRENT STATE | 2018年 / 11100卷
基金
俄罗斯科学基金会;
关键词
CLASSIFICATION; BIOINFORMATICS; INHIBITION; ACTIVATION;
D O I
10.1007/978-3-319-99492-5_9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The paper represents a novel approach for individual medical treatment in oncology, based on machine learning with transferring gene expression data, obtained on cell lines, onto individual cancer patients for drug efficiency prediction. We give a detailed analysis how to build drug response classifiers, on the example of three experimental pairs of data "kind of cancer/chosen drug for treatment". The main hardness of the problem was the meager size of patient training data: it is many many hundred times smaller than a dimensionality of original feature space. The core feature of our transfer technique is to avoid extrapolation in the feature space when make any predictions of the clinical outcome of the treatment for a patient using gene expression data for cell lines. We can assure that there is no extrapolation by special selection of dimensions of the feature space, which provide sufficient number, say M, of cell line points both below and above any point that correspond to a patient. Additionally, in a manner that is a little similar to the k nearest neighbor (kNN) method, after the selection of feature subspace, we take into account only K cell line points that are closer to a patient's point in the selected subspace. Having varied different feasible values of K and M, we showed that the predictor's accuracy considered AUC, for all three cases of cancer-like diseases are equal or higher than 0.7.
引用
收藏
页码:201 / 212
页数:12
相关论文
共 31 条
  • [1] Aliper AM, 2017, METHODS MOL BIOL, V1613, P31, DOI 10.1007/978-1-4939-7027-8_3
  • [2] AN INTRODUCTION TO KERNEL AND NEAREST-NEIGHBOR NONPARAMETRIC REGRESSION
    ALTMAN, NS
    [J]. AMERICAN STATISTICIAN, 1992, 46 (03) : 175 - 185
  • [3] A method for predicting target drug efficiency in cancer based on the analysis of signaling pathway activation
    Artemov, Artem
    Aliper, Alexander
    Korzinkin, Michael
    Lezhnina, Ksenia
    Jellen, Leslie
    Zhukov, Nikolay
    Roumiantsev, Sergey
    Gaifullin, Nurshat
    Zhavoronkov, Alex
    Borisov, Nicolas
    Buzdin, Anton
    [J]. ONCOTARGET, 2015, 6 (30) : 29347 - 29356
  • [4] Bartlett P, 1999, ADVANCES IN KERNEL METHODS, P43
  • [5] Bioinformatic identification and characterization of human endothelial cell-restricted genes
    Bhasin, Manoj
    Yuan, Lei
    Keskin, Derin B.
    Otu, Hasan H.
    Libermann, Towia A.
    Oettgen, Peter
    [J]. BMC GENOMICS, 2010, 11
  • [6] Comprehensive Biomarker Analysis and Final Efficacy Results of Sorafenib in the BATTLE Trial
    Blumenschein, George R., Jr.
    Saintigny, Pierre
    Liu, Suyu
    Kim, Edward S.
    Tsao, Anne S.
    Herbst, Roy S.
    Alden, Christine
    Lee, J. Jack
    Tang, Ximing
    Stewart, David J.
    Kies, Merrill S.
    Fossella, Frank V.
    Tran, Hai T.
    Mao, L.
    Hicks, Marshall E.
    Erasmus, Jeremy, Jr.
    Gupta, Sanjay
    Girard, Luc
    Peyton, Michael
    Diao, Lixia
    Wang, Jing
    Davis, Suzanne E.
    Minna, John D.
    Wistuba, Ignacio
    Hong, Waun K.
    Heymach, John V.
    Lippman, Scott M.
    [J]. CLINICAL CANCER RESEARCH, 2013, 19 (24) : 6967 - 6975
  • [7] Data aggregation at the level of molecular pathways improves stability of experimental transcriptomic and proteomic data
    Borisov, Nicolas
    Suntsova, Maria
    Sorokin, Maxim
    Garazha, Andrew
    Kovalchuk, Olga
    Aliper, Alexander
    Ilnitskaya, Elena
    Lezhnina, Ksenia
    Korzinkin, Mikhail
    Tkachev, Victor
    Saenko, Vyacheslav
    Saenko, Yury
    Sokov, Dmitry G.
    Gaifullin, Nurshat M.
    Kashintsev, Kirill
    Shirokorad, Valery
    Shabalina, Irina
    Zhavoronkov, Alex
    Mishra, Bhubaneswar
    Cantor, Charles R.
    Buzdin, Anton
    [J]. CELL CYCLE, 2017, 16 (19) : 1810 - 1823
  • [8] Buzdin AA, 2017, METHODS MOL BIOL, V1613, P53, DOI 10.1007/978-1-4939-7027-8_4
  • [9] Oncofinder, a new method for the analysis of intracellular signaling pathway activation using transcriptomic data
    Buzdin, Anton A.
    Zhavoronkov, Alex A.
    Korzinkin, Mikhail B.
    Venkova, Larisa S.
    Zenin, Alexander A.
    Smirnov, Philip Yu.
    Borisov, Nikolay M.
    [J]. FRONTIERS IN GENETICS, 2014, 5
  • [10] Celik Z.B., 2015, PARADIGM SVM