Auto-HMM-LMF: feature selection based method for prediction of drug response via autoencoder and hidden Markov model

被引:19
作者
Emdadi, Akram [1 ]
Eslahchi, Changiz [1 ,2 ]
机构
[1] Shahid Beheshti Univ, Dept Comp & Data Sci, Fac Math Sci, Tehran, Iran
[2] Inst Res Fundamental Sci IPM, Sch Biol Sci, Tehran 193955746, Iran
关键词
Cancer; Drug response; Autoencoder; Hidden Markov model; Matrix factorization; Personalized treatment;
D O I
10.1186/s12859-021-03974-3
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Predicting the response of cancer cell lines to specific drugs is an essential problem in personalized medicine. Since drug response is closely associated with genomic information in cancer cells, some large panels of several hundred human cancer cell lines are organized with genomic and pharmacogenomic data. Although several methods have been developed to predict the drug response, there are many challenges in achieving accurate predictions. This study proposes a novel feature selection-based method, named Auto-HMM-LMF, to predict cell line-drug associations accurately. Because of the vast dimensions of the feature space for predicting the drug response, Auto-HMM-LMF focuses on the feature selection issue for exploiting a subset of inputs with a significant contribution. Results: This research introduces a novel method for feature selection of mutation data based on signature assignments and hidden Markov models. Also, we use the autoencoder models for feature selection of gene expression and copy number variation data. After selecting features, the logistic matrix factorization model is applied to predict drug response values. Besides, by comparing to one of the most powerful feature selection methods, the ensemble feature selection method (EFS), we showed that the performance of the predictive model based on selected features introduced in this paper is much better for drug response prediction. Two datasets, the Genomics of Drug Sensitivity in Cancer (GDSC) and Cancer Cell Line Encyclopedia (CCLE) are used to indicate the efficiency of the proposed method across unseen patient cell-line. Evaluation of the proposed model showed that Auto-HMM-LMF could improve the accuracy of the results of the state-of-the-art algorithms, and it can find useful features for the logistic matrix factorization method. Conclusions: We depicted an application of Auto-HMM-LMF in exploring the new candidate drugs for head and neck cancer that showed the proposed method is useful in drug repositioning and personalized medicine. The source code of Auto-HMM-LMF method is available in .
引用
收藏
页数:22
相关论文
共 36 条
  • [1] Signatures of mutational processes in human cancer
    Alexandrov, Ludmil B.
    Nik-Zainal, Serena
    Wedge, David C.
    Aparicio, Samuel A. J. R.
    Behjati, Sam
    Biankin, Andrew V.
    Bignell, Graham R.
    Bolli, Niccolo
    Borg, Ake
    Borresen-Dale, Anne-Lise
    Boyault, Sandrine
    Burkhardt, Birgit
    Butler, Adam P.
    Caldas, Carlos
    Davies, Helen R.
    Desmedt, Christine
    Eils, Roland
    Eyfjord, Jorunn Erla
    Foekens, John A.
    Greaves, Mel
    Hosoda, Fumie
    Hutter, Barbara
    Ilicic, Tomislav
    Imbeaud, Sandrine
    Imielinsk, Marcin
    Jaeger, Natalie
    Jones, David T. W.
    Jones, David
    Knappskog, Stian
    Kool, Marcel
    Lakhani, Sunil R.
    Lopez-Otin, Carlos
    Martin, Sancha
    Munshi, Nikhil C.
    Nakamura, Hiromi
    Northcott, Paul A.
    Pajic, Marina
    Papaemmanuil, Elli
    Paradiso, Angelo
    Pearson, John V.
    Puente, Xose S.
    Raine, Keiran
    Ramakrishna, Manasa
    Richardson, Andrea L.
    Richter, Julia
    Rosenstiel, Philip
    Schlesner, Matthias
    Schumacher, Ton N.
    Span, Paul N.
    Teague, Jon W.
    [J]. NATURE, 2013, 500 (7463) : 415 - +
  • [2] The Cancer Cell Line Encyclopedia enables predictive modelling of anticancer drug sensitivity
    Barretina, Jordi
    Caponigro, Giordano
    Stransky, Nicolas
    Venkatesan, Kavitha
    Margolin, Adam A.
    Kim, Sungjoon
    Wilson, Christopher J.
    Lehar, Joseph
    Kryukov, Gregory V.
    Sonkin, Dmitriy
    Reddy, Anupama
    Liu, Manway
    Murray, Lauren
    Berger, Michael F.
    Monahan, John E.
    Morais, Paula
    Meltzer, Jodi
    Korejwa, Adam
    Jane-Valbuena, Judit
    Mapa, Felipa A.
    Thibault, Joseph
    Bric-Furlong, Eva
    Raman, Pichai
    Shipway, Aaron
    Engels, Ingo H.
    Cheng, Jill
    Yu, Guoying K.
    Yu, Jianjun
    Aspesi, Peter, Jr.
    de Silva, Melanie
    Jagtap, Kalpana
    Jones, Michael D.
    Wang, Li
    Hatton, Charles
    Palescandolo, Emanuele
    Gupta, Supriya
    Mahan, Scott
    Sougnez, Carrie
    Onofrio, Robert C.
    Liefeld, Ted
    MacConaill, Laura
    Winckler, Wendy
    Reich, Michael
    Li, Nanxin
    Mesirov, Jill P.
    Gabriel, Stacey B.
    Getz, Gad
    Ardlie, Kristin
    Chan, Vivien
    Myer, Vic E.
    [J]. NATURE, 2012, 483 (7391) : 603 - 607
  • [3] Predicting Radiation Resistance in Breast Cancer with Expression Status of Phosphorylated S6K1
    Choi, Jihye
    Yoon, Yi Na
    Kim, Nawon
    Park, Chan Sub
    Seol, Hyesil
    Park, In-Chul
    Kim, Hyun-Ah
    Noh, Woo Chul
    Kim, Jae-Sung
    Seong, Min-Ki
    [J]. SCIENTIFIC REPORTS, 2020, 10 (01)
  • [4] Screening for anti-inflammatory components from Corydalis bungeana Turcz. based on macrophage binding combined with HPLC
    Dong, Zi-Bo
    Zhang, Yong-Hong
    Zhao, Bing-Jie
    Li, Chao
    Tian, Gang
    Niu, Ben
    Qi, Hong
    Feng, Liang
    Shao, Jian-Guo
    [J]. BMC COMPLEMENTARY AND ALTERNATIVE MEDICINE, 2015, 15
  • [5] Durbin R, 1998, BIOL SEQ ANAL, V14, P164
  • [6] DSPLMF: A Method for Cancer Drug Sensitivity Prediction Using a Novel Regularization Approach in Logistic Matrix Factorization
    Emdadi, Akram
    Eslahchi, Changiz
    [J]. FRONTIERS IN GENETICS, 2020, 11
  • [7] A novel algorithm for parameter estimation of Hidden Markov Model inspired by Ant Colony Optimization
    Emdadi, Akram
    Moughari, Fatemeh Ahmadi
    Meybodi, Fatemeh Yassaee
    Eslahchi, Changiz
    [J]. HELIYON, 2019, 5 (03)
  • [8] COSMIC: somatic cancer genetics at high-resolution
    Forbes, Simon A.
    Beare, David
    Boutselakis, Harry
    Bamford, Sally
    Bindal, Nidhi
    Tate, John
    Cole, Charlotte G.
    Ward, Sari
    Dawson, Elisabeth
    Ponting, Laura
    Stefancsik, Raymund
    Harsha, Bhavana
    Kok, Chai Yin
    Jia, Mingming
    Jubb, Harry
    Sondka, Zbyslaw
    Thompson, Sam
    De, Tisham
    Campbell, Peter J.
    [J]. NUCLEIC ACIDS RESEARCH, 2017, 45 (D1) : D777 - D783
  • [9] Fuerst ML., 2020, LWW, V42, P34
  • [10] Next-generation characterization of the Cancer Cell Line Encyclopedia
    Ghandi, Mahmoud
    Huang, Franklin W.
    Jane-Valbuena, Judit
    Kryukov, Gregory V.
    Lo, Christopher C.
    McDonald, E. Robert, III
    Barretina, Jordi
    Gelfand, Ellen T.
    Bielski, Craig M.
    Li, Haoxin
    Hu, Kevin
    Andreev-Drakhlin, Alexander Y.
    Kim, Jaegil
    Hess, Julian M.
    Haas, Brian J.
    Aguet, Francois
    Weir, Barbara A.
    Rothberg, Michael V.
    Paolella, Brenton R.
    Lawrence, Michael S.
    Akbani, Rehan
    Lu, Yiling
    Tiv, Hong L.
    Gokhale, Prafulla C.
    De Weck, Antoine
    Mansour, Ali Amin
    Oh, Coyin
    Shih, Juliann
    Hadi, Kevin
    Rosen, Yanay
    Bistline, Jonathan
    Venkatesan, Kavitha
    Reddy, Anupama
    Sonkin, Dmitriy
    Liu, Manway
    Lehar, Joseph
    Korn, Joshua M.
    Porter, Dale A.
    Jones, Michael D.
    Golji, Javad
    Caponigro, Giordano
    Taylor, Jordan E.
    Dunning, Caitlin M.
    Creech, Amanda L.
    Warren, Allison C.
    McFarland, James M.
    Zamanighomi, Mahdi
    Kauffmann, Audrey
    Stransky, Nicolas
    Imielinski, Marcin
    [J]. NATURE, 2019, 569 (7757) : 503 - +