PLASMA: Partial LeAst Squares for Multiomics Analysis

被引:0
作者
Yamaguchi, Kyoko [1 ]
Abdelbaky, Salma [1 ]
Yu, Lianbo [2 ]
Oakes, Christopher C. [1 ]
Abruzzo, Lynne V. [3 ]
Coombes, Kevin R. [4 ]
机构
[1] Ohio State Univ, Dept Internal Med, Div Hematol, Columbus, OH 43210 USA
[2] Ohio State Univ, Dept Biomed Informat, Columbus, OH 43210 USA
[3] Med Univ South Carolina, Dept Pathol & Lab Med, Charleston, SC 29425 USA
[4] Augusta Univ, Dept Biostat Data Sci & Epidemiol, Georgia Canc Ctr, Sch Publ Hlth, Augusta, GA 30912 USA
关键词
multiomics; supervised learning; overall survival; gastric cancer; esophageal cancer; GASTRIC-CANCER; UP-REGULATION; EXPRESSION; PROMOTES; INVASION; PROTEIN; PROLIFERATION; METASTASIS; STRATEGIES; REGRESSION;
D O I
10.3390/cancers17020287
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
Background/Objectives: Recent growth in the number and applications of high-throughput "omics" technologies has created a need for better methods to integrate multiomics data. Much progress has been made in developing unsupervised methods, but supervised methods have lagged behind. Methods: Here we present the first algorithm, PLASMA, that can learn to predict time-to-event outcomes from multiomics data sets, even when some samples have only been assayed on a subset of the omics data sets. PLASMA uses two layers of existing partial least squares algorithms to first select components that covary with the outcome and then construct a joint Cox proportional hazards model. Results: We apply PLASMA to the stomach adenocarcinoma (STAD) data from The Cancer Genome Atlas. We validate the model both by splitting the STAD data into training and test sets and by applying them to the subset of esophageal cancer (ESCA) containing adenocarcinomas. We use the other half of the ESCA data, which contains squamous cell carcinomas dissimilar to STAD, as a negative comparison. Our model successfully separates both the STAD test set (p = 2.73 x 10-8) and the independent ESCA adenocarcinoma data (p = 0.025) into high-risk and low-risk patients. It does not separate the negative comparison data set (ESCA squamous cell carcinomas, p = 0.57). The performance of the unified multiomics model is superior to that of individually trained models and is also superior to an unsupervised method (Multi-Omics Factor Analysis; MOFA), which finds latent factors to be used as putative predictors in a post hoc survival analysis. Conclusions: Many of the factors that contribute strongly to the PLASMA model can be justified from the biological literature.
引用
收藏
页数:17
相关论文
共 63 条
[1]   Are DNA Repair Factors Promising Biomarkers for Personalized Therapy in Gastric Cancer? [J].
Abdel-Fatah, Tarek ;
Arora, Arvind ;
Gorguc, Ipek ;
Abbotts, Rachel ;
Beebeejaun, Sarah ;
Storr, Sarah ;
Mohan, Vivek ;
Hawkes, Claire ;
Soomro, Irshad ;
Lobo, Dileep N. ;
Parsons, Simon L. ;
Madhusudan, Srinivasan .
ANTIOXIDANTS & REDOX SIGNALING, 2013, 18 (18) :2392-2398
[2]   Thirty biologically interpretable clusters of transcription factors distinguish cancer type [J].
Abrams, Zachary B. ;
Zucker, Mark ;
Wang, Min ;
Taheri, Amir Asiaee ;
Abruzzo, Lynne V. ;
Coombes, Kevin R. .
BMC GENOMICS, 2018, 19
[3]   Computational strategies for single-cell multi-omics integration [J].
Adossa, Nigatu ;
Khan, Sofia ;
Rytkonen, Kalle T. ;
Elo, Laura L. .
COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2021, 19 :2588-2596
[4]  
Ahadi Mahsa, 2020, Asian Pac J Cancer Prev, V21, P1607, DOI 10.31557/APJCP.2020.21.6.1607
[5]   MOFA plus : a statistical framework for comprehensive integration of multi-modal single-cell data [J].
Argelaguet, Ricard ;
Arnol, Damien ;
Bredikhin, Danila ;
Deloro, Yonatan ;
Velten, Britta ;
Marioni, John C. ;
Stegle, Oliver .
GENOME BIOLOGY, 2020, 21 (01)
[6]   Multi-Omics Factor Analysis-a framework for unsupervised integration of multi-omics data sets [J].
Argelaguet, Ricard ;
Velten, Britta ;
Arnol, Damien ;
Dietrich, Sascha ;
Zenz, Thorsten ;
Marioni, John C. ;
Buettner, Florian ;
Huber, Wolfgang ;
Stegle, Oliver .
MOLECULAR SYSTEMS BIOLOGY, 2018, 14 (06)
[7]   Epidermal Growth Factor Receptor Family and its Role in Gastric Cancer [J].
Arienti, Chiara ;
Pignatta, Sara ;
Tesei, Anna .
FRONTIERS IN ONCOLOGY, 2019, 9
[8]   Deviance residuals-based sparse PLS and sparse kernel PLS regression for censored data [J].
Bastien, Philippe ;
Bertrand, Frederic ;
Meyer, Nicolas ;
Maumy-Bertrand, Myriam .
BIOINFORMATICS, 2015, 31 (03) :397-404
[9]  
Bazas V. M., 2009, Experimental Oncology, V31, P182
[10]   Fitting and Cross-Validating Cox Models to Censored Big Data With Missing Values Using Extensions of Partial Least Squares Regression Models [J].
Bertrand, Frederic ;
Maumy-Bertrand, Myriam .
FRONTIERS IN BIG DATA, 2021, 4