A multivariate approach to the integration of multi-omics datasets

被引:199
|
作者
Meng, Chen [1 ]
Kuster, Bernhard [1 ,2 ]
Culhane, Aedin C. [3 ,4 ]
Gholami, Amin Moghaddas [1 ]
机构
[1] Tech Univ Munich, Chair Prote & Bioanalyt, Freising Weihenstephan, Germany
[2] Ctr Integrated Prot Sci Munich, Freising Weihenstephan, Germany
[3] Dana Farber Canc Inst, Dept Biostat & Computat Biol, Boston, MA 02215 USA
[4] Harvard Univ, Sch Publ Hlth, Dept Biostat, Boston, MA 02215 USA
来源
BMC BIOINFORMATICS | 2014年 / 15卷
基金
美国国家卫生研究院;
关键词
Multivariate analysis; Multiple co-inertia; Data integration; Omic data; Visualization; PRINCIPAL COMPONENT ANALYSIS; CO-INERTIA ANALYSIS; EXPRESSION PROFILES; MESSENGER-RNA; CELL; PANEL; TOOL; TRANSCRIPT; PROTEOME; DATABASE;
D O I
10.1186/1471-2105-15-162
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: To leverage the potential of multi-omics studies, exploratory data analysis methods that provide systematic integration and comparison of multiple layers of omics information are required. We describe multiple co-inertia analysis (MCIA), an exploratory data analysis method that identifies co-relationships between multiple high dimensional datasets. Based on a covariance optimization criterion, MCIA simultaneously projects several datasets into the same dimensional space, transforming diverse sets of features onto the same scale, to extract the most variant from each dataset and facilitate biological interpretation and pathway analysis. Results: We demonstrate integration of multiple layers of information using MCIA, applied to two typical "omics" research scenarios. The integration of transcriptome and proteome profiles of cells in the NCI-60 cancer cell line panel revealed distinct, complementary features, which together increased the coverage and power of pathway analysis. Our analysis highlighted the importance of the leukemia extravasation signaling pathway in leukemia that was not highly ranked in the analysis of any individual dataset. Secondly, we compared transcriptome profiles of high grade serous ovarian tumors that were obtained, on two different microarray platforms and next generation RNA-sequencing, to identify the most informative platform and extract robust biomarkers of molecular subtypes. We discovered that the variance of RNA-sequencing data processed using RPKM had greater variance than that with MapSplice and RSEM. We provided novel markers highly associated to tumor molecular subtype combined from four data platforms. MCIA is implemented and available in the R/Bioconductor "omicade4" package. Conclusion: We believe MCIA is an attractive method for data integration and visualization of several datasets of multi-omics features observed on the same set of individuals. The method is not dependent on feature annotation, and thus it can extract important features even when there are not present across all datasets. MCIA provides simple graphical representations for the identification of relationships between large datasets.
引用
收藏
页数:13
相关论文
共 50 条
  • [11] OmicsARules: a R package for integration of multi-omics datasets via association rules mining
    Danze Chen
    Fan Zhang
    Qianqian Zhao
    Jianzhen Xu
    BMC Bioinformatics, 20
  • [12] Statistical Challenges in Multi-omics Integration
    Sebastiani, Paola
    Leshchk, Anastasia
    Song, Zeyuan
    Karagiannis, Tanya T.
    Gurinovich, Anastasia
    Bae, Harold
    Li, Mengze
    Monti, Stefano
    GENETIC EPIDEMIOLOGY, 2022, 46 (07) : 530 - 530
  • [13] Spatial multi-omics: deciphering technological landscape of integration of multi-omics and its applications
    Liu, Xiaojie
    Peng, Ting
    Xu, Miaochun
    Lin, Shitong
    Hu, Bai
    Chu, Tian
    Liu, Binghan
    Xu, Yashi
    Ding, Wencheng
    Li, Li
    Cao, Canhui
    Wu, Peng
    JOURNAL OF HEMATOLOGY & ONCOLOGY, 2024, 17 (01)
  • [14] Myocarditis: A multi-omics approach
    Wang, Lulu
    Sun, Tao
    Liu, Xiaolan
    Wang, Yan
    Qiao, Xiaorong
    Chen, Nuo
    Liu, Fangqian
    Zhou, Xiaoxiang
    Wang, Hua
    Shen, Hongxing
    CLINICA CHIMICA ACTA, 2024, 554
  • [15] An integrative imputation method based on multi-omics datasets
    Dongdong Lin
    Jigang Zhang
    Jingyao Li
    Chao Xu
    Hong-Wen Deng
    Yu-Ping Wang
    BMC Bioinformatics, 17
  • [16] PathIntegrate: Multivariate modelling approaches for pathway-based multi-omics data integration
    Wieder, Cecilia
    Cooke, Juliette
    Frainay, Clement
    Poupin, Nathalie
    Bowler, Russell
    Jourdan, Fabien
    Kechris, Katerina J.
    Lai, Rachel P. J.
    Ebbels, Timothy
    PLOS COMPUTATIONAL BIOLOGY, 2024, 20 (03)
  • [17] An integrative imputation method based on multi-omics datasets
    Lin, Dongdong
    Zhang, Jigang
    Li, Jingyao
    Xu, Chao
    Deng, Hong-Wen
    Wang, Yu-Ping
    BMC BIOINFORMATICS, 2016, 17
  • [18] Systematic Multi-Omics Integration (MOI) Approach in Plant Systems Biology
    Jamil, Ili Nadhirah
    Remali, Juwairiah
    Azizan, Kamalrul Azlan
    Nor Muhammad, Nor Azlan
    Arita, Masanori
    Goh, Hoe-Han
    Aizat, Wan Mohd
    FRONTIERS IN PLANT SCIENCE, 2020, 11
  • [19] Multi-Omics Factor Analysis-a framework for unsupervised integration of multi-omics data sets
    Argelaguet, Ricard
    Velten, Britta
    Arnol, Damien
    Dietrich, Sascha
    Zenz, Thorsten
    Marioni, John C.
    Buettner, Florian
    Huber, Wolfgang
    Stegle, Oliver
    MOLECULAR SYSTEMS BIOLOGY, 2018, 14 (06)
  • [20] Towards multi-omics synthetic data integration
    Selvarajoo, Kumar
    Maurer-Stroh, Sebastian
    BRIEFINGS IN BIOINFORMATICS, 2024, 25 (03)