A multivariate approach to the integration of multi-omics datasets

被引:199
|
作者
Meng, Chen [1 ]
Kuster, Bernhard [1 ,2 ]
Culhane, Aedin C. [3 ,4 ]
Gholami, Amin Moghaddas [1 ]
机构
[1] Tech Univ Munich, Chair Prote & Bioanalyt, Freising Weihenstephan, Germany
[2] Ctr Integrated Prot Sci Munich, Freising Weihenstephan, Germany
[3] Dana Farber Canc Inst, Dept Biostat & Computat Biol, Boston, MA 02215 USA
[4] Harvard Univ, Sch Publ Hlth, Dept Biostat, Boston, MA 02215 USA
来源
BMC BIOINFORMATICS | 2014年 / 15卷
基金
美国国家卫生研究院;
关键词
Multivariate analysis; Multiple co-inertia; Data integration; Omic data; Visualization; PRINCIPAL COMPONENT ANALYSIS; CO-INERTIA ANALYSIS; EXPRESSION PROFILES; MESSENGER-RNA; CELL; PANEL; TOOL; TRANSCRIPT; PROTEOME; DATABASE;
D O I
10.1186/1471-2105-15-162
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: To leverage the potential of multi-omics studies, exploratory data analysis methods that provide systematic integration and comparison of multiple layers of omics information are required. We describe multiple co-inertia analysis (MCIA), an exploratory data analysis method that identifies co-relationships between multiple high dimensional datasets. Based on a covariance optimization criterion, MCIA simultaneously projects several datasets into the same dimensional space, transforming diverse sets of features onto the same scale, to extract the most variant from each dataset and facilitate biological interpretation and pathway analysis. Results: We demonstrate integration of multiple layers of information using MCIA, applied to two typical "omics" research scenarios. The integration of transcriptome and proteome profiles of cells in the NCI-60 cancer cell line panel revealed distinct, complementary features, which together increased the coverage and power of pathway analysis. Our analysis highlighted the importance of the leukemia extravasation signaling pathway in leukemia that was not highly ranked in the analysis of any individual dataset. Secondly, we compared transcriptome profiles of high grade serous ovarian tumors that were obtained, on two different microarray platforms and next generation RNA-sequencing, to identify the most informative platform and extract robust biomarkers of molecular subtypes. We discovered that the variance of RNA-sequencing data processed using RPKM had greater variance than that with MapSplice and RSEM. We provided novel markers highly associated to tumor molecular subtype combined from four data platforms. MCIA is implemented and available in the R/Bioconductor "omicade4" package. Conclusion: We believe MCIA is an attractive method for data integration and visualization of several datasets of multi-omics features observed on the same set of individuals. The method is not dependent on feature annotation, and thus it can extract important features even when there are not present across all datasets. MCIA provides simple graphical representations for the identification of relationships between large datasets.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Editorial: Integration of Multi-Omics Techniques in Cancer
    Andrieux, Geoffroy
    Chakraborty, Sajib
    FRONTIERS IN GENETICS, 2021, 12
  • [22] A cloud solution for multi-omics data integration
    Tordini, Fabio
    2016 INT IEEE CONFERENCES ON UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING AND COMMUNICATIONS, CLOUD AND BIG DATA COMPUTING, INTERNET OF PEOPLE, AND SMART WORLD CONGRESS (UIC/ATC/SCALCOM/CBDCOM/IOP/SMARTWORLD), 2016, : 559 - 566
  • [23] Lung tissue multi-omics integration in copd
    Olvera, Nuria
    Noell, Guillaume
    Casas-Recasens, Sandra
    Lopez-Giraldo, Alejandra
    Guirao, Angela
    Agusti, Alvar
    Faner Canet, Maria Rosa
    EUROPEAN RESPIRATORY JOURNAL, 2020, 56
  • [24] A multi-omics data integration approach to identify a predictive molecular signature of CLAD
    Pellet, Johann
    Lefaudeux, Diane
    Royer, Pierre-Joseph
    Koutsokera, Angela
    Bourgoin-Voillard, Sandrine
    Schmitt, Markus
    Trocme, Candice
    Claustre, Johanna
    Fritz, Andreas
    Seve, Michel
    Magnan, Antoine
    Pison, Christophe
    Nicod, Laurent
    Auffray, Charles
    EUROPEAN RESPIRATORY JOURNAL, 2015, 46
  • [25] On fusion methods for knowledge discovery from multi-omics datasets
    Baldwin, Edwin
    Han, Jiali
    Luo, Wenting
    Zhou, Jin
    An, Lingling
    Liu, Jian
    Zhang, Hao Helen
    Li, Haiquan
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2020, 18 (18): : 509 - 517
  • [26] Multi-omics integration for neuroblastoma clinical endpoint prediction
    Francescatto, Margherita
    Chierici, Marco
    Dezfooli, Setareh Rezvan
    Zandona, Alessandro
    Jurman, Giuseppe
    Furlanello, Cesare
    BIOLOGY DIRECT, 2018, 13
  • [27] Multi-omics data integration approaches for precision oncology
    Correa-Aguila, Raidel
    Alonso-Pupo, Niuxia
    Hernandez-Rodriguez, Erix W.
    MOLECULAR OMICS, 2022, 18 (06) : 469 - 479
  • [28] Integration of Multi-Omics Data to Identify Cancer Biomarkers
    Li, Peng
    Sun, Bo
    JOURNAL OF INFORMATION TECHNOLOGY RESEARCH, 2022, 15 (01)
  • [29] Multi-omics Data Integration, Interpretation, and Its Application
    Subramanian, Indhupriya
    Verma, Srikant
    Kumar, Shiva
    Jere, Abhay
    Anamika, Krishanpal
    BIOINFORMATICS AND BIOLOGY INSIGHTS, 2020, 14
  • [30] Methods for the integration of multi-omics data: mathematical aspects
    Matteo Bersanelli
    Ettore Mosca
    Daniel Remondini
    Enrico Giampieri
    Claudia Sala
    Gastone Castellani
    Luciano Milanesi
    BMC Bioinformatics, 17