A multivariate approach to the integration of multi-omics datasets

被引:199
|
作者
Meng, Chen [1 ]
Kuster, Bernhard [1 ,2 ]
Culhane, Aedin C. [3 ,4 ]
Gholami, Amin Moghaddas [1 ]
机构
[1] Tech Univ Munich, Chair Prote & Bioanalyt, Freising Weihenstephan, Germany
[2] Ctr Integrated Prot Sci Munich, Freising Weihenstephan, Germany
[3] Dana Farber Canc Inst, Dept Biostat & Computat Biol, Boston, MA 02215 USA
[4] Harvard Univ, Sch Publ Hlth, Dept Biostat, Boston, MA 02215 USA
来源
BMC BIOINFORMATICS | 2014年 / 15卷
基金
美国国家卫生研究院;
关键词
Multivariate analysis; Multiple co-inertia; Data integration; Omic data; Visualization; PRINCIPAL COMPONENT ANALYSIS; CO-INERTIA ANALYSIS; EXPRESSION PROFILES; MESSENGER-RNA; CELL; PANEL; TOOL; TRANSCRIPT; PROTEOME; DATABASE;
D O I
10.1186/1471-2105-15-162
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: To leverage the potential of multi-omics studies, exploratory data analysis methods that provide systematic integration and comparison of multiple layers of omics information are required. We describe multiple co-inertia analysis (MCIA), an exploratory data analysis method that identifies co-relationships between multiple high dimensional datasets. Based on a covariance optimization criterion, MCIA simultaneously projects several datasets into the same dimensional space, transforming diverse sets of features onto the same scale, to extract the most variant from each dataset and facilitate biological interpretation and pathway analysis. Results: We demonstrate integration of multiple layers of information using MCIA, applied to two typical "omics" research scenarios. The integration of transcriptome and proteome profiles of cells in the NCI-60 cancer cell line panel revealed distinct, complementary features, which together increased the coverage and power of pathway analysis. Our analysis highlighted the importance of the leukemia extravasation signaling pathway in leukemia that was not highly ranked in the analysis of any individual dataset. Secondly, we compared transcriptome profiles of high grade serous ovarian tumors that were obtained, on two different microarray platforms and next generation RNA-sequencing, to identify the most informative platform and extract robust biomarkers of molecular subtypes. We discovered that the variance of RNA-sequencing data processed using RPKM had greater variance than that with MapSplice and RSEM. We provided novel markers highly associated to tumor molecular subtype combined from four data platforms. MCIA is implemented and available in the R/Bioconductor "omicade4" package. Conclusion: We believe MCIA is an attractive method for data integration and visualization of several datasets of multi-omics features observed on the same set of individuals. The method is not dependent on feature annotation, and thus it can extract important features even when there are not present across all datasets. MCIA provides simple graphical representations for the identification of relationships between large datasets.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Multi-Omics integration for molecular diagnostics of mendelian disorders
    Kopajtich, R.
    Smirnov, D.
    Loipfinger, S.
    Meng, C.
    Ghezzi, D.
    Murayama, K.
    Mayr, J. A.
    Freisinger, P.
    Metodiev, D.
    Rotig, A.
    Klopstock, T.
    Hoffmann, G. F.
    Santer, R.
    Distelmaier, F.
    Olsen, R.
    Piekutowska-Abramczuk, D.
    Verloo, P.
    Lamperti, C.
    Gagneur, J.
    Ludwig, C.
    Prokisch, H.
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2020, 28 (SUPPL 1) : 119 - 120
  • [42] Review on Integration Analysis and Application of Multi-omics Data
    Zhong, Yating
    Lin, Yanmei
    Chen, Dingjia
    Peng, Yuzhong
    Zeng, Yuanpeng
    Computer Engineering and Applications, 2024, 57 (23) : 1 - 17
  • [43] A Commentary on Multi-omics Data Integration in Systems Vaccinology
    Shannon, Casey P.
    Lee, Amy H. Y.
    Tebbutt, Scott J.
    Singh, Amrit
    JOURNAL OF MOLECULAR BIOLOGY, 2024, 436 (08)
  • [44] Vertical and horizontal integration of multi-omics data with miodin
    Benjamin Ulfenborg
    BMC Bioinformatics, 20
  • [45] Multi-omics integration for neuroblastoma clinical endpoint prediction
    Margherita Francescatto
    Marco Chierici
    Setareh Rezvan Dezfooli
    Alessandro Zandonà
    Giuseppe Jurman
    Cesare Furlanello
    Biology Direct, 13
  • [46] Host and microbiome multi-omics integration: applications and methodologies
    Wang Q.
    Wang K.
    Wu W.
    Giannoulatou E.
    Ho J.W.K.
    Li L.
    Biophysical Reviews, 2019, 11 (1) : 55 - 65
  • [47] Machine learning for multi-omics data integration in cancer
    Cai, Zhaoxiang
    Poulos, Rebecca C.
    Liu, Jia
    Zhong, Qing
    ISCIENCE, 2022, 25 (02)
  • [48] MULTI-OMICS DATA INTEGRATION IN THE CONTEXT OF PRIMARY GLOMERULONEPHRITIS
    Fernandes, Marco
    Delles, Christian
    Husi, Holger
    NEPHROLOGY DIALYSIS TRANSPLANTATION, 2016, 31 : 165 - 165
  • [49] Deep multi-omics integration approach reveals new molecular features of uterine leiomyosarcoma
    Falcao, Raul Maia
    de Souza, Jorge Estefano Santana
    Gonzalez-Molina, Jordi
    Mathieson, William
    Carlson, Joseph W.
    Petta, Tirzah Braz
    BIOCHIMICA ET BIOPHYSICA ACTA-MOLECULAR BASIS OF DISEASE, 2025, 1871 (03):
  • [50] A multi-omics integration approach does not discern endotypes in childhood type 1 diabetes
    Petrelli, A.
    Codazzi, V.
    Baldoni, N.
    Scotti, G. M.
    Giovenzana, A.
    Rigamonti, A.
    Frontino, G.
    Genzano, C. Bechi
    Marzinotto, I.
    Lampasona, V.
    Fiorina, P.
    Giustina, A.
    Battaglia, M.
    Morelli, M.
    Bonfanti, R.
    DIABETOLOGIA, 2024, 67 : S157 - S157