A guide to multi-omics data collection and integration for translational medicine

Cited by: 61
Authors
Athieniti, Efi [1]
Spyrou, George M. [1]
Affiliations
[1] Cyprus Inst Neurol & Genet, Dept Bioinformat, 6 Iroon Ave, CY-2371 Nicosia, Cyprus
Keywords
Multi-omics; Integration; Translational medicine; Challenges; MODULES; MODEL; JOINT;
DOI
10.1016/j.csbj.2022.11.050
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular biology]
Subject classification codes
071010; 081704
Abstract
The emerging high-throughput technologies have led to the shift in the design of translational medicine projects towards collecting multi-omics patient samples and, consequently, their integrated analysis. However, the complexity of integrating these datasets has triggered new questions regarding the appropriateness of the available computational methods. Currently, there is no clear consensus on the best combination of omics to include and the data integration methodologies required for their analysis. This article aims to guide the design of multi-omics studies in the field of translational medicine regarding the types of omics and the integration method to choose. We review articles that perform the integration of multiple omics measurements from patient samples. We identify five objectives in translational medicine applications: (i) detect disease-associated molecular patterns, (ii) subtype identification, (iii) diagnosis/prognosis, (iv) drug response prediction, and (v) understand regulatory processes. We describe common trends in the selection of omic types combined for different objectives and diseases. To guide the choice of data integration tools, we group them into the scientific objectives they aim to address. We describe the main computational methods adopted to achieve these objectives and present examples of tools. We compare tools based on how they deal with the computational challenges of data integration and comment on how they perform against predefined objective-specific evaluation criteria. Finally, we discuss examples of tools for downstream analysis and further extraction of novel insights from multi-omics datasets. © 2022 The Authors. Published by Elsevier B.V. on behalf of Research Network of Computational and Structural Biotechnology. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
Pages: 134-149
Page count: 16
Related papers
83 in total
[31]   Integration of multi-omics datasets enables molecular classification of COPD [J].
Li, Chuan-Xing ;
Wheelock, Craig E. ;
Skold, C. Magnus ;
Wheelock, Asa M. .
EUROPEAN RESPIRATORY JOURNAL, 2018, 51 (05)
[32]   Multi-kernel linear mixed model with adaptive lasso for prediction analysis on high-dimensional multi-omics data [J].
Li, Jun ;
Lu, Qing ;
Wen, Yalu .
BIOINFORMATICS, 2020, 36 (06) :1785-1794
[33]   Identifying multi-layer gene regulatory modules from multi-dimensional genomic data [J].
Li, Wenyuan ;
Zhang, Shihua ;
Liu, Chun-Chi ;
Zhou, Xianghong Jasmine .
BIOINFORMATICS, 2012, 28 (19) :2458-2466
[34]   A review on machine learning principles for multi-view biological data integration [J].
Li, Yifeng ;
Wu, Fang-Xiang ;
Ngom, Alioune .
BRIEFINGS IN BIOINFORMATICS, 2018, 19 (02) :325-340
[35]   From expression footprints to causal pathways: contextualizing large signaling networks with CARNIVAL [J].
Liu, Anika ;
Trairatphisan, Panuwat ;
Gjerga, Enio ;
Didangelos, Athanasios ;
Barratt, Jonathan ;
Saez-Rodriguez, Julio .
NPJ SYSTEMS BIOLOGY AND APPLICATIONS, 2019, 5 (1)
[36]   Liu E, 2018, BIOINFORMATICS, V1-3, P155, DOI 10.1016/B978-0-12-809633-8.20218-5
[37]   DriverDBv3: a multi-omics database for cancer driver gene research [J].
Liu, Shu-Hsuan ;
Shen, Pei-Chun ;
Chen, Chen-Yang ;
Hsu, An-Ni ;
Cho, Yi-Chun ;
Lai, Yo-Liang ;
Chen, Fang-Hsin ;
Li, Chia-Yang ;
Wang, Shu-Chi ;
Chen, Ming ;
Chung, I-Fang ;
Cheng, Wei-Chung .
NUCLEIC ACIDS RESEARCH, 2020, 48 (D1) :D863-D870
[38]   JOINT AND INDIVIDUAL VARIATION EXPLAINED (JIVE) FOR INTEGRATED ANALYSIS OF MULTIPLE DATA TYPES [J].
Lock, Eric F. ;
Hoadley, Katherine A. ;
Marron, J. S. ;
Nobel, Andrew B. .
ANNALS OF APPLIED STATISTICS, 2013, 7 (01) :523-542
[39]   A comprehensive survey of the approaches for pathway analysis using multi-omics data integration [J].
Maghsoudi, Zeynab ;
Nguyen, Ha ;
Tavakkoli, Alireza ;
Nguyen, Tin .
BRIEFINGS IN BIOINFORMATICS, 2022, 23 (06)
[40]   Unsupervised multiple kernel learning for heterogeneous data integration [J].
Mariette, Jerome ;
Villa-Vialaneix, Nathalie .
BIOINFORMATICS, 2018, 34 (06) :1009-1015