共 50 条
Deep latent space fusion for adaptive representation of heterogeneous multi-omics data
被引:28
|作者:
Zhang, Chengming
Chen, Yabin
Zeng, Tao
Zhang, Chuanchao
Chen, Luonan
机构:
[1] School of Mathematics and Statistics, Shandong University
[2] School of Life and Pharmaceutical Sciences, Dalian University of Technology
[3] Wuhan University, Wuhan
[4] The Huazhong University of Science and Technology, Wuhan
基金:
中国国家自然科学基金;
国家重点研发计划;
关键词:
deep learning;
latent space fusion;
adaptive representation;
omics data;
complex disease;
DATA INTEGRATION;
VARIABLE MODEL;
CANCER;
NETWORK;
BIOMARKERS;
CLASSIFICATION;
IDENTIFICATION;
DISEASES;
BREAST;
D O I:
10.1093/bib/bbab600
中图分类号:
Q5 [生物化学];
学科分类号:
071010 ;
081704 ;
摘要:
The integration of multi-omics data makes it possible to understand complex biological organisms at the system level. Numerous integration approaches have been developed by assuming a common underlying data space. Due to the noise and heterogeneity of biological data, the performance of these approaches is greatly affected. In this work, we propose a novel deep neural network architecture, named Deep Latent Space Fusion (DLSF), which integrates the multi-omics data by learning consistent manifold in the sample latent space for disease subtypes identification. DLSF is built upon a cycle autoencoder with a shared self-expressive layer, which can naturally and adaptively merge nonlinear features at each omics level into one unified sample manifold and produce adaptive representation of heterogeneous samples at the multi-omics level. We have assessed DLSF on various biological and biomedical datasets to validate its effectiveness. DLSF can efficiently and accurately capture the intrinsic manifold of the sample structures or sample clusters compared with other state-of-the-art methods, and DLSF yielded more significant outcomes for biological significance, survival prognosis and clinical relevance in application of cancer study in The Cancer Genome Atlas. Notably, as a deep case study, we determined a new molecular subtype of kidney renal clear cell carcinoma that may benefit immunotherapy in the viewpoint of multi-omics, and we further found potential subtype-specific biomarkers from multiple omics data, which were validated by independent datasets. In addition, we applied DLSF to identify potential therapeutic agents of different molecular subtypes of chronic lymphocytic leukemia, demonstrating the scalability of DLSF in diverse omics data types and application scenarios.
引用
收藏
页数:15
相关论文