Strategic Multi-Omics Data Integration via Multi-Level Feature Contrasting and Matching

被引:2
|
作者
Zhang, Jinli [1 ]
Ren, Hongwei [1 ]
Jiang, Zongli [1 ]
Chen, Zheng [2 ]
Yang, Ziwei [3 ]
Matsubara, Yasuko [2 ]
Sakurai, Yasushi [2 ]
机构
[1] Beijing Univ Technol, Dept Comp Sci, Beijing 100022, Peoples R China
[2] Osaka Univ, Inst Sci & Ind Res, Suita, Osaka 5650871, Japan
[3] Kyoto Univ, Bioinformat Ctr, Kyoto 6158540, Japan
基金
日本科学技术振兴机构; 日本学术振兴会; 中国国家自然科学基金;
关键词
Multi-omics; clustering; contrastive learning; self-attention;
D O I
10.1109/TNB.2024.3456797
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The analysis and comprehension of multi-omics data has emerged as a prominent topic in the field of bioinformatics and data science. However, the sparsity characteristics and high dimensionality of omics data pose difficulties in terms of extracting meaningful information. Moreover, the heterogeneity inherent in multiple omics sources makes the effective integration of multi-omics data challenging To tackle these challenges, we propose MFCC-SAtt, a multi-level feature contrast clustering model based on self-attention to extract informative features from multi-omics data. MFCC-SAtt treats each omics type as a distinct modality and employs autoencoders with self-attention for each modality to integrate and compress their respective features into a shared feature space. By utilizing a multi-level feature extraction framework along with incorporating a semantic information extractor, we mitigate optimization conflicts arising from different learning objectives. Additionally, MFCC-SAtt guides deep clustering based on multi-level features which further enhances the quality of output labels. By conducting extensive experiments on multi-omics data, we have validated the exceptional performance of MFCC-SAtt. For instance, in a pan-cancer clustering task, MFCC-SAtt achieved an accuracy of over 80.38%.
引用
收藏
页码:579 / 590
页数:12
相关论文
共 50 条
  • [41] Progress in single-cell multimodal sequencing and multi-omics data integration
    Wang, Xuefei
    Wu, Xinchao
    Hong, Ni
    Jin, Wenfei
    BIOPHYSICAL REVIEWS, 2024, 16 (01) : 13 - 28
  • [42] A Concise Review on Multi-Omics Data Integration for Terroir Analysis in Vitis vinifera
    Fabres, Pastor Jullian
    Collins, Cassandra
    Cavagnaro, Timothy R.
    Lopez, Carlos M. Rodriguez
    FRONTIERS IN PLANT SCIENCE, 2017, 8
  • [43] Causal integration of multi-omics data with prior knowledge to generate mechanistic hypotheses
    Dugourd, Aurelien
    Kuppe, Christoph
    Sciacovelli, Marco
    Gjerga, Enio
    Gabor, Attila
    Emdal, Kristina B.
    Vieira, Vitor
    Bekker-Jensen, Dorte B.
    Kranz, Jennifer
    Bindels, Eric. M. J.
    Costa, Ana S. H.
    Sousa, Abel
    Beltrao, Pedro
    Rocha, Miguel
    Olsen, Jesper V.
    Frezza, Christian
    Kramann, Rafael
    Saez-Rodriguez, Julio
    MOLECULAR SYSTEMS BIOLOGY, 2021, 17 (01)
  • [44] Progress in single-cell multimodal sequencing and multi-omics data integration
    Xuefei Wang
    Xinchao Wu
    Ni Hong
    Wenfei Jin
    Biophysical Reviews, 2024, 16 : 13 - 28
  • [45] Multi-omics data integration and analysis pipeline for precision medicine: Systematic review
    Abdelaziz, Esraa Hamdi
    Ismail, Rasha
    Mabrouk, Mai S.
    Amin, Eman
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2024, 113
  • [46] Integration of Multi-Omics Data Using Probabilistic Graph Models and External Knowledge
    Tripp, Bridget A.
    Otu, Hasan H.
    CURRENT BIOINFORMATICS, 2022, 17 (01) : 37 - 47
  • [47] Integration of Multi-Omics Data for the Classification of Glioma Types and Identification of Novel Biomarkers
    Vieira, Francisca G.
    Bispo, Regina
    Lopes, Marta B.
    BIOINFORMATICS AND BIOLOGY INSIGHTS, 2024, 18
  • [48] Integration of artificial intelligence and multi-omics in kidney diseases
    Zhou, Xu-Jie
    Zhong, Xu-Hui
    Duan, Li-Xin
    FUNDAMENTAL RESEARCH, 2023, 3 (01): : 126 - 148
  • [49] The Omics Dashboard for Interactive Exploration of Metabolomics and Multi-Omics Data
    Paley, Suzanne
    Karp, Peter D.
    METABOLITES, 2024, 14 (01)
  • [50] Integration of solutions and services for multi-omics data analysis towards personalized medicine
    Reska, Daniel
    Czajkowski, Marcin
    Jurczuk, Krzysztof
    Boldak, Cezary
    Kwedlo, Wojciech
    Bauer, Witold
    Koszelew, Jolanta
    Kretowski, Marek
    BIOCYBERNETICS AND BIOMEDICAL ENGINEERING, 2021, 41 (04) : 1646 - 1663