MORE: a multi-omics data-driven hypergraph integration network for biomedical data classification and biomarker identification

被引:1
|
作者
Wang, Yuhan [1 ]
Wang, Zhikang [2 ,3 ]
Yu, Xuan [4 ]
Wang, Xiaoyu [2 ,3 ]
Song, Jiangning [2 ,3 ,5 ]
Yu, Dong-Jun [1 ]
Ge, Fang [6 ,7 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, 200 Xiaolingwei, Nanjing 210094, Peoples R China
[2] Monash Univ, Biomed Discovery Inst, Wellington Rd, Melbourne, Vic 3800, Australia
[3] Monash Univ, Dept Biochem & Mol Biol, Wellington Rd, Melbourne, Vic 3800, Australia
[4] City Univ Hong Kong, Dept Comp Sci, Hong Kong 999077, Peoples R China
[5] Monash Univ, Data Futures Inst, Wellington Rd, Clayton, Vic 3800, Australia
[6] Nanjing Univ Posts & Telecommun, State Key Lab Organ Elect & Informat Displays, 9 Wenyuan, Nanjing 210023, Peoples R China
[7] Nanjing Univ Posts &Telecommun, Inst Adv Mat IAM, 9 Wenyuan, Nanjing 210023, Peoples R China
基金
中国国家自然科学基金;
关键词
comprehensive hyperedge group; multi-omics hypergraph encoding module; multi-omics self-attention mechanism; identify disease-related biomarkers; BREAST-CANCER; CELLS;
D O I
10.1093/bib/bbae658
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
High-throughput sequencing methods have brought about a huge change in omics-based biomedical study. Integrating various omics data is possibly useful for identifying some correlations across data modalities, thus improving our understanding of the underlying biological mechanisms and complexity. Nevertheless, most existing graph-based feature extraction methods overlook the complementary information and correlations across modalities. Moreover, these methods tend to treat the features of each omics modality equally, which contradicts current biological principles. To solve these challenges, we introduce a novel approach for integrating multi-omics data termed Multi-Omics hypeRgraph integration nEtwork (MORE). MORE initially constructs a comprehensive hyperedge group by extensively investigating the informative correlations within and across modalities. Subsequently, the multi-omics hypergraph encoding module is employed to learn the enriched omics-specific information. Afterward, the multi-omics self-attention mechanism is then utilized to adaptatively aggregate valuable correlations across modalities for representation learning and making the final prediction. We assess MORE's performance on datasets characterized by message RNA (mRNA) expression, Deoxyribonucleic Acid (DNA) methylation, and microRNA (miRNA) expression for Alzheimer's disease, invasive breast carcinoma, and glioblastoma. The results from three classification tasks highlight the competitive advantage of MORE in contrast with current state-of-the-art (SOTA) methods. Moreover, the results also show that MORE has the capability to identify a greater variety of disease-related biomarkers compared to existing methods, highlighting its advantages in biomedical data mining and interpretation. Overall, MORE can be investigated as a valuable tool for facilitating multi-omics analysis and novel biomarker discovery. Our code and data can be publicly accessed at https://github.com/Wangyuhanxx/MORE.
引用
收藏
页数:11
相关论文
共 39 条
  • [21] Multi-omics data integration and modeling unravels new mechanisms for pancreatic cancer and improves prognostic prediction
    Fraunhoffer, Nicolas A.
    Abuelafia, Analia Meilerman
    Bigonnet, Martin
    Gayet, Odile
    Roques, Julie
    Nicolle, Remy
    Lomberk, Gwen
    Urrutia, Raul
    Dusetti, Nelson
    Iovanna, Juan
    NPJ PRECISION ONCOLOGY, 2022, 6 (01)
  • [22] Breast Cancer Recurrence Risk Predictor Using a Deep Learning Multi-omics Data Integration Framework
    Rahman, Ariana
    Zhang, Yining
    Park, Jin G.
    2023 IEEE 36TH INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS, CBMS, 2023, : 921 - 922
  • [23] Network integration of multi-tumour omics data suggests novel targeting strategies
    do Valle, Italo Faria
    Menichetti, Giulia
    Simonetti, Giorgia
    Bruno, Samantha
    Zironi, Isabella
    Durso, Danielle Fernandes
    Mombach, Jose C. M.
    Martinelli, Giovanni
    Castellani, Gastone
    Remondini, Daniel
    NATURE COMMUNICATIONS, 2018, 9
  • [24] Capsule Network Based Modeling of Multi-omics Data for Discovery of Breast Cancer-Related Genes
    Peng, Chen
    Zheng, Yang
    Huang, De-shuang
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2020, 17 (05) : 1605 - 1612
  • [25] A Dynamic Scale-Free Network Particle Swarm Optimization for Extracting Features on Multi-Omics Data
    Li, Huiyu
    Li, Sheng-Jun
    Shang, Junliang
    Liu, Jin-Xing
    Zheng, Chun-Hou
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2019, 26 (08) : 769 - 781
  • [26] The Promise of Multi-Omics and Clinical Data Integration to Identify and Target Personalized Healthcare Approaches in Autism Spectrum Disorders
    Higdon, Roger
    Earl, Rachel K.
    Stanberry, Larissa
    Hudac, Caitlin M.
    Montague, Elizabeth
    Stewart, Elizabeth
    Janko, Imre
    Choiniere, John
    Broomall, William
    Kolker, Natali
    Bernier, Raphael A.
    Kolker, Eugene
    OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY, 2015, 19 (04) : 197 - 208
  • [27] Attention-based GCN integrates multi-omics data for breast cancer subtype classification and patient-specific gene marker identification
    Guo, Hui
    Lv, Xiang
    Li, Yizhou
    Li, Menglong
    BRIEFINGS IN FUNCTIONAL GENOMICS, 2023, 22 (05) : 463 - 474
  • [28] Identifying individualized risk subpathways reveals pan-cancer molecular classification based on multi-omics data
    Xu, Yanjun
    Wang, Jingwen
    Li, Feng
    Zhang, Chunlong
    Zheng, Xuan
    Cao, Yang
    Shang, Desi
    Hu, Congxue
    Xu, Yingqi
    Mi, Wanqi
    Li, Xia
    Cao, Yan
    Zhang, Yunpeng
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2022, 20 : 838 - 849
  • [29] Identification of tumor mutation burden-associated molecular and clinical features in cancer by analyzing multi-omics data
    Li, Mengyuan
    Gao, Xuejiao
    Wang, Xiaosheng
    FRONTIERS IN IMMUNOLOGY, 2023, 14
  • [30] A Translational Pipeline for Overall Survival Prediction of Breast Cancer Patients by Decision-Level Integration of Multi-Omics Data
    Mitchel, Jonathan
    Chatlin, Kevin
    Tong, Li
    Wang, May D.
    2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2019, : 1573 - 1580