MuCoMiD: A <bold>Mu</bold>ltitask Graph <bold>Co</bold>nvolutional Learning Framework for <bold>mi</bold>RNA-<bold>D</bold>isease Association Prediction

被引:5
|
作者
Dong, Ngan [1 ]
Muecke, Stefanie [2 ]
Khosla, Megha [3 ]
机构
[1] Leibniz Univ Hann, L3S Res Ctr, D-30167 Hannover, Germany
[2] TRAIN Omics, Translat Alliance Lower Saxony, D-37081 Hannover, Germany
[3] Delft Univ Technol TU Delft, NL-2628 CD Delft, Netherlands
关键词
Data integration; disease; graph representation learning; MiRNA; multitask; MICRORNAS; TUMORIGENESIS; SIMILARITY;
D O I
10.1109/TCBB.2022.3176456
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Growing evidence from recent studies implies that microRNAs or miRNAs could serve as biomarkers in various complex human diseases. Since wet-lab experiments for detecting miRNAs associated with a disease are expensive and time-consuming, machine learning techniques for miRNA-disease association prediction have attracted much attention in recent years. A big challenge in building reliable machine learning models is that of data scarcity. In particular, existing approaches trained on the available small datasets, even when combined with precalculated handcrafted input features, often suffer from bad generalization and data leakage problems. We overcome the limitations of existing works by proposing a novel multitask graph convolution-based approach, which we refer to as MuCoMiD. MuCoMiD allows automatic feature extraction while incorporating knowledge from five heterogeneous biological information sources (associations between miRNAs/diseases and protein-coding genes (PCGs), interactions between protein-coding genes, miRNA family information, and disease ontology) in a multitask setting which is a novel perspective and has not been studied before. To effectively test the generalization capability of our model, we conduct large-scale experiments on the standard benchmark datasets as well as on our proposed large independent testing sets and case studies. MuCoMiD obtains significantly higher Average Precision (AP) scores than all benchmarked models on three large independent testing sets, especially those with many new miRNAs, as well as in the detection of false positives. Thanks to its capability of learning directly from raw input information, MuCoMiD is easier to maintain and update than handcrafted feature-based methods, which would require recomputation of features every time there is a change in the original information sources (e.g., disease ontology, miRNA/disease-PCG associations, etc.). We share our code for reproducibility and future research at https://git.l3s.uni-hannover.de/dong/cmtt.
引用
收藏
页码:3081 / 3092
页数:12
相关论文
共 8 条
  • [1] <bold>Association of nutritional status indices with gastrointestinal symptoms in systemic sclerosis: a cross-sectional study</bold>
    Oz, Nuran
    Gezer, Halise Hande
    Karabulut, Yusuf
    Duruoz, Mehmet Tuncay
    RHEUMATOLOGY INTERNATIONAL, 2025, 45 (01)
  • [2] The association between carbohydrate quality index and headache severity, disability and duration among women with migraine<bold>:</bold> a cross-sectional study
    Jebraeili, Haniyeh
    Mirzababaei, Atieh
    Abaj, Faezeh
    Mirzaei, Khadijeh
    NUTRITIONAL NEUROSCIENCE, 2024, 27 (10) : 1162 - 1173
  • [3] EmDL: &lt;bold&gt;&lt;underline&gt;E&lt;/underline&gt;&lt;/bold&gt;xtracting &lt;bold&gt;&lt;underline&gt;m&lt;/underline&gt;&lt;/bold&gt;iRNA-&lt;bold&gt;&lt;underline&gt;D&lt;/underline&gt;&lt;/bold&gt;rug Interactions from &lt;bold&gt;&lt;underline&gt;L&lt;/underline&gt;&lt;/bold&gt;iterature
    Xie, Wen-Bin
    Yan, Hong
    Zhao, Xing-Ming
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2019, 16 (05) : 1722 - 1728
  • [4] Advantages of the co2 laser use in the rare condition of nasal mucosa squamous cell carcinoma surgery in dogs<bold>-</bold>a clinical prospective study
    Carreira, L. Miguel
    Azevedo, P.
    LASERS IN MEDICAL SCIENCE, 2024, 39 (01)
  • [5] Normal BOLD Response to a Step CO2 Stimulus After Correction for Partial Volume Averaging
    Poublanc, Julien
    Shafi, Reema
    Sobczyk, Olivia
    Sam, Kevin
    Mandell, Daniel M.
    Venkatraghavan, Lakshmikumar
    Duffin, James
    Fisher, Joseph A.
    Mikulis, David J.
    FRONTIERS IN PHYSIOLOGY, 2021, 12
  • [6] Association of pulse wave velocity with total lung capacity: A cross-sectional analysis of the BOLD London study
    Amaral, Andre F. S.
    Patel, Jaymini
    Gnatiuc, Louisa
    Jones, Meinir
    Burney, Peter G. J.
    RESPIRATORY MEDICINE, 2015, 109 (12) : 1569 - 1575
  • [7] Robust BOLD Responses to Faces But Not to Conditioned Threat: Challenging the Amygdala's Reputation in Human Fear and Extinction Learning
    Visser, Renee M.
    Bathelt, Joe
    Scholte, H. Steven
    Kindt, Merel
    JOURNAL OF NEUROSCIENCE, 2021, 41 (50) : 10278 - 10292
  • [8] A conceptual model for CO2-induced redistribution of cerebral blood flow with experimental confirmation using BOLD MRI
    Sobczyk, O.
    Battisti-Charbonney, A.
    Fierstra, J.
    Mandell, D. M.
    Poublanc, J.
    Crawley, A. P.
    Mikulis, D. J.
    Duffin, J.
    Fisher, J. A.
    NEUROIMAGE, 2014, 92 : 56 - 68