PGAGP: Predicting pathogenic genes based on adaptive network embedding algorithm

被引:2
|
作者
Zhang, Yan [1 ,2 ,3 ]
Xiang, Ju [1 ,2 ,3 ,4 ,5 ,6 ]
Tang, Liang [3 ,5 ,6 ]
Yang, Jialiang [3 ,7 ,8 ]
Li, Jianming [3 ,5 ,6 ]
机构
[1] Cent South Univ, Sch Comp Sci & Engn, Changsha, Peoples R China
[2] Changsha Med Univ, Sch Informat Sci & Engn, Changsha, Peoples R China
[3] Changsha Med Univ, Academician Workstat, Changsha, Peoples R China
[4] Changsha Univ Sci & Technol, Sch Comp & Commun Engn, Changsha, Peoples R China
[5] Changsha Med Univ, Dept Basic Med Sci, Changsha, Peoples R China
[6] Changsha Med Univ, Neurosci Res Ctr, Changsha, Peoples R China
[7] Qingdao Geneis Inst Big Data Min & Precis Med, Qingdao, Peoples R China
[8] Geneis Beijing Co Ltd, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
disease-gene prediction; biological network; network embedding; network propagation; random projection; TRANSGENIC MOUSE MODEL; ALZHEIMERS-DISEASE; OXIDATIVE STRESS; ASSOCIATION; VARIANTS; POLYMORPHISMS; GRANULIN; DELETION; WALKING; RISK;
D O I
10.3389/fgene.2022.1087784
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
The study of disease-gene associations is an important topic in the field of computational biology. The accumulation of massive amounts of biomedical data provides new possibilities for exploring potential relations between diseases and genes through computational strategy, but how to extract valuable information from the data to predict pathogenic genes accurately and rapidly is currently a challenging and meaningful task. Therefore, we present a novel computational method called PGAGP for inferring potential pathogenic genes based on an adaptive network embedding algorithm. The PGAGP algorithm is to first extract initial features of nodes from a heterogeneous network of diseases and genes efficiently and effectively by Gaussian random projection and then optimize the features of nodes by an adaptive refining process. These low-dimensional features are used to improve the disease-gene heterogenous network, and we apply network propagation to the improved heterogenous network to predict pathogenic genes more effectively. By a series of experiments, we study the effect of PGAGP's parameters and integrated strategies on predictive performance and confirm that PGAGP is better than the state-of-the-art algorithms. Case studies show that many of the predicted candidate genes for specific diseases have been implied to be related to these diseases by literature verification and enrichment analysis, which further verifies the effectiveness of PGAGP. Overall, this work provides a useful solution for mining disease-gene heterogeneous network to predict pathogenic genes more effectively.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] PrGeFNE: Predicting disease-related genes by fast network embedding
    Xiang, Ju
    Zhang, Ning-Rui
    Zhang, Jia-Shuai
    Lv, Xiao-Yi
    Li, Min
    METHODS, 2021, 192 : 3 - 12
  • [2] Pathogenic gene prediction based on network embedding
    Liu, Yang
    Guo, Yuchen
    Liu, Xiaoyan
    Wang, Chunyu
    Guo, Maozu
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (04)
  • [3] Disease-gene prediction based on preserving structure network embedding
    Ma, Jinlong
    Qin, Tian
    Xiang, Ju
    FRONTIERS IN AGING NEUROSCIENCE, 2023, 15
  • [4] Community Preserving Network Embedding Based on Memetic Algorithm
    Gong, Maoguo
    Chen, Cheng
    Xie, Yu
    Wang, Shanfeng
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2020, 4 (02): : 108 - 118
  • [5] Domain Adaptive Network Embedding
    Song, Guojie
    Zhang, Yizhou
    Xu, Lingjun
    Lu, Haibing
    IEEE TRANSACTIONS ON BIG DATA, 2022, 8 (05) : 1220 - 1232
  • [6] Predicting the Disease Genes of Multiple Sclerosis Based on Network Representation Learning
    Liu, Haijie
    Guan, Jiaojiao
    Li, He
    Bao, Zhijie
    Wang, Qingmei
    Luo, Xun
    Xue, Hansheng
    FRONTIERS IN GENETICS, 2020, 11
  • [7] Function-Genes and Disease-Genes Prediction Based on Network Embedding and One-Class Classification
    Shi, Weiyu
    Zhang, Yan
    Sun, Yeqing
    Lin, Zhengkui
    INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES, 2024, 16 (04) : 781 - 801
  • [8] MLANE: Meta-Learning Based Adaptive Network Embedding
    Cui, Chen
    Yang, Ning
    Yu, Philip S.
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 904 - 909
  • [9] Predicting disease-genes based on network information loss and protein complexes in heterogeneous network
    Lei, Xiujuan
    Zhang, Yuchen
    INFORMATION SCIENCES, 2019, 479 : 386 - 400
  • [10] Dual Supervised Network Embedding Based Community Detection Algorithm
    Zheng W.
    Wang Y.
    Yang G.
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2022, 35 (03): : 283 - 290