PGAGP: Predicting pathogenic genes based on adaptive network embedding algorithm

被引:2
|
作者
Zhang, Yan [1 ,2 ,3 ]
Xiang, Ju [1 ,2 ,3 ,4 ,5 ,6 ]
Tang, Liang [3 ,5 ,6 ]
Yang, Jialiang [3 ,7 ,8 ]
Li, Jianming [3 ,5 ,6 ]
机构
[1] Cent South Univ, Sch Comp Sci & Engn, Changsha, Peoples R China
[2] Changsha Med Univ, Sch Informat Sci & Engn, Changsha, Peoples R China
[3] Changsha Med Univ, Academician Workstat, Changsha, Peoples R China
[4] Changsha Univ Sci & Technol, Sch Comp & Commun Engn, Changsha, Peoples R China
[5] Changsha Med Univ, Dept Basic Med Sci, Changsha, Peoples R China
[6] Changsha Med Univ, Neurosci Res Ctr, Changsha, Peoples R China
[7] Qingdao Geneis Inst Big Data Min & Precis Med, Qingdao, Peoples R China
[8] Geneis Beijing Co Ltd, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
disease-gene prediction; biological network; network embedding; network propagation; random projection; TRANSGENIC MOUSE MODEL; ALZHEIMERS-DISEASE; OXIDATIVE STRESS; ASSOCIATION; VARIANTS; POLYMORPHISMS; GRANULIN; DELETION; WALKING; RISK;
D O I
10.3389/fgene.2022.1087784
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
The study of disease-gene associations is an important topic in the field of computational biology. The accumulation of massive amounts of biomedical data provides new possibilities for exploring potential relations between diseases and genes through computational strategy, but how to extract valuable information from the data to predict pathogenic genes accurately and rapidly is currently a challenging and meaningful task. Therefore, we present a novel computational method called PGAGP for inferring potential pathogenic genes based on an adaptive network embedding algorithm. The PGAGP algorithm is to first extract initial features of nodes from a heterogeneous network of diseases and genes efficiently and effectively by Gaussian random projection and then optimize the features of nodes by an adaptive refining process. These low-dimensional features are used to improve the disease-gene heterogenous network, and we apply network propagation to the improved heterogenous network to predict pathogenic genes more effectively. By a series of experiments, we study the effect of PGAGP's parameters and integrated strategies on predictive performance and confirm that PGAGP is better than the state-of-the-art algorithms. Case studies show that many of the predicted candidate genes for specific diseases have been implied to be related to these diseases by literature verification and enrichment analysis, which further verifies the effectiveness of PGAGP. Overall, this work provides a useful solution for mining disease-gene heterogeneous network to predict pathogenic genes more effectively.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] An Adaptive Semantic Mining Framework for Heterogeneous Information Network Embedding
    Shao, Hao
    Zhu, Rangang
    Liu, Hui
    Wang, Lunwen
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (01) : 1384 - 1401
  • [42] Multi-task Network Embedding with Adaptive Loss Weighting
    Rizi, Fatemeh Salehi
    Granitzer, Michael
    2020 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM), 2020, : 1 - 5
  • [43] Unsupervised social network embedding via adaptive specific mappings
    Youming Ge
    Cong Huang
    Yubao Liu
    Sen Zhang
    Weiyang Kong
    Frontiers of Computer Science, 2024, 18
  • [44] Pathogenic Gene Prediction Algorithm Based on Heterogeneous Information Fusion
    Wang, Chunyu
    Zhang, Jie
    Wang, Xueping
    Han, Ke
    Guo, Maozu
    FRONTIERS IN GENETICS, 2020, 11
  • [45] Heterogeneous academic network embedding based multivariate random-walk model for predicting scientific impact
    Chunjing Xiao
    Leilei Sun
    Jianing Han
    Yongwei Qiao
    Applied Intelligence, 2022, 52 : 2171 - 2188
  • [46] Network Embedding Based on DepDist Contraction
    Dopater, Emanuel
    Ochodkova, Eliska
    Kudelka, Milos
    COMPLEX NETWORKS & THEIR APPLICATIONS XII, VOL 1, COMPLEX NETWORKS 2023, 2024, 1141 : 427 - 439
  • [47] Pavement Anomaly Detection Algorithm Based on High-order Dynamic Bayesian Network Embedding
    Li B.
    Zhang H.
    Huanan Ligong Daxue Xuebao/Journal of South China University of Technology (Natural Science), 2020, 48 (01): : 51 - 59
  • [48] Identifying Human Essential Genes by Network Embedding Protein-Protein Interaction Network
    Dai, Wei
    Chang, Qi
    Peng, Wei
    Zhong, Jiancheng
    Li, Yongjiang
    BIOINFORMATICS RESEARCH AND APPLICATIONS, ISBRA 2019, 2019, 11490 : 127 - 137
  • [49] Network Embedding the Protein-Protein Interaction Network for Human Essential Genes Identification
    Dai, Wei
    Chang, Qi
    Peng, Wei
    Zhong, Jiancheng
    Li, Yongjiang
    GENES, 2020, 11 (02)
  • [50] Complex Disease Genes Identification Using a Heterogeneous Network Embedding Approach
    Ghasemi, Mahdieh
    Rahgozar, Maseud
    Kavousi, Kaveh
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (02) : 875 - 882