A Gene-disease Association Prediction Algorithm Based on Multi-source Data Fusion

被引:0
作者
Wang F. [1 ]
机构
[1] Information Management Center Hohhot Vocational College, Hohhot
关键词
Gene-disease association prediction; Multi-dimensional phenotype; Multi-source data fusion;
D O I
10.7546/ijba.2022.26.1.000870
中图分类号
学科分类号
摘要
Accurate gene-disease association prediction results are the basis for effective diagnosis and treatment of complex genetic diseases. However, existing studies related to this topic generally face problems in two aspects: large volume of original data and diverse data type, and data fusion difficulty. Therefore, this paper studied a gene-disease association prediction algorithm based on multi-source data fusion. At first, it processed the multidimensional gene phenotype data, analyzed the gene-disease associations of different phenotypes, and completed the selection of disease gene loci under multi-dimensional phenotypes. Then, this paper fused the multi-source data containing the gene expression data, gene sequence data, gene interaction data, and transcriptome sequencing data, and established the corresponding gene-disease association prediction model. At last, the effectiveness of the constructed prediction model was verified by experimental results. The research results obtained in this paper can improve the low utilization of gene datasets, restored the main features of the datasets to the greatest extent, reasonably processed the data noise, effectively enhanced the robustness of the model, and further improved the classification accuracy of the prediction of disease-causing genes. © 2021. by the authors. All Rights Reserved.
引用
收藏
页码:19 / 36
页数:17
相关论文
共 27 条
[1]  
Al-Aamri A., Taha K., Al-Hammadi Y., Maalouf M., Homouz D., Analyzing a Co-occurrence Gene-interaction Network to Identify Disease-gene Association, BMC Bioinformatics, 20, 1, pp. 1-15, (2019)
[2]  
Chen X., Huang Q., Wang Y., Li J., Liu H., Xie Y., Li Z., A Deep Learning Approach to Identify Association of Disease-gene Using Information of Disease Symptoms and Protein Sequences, Analytical Methods, 12, 15, pp. 2016-2026, (2020)
[3]  
Frasca M., Fontaine J. F., Valentini G., Mesiti M., Notaro M., Malchiodi D., Andrade-Navarro M. A., Disease-genes Must Guide Data Source Integration in the Gene Prioritization Process, International Meeting on Computational Intelligence Methods for Bioinformatics and Biostatistics, pp. 60-69, (2017)
[4]  
Grewal N., Singh S., Chand T., Effect of Aggregation Operators on Network-based Disease Gene Prioritization: A Case Study on Blood Disorders, IEEE/ACM Transactions on Computational Biology and Bioinformatics, 14, 6, pp. 1276-1287, (2016)
[5]  
He M., Huang C., Liu B., Wang Y., Li J., Factor Graph-aggregated Heterogeneous Network Embedding for Disease-gene Association Prediction, BMC Bioinformatics, 22, 1, pp. 1-15, (2021)
[6]  
Hocker J. D., Poirion O. B., Zhu F., Buchanan J., Zhang K., Chiou J., Preissl S., Cardiac Cell Type-specific Gene Regulatory Programs and Disease Risk Association, Science Advances, 7, 20, (2021)
[7]  
Jayanthi K., Mahesh C., Arthi A., Rajendran K. T., Vijayalakshmi B., Shanker N. R., Early Detection of Pediatric Cardiomyopathy Disease Using Window Based Correlation Method from Gene Micro Array Data, Journal of Sensors, 2021, (2021)
[8]  
Jiang X., Zhao J., Qian W., Song W., Lin G. N., A Generative Adversarial Network Model for Disease Gene Prediction with RNA-Seq Data, IEEE Access, 8, pp. 37352-37360, (2020)
[9]  
Kambakam S., Ngaki M. N., Sahu B. B., Kandel D. R., Singh P., Sumit R., Bhattacharyya M. K., Arabidopsis Non-host Resistance Pss30 Gene Enhances Broad-spectrum Disease Resistance in the Soybean Cultivar Williams 82, The Plant Journal, 107, 5, pp. 1432-1446, (2021)
[10]  
Kawichai T., Suratanee A., Plaimas K., Meta-path Based Gene Ontology Profiles for Predicting Drug-disease Associations, IEEE Access, 9, pp. 41809-41820, (2021)