An integrated study fusing systems biology and machine learning algorithms for genome-based discrimination of IPF and NSIP diseases: a new approach to the diagnostic challenge

被引:0
作者
Elham Amjad
Solmaz Asnaashari
Siavoush Dastmalchi
Babak Sokouti
机构
[1] Tabriz University of Medical Sciences,Biotechnology Research Center
[2] Tabriz University of Medical Sciences,School of Pharmacy
来源
Soft Computing | 2024年 / 28卷
关键词
Machine learning algorithm; Systems biology; Biomarkers; Idiopathic pulmonary fibrosis; Nonspecific interstitial pneumonia; Diagnosis;
D O I
暂无
中图分类号
学科分类号
摘要
Idiopathic pulmonary fibrosis (IPF) and nonspecific interstitial pneumonia (NSIP) are the two types of idiopathic interstitial pneumonia that are most prevalent. IPF and NSIP, often known as chronic interstitial pneumonia, must be differentiated from other forms of idiopathic interstitial pneumonia. However, distinguishing IPF from NSIP on radiographic imaging is challenging. Our goal in this work is to propose a novel approach to this clinical diagnostic challenge by distinguishing IPF from NSIP and healthy individuals via a complete systems biology analysis of existing microarray datasets. The Gene Expression Omnibus (GEO) database was searched, and two microarray datasets were identified. These datasets included normal, IPF, and NSIP samples. A second dataset was retrieved to validate further the built prediction models trained on the first dataset. Following the completion of the stages for data preparation and normalization, the profiles of gene expression were analyzed to determine the differentially expressed genes (DEGs). After that, we constructed module analysis and identified possible biomarkers by leveraging the prioritized and statistically significant DEGs to construct protein–protein interaction networks. The DEGs with the most important priority were also utilized to determine the implicated Kyoto Encyclopedia of Genes and Genomes (KEGG) signaling pathways and gene ontology (GO) enrichment analyses. Using the Kaplan–Meier approach, we performed three separate assessments of the gene biomarkers' effect on patients' chances of survival. In addition, the found genes were validated not just through several different categorization models, but also by analyzing the published experimental work on the target genes. A total of 32 distinct genes were found when comparing IPF to normal, NSIP to normal, and IPF to NSIP. This was accomplished by identifying seven (14 genes), six (7 genes), and eight (13 genes) modules, as well as three genes (i.e., C6, C5, STAT1). Results from GO analysis and the KEGG pathway evaluation showed evidence for biological processes, cellular components, and molecular activities. When considering the overall survival (OS), fast progression (FP), and post-progression survival (PPS) rates, the Kaplan–Meier analysis demonstrated that 27 out of 32, 16 out of 32, and 13 out of 32 genes were significant. Additionally, the identified biomarkers show high performance for the machine learning classification models. In addition, the scientific literature findings have validated each gene biomarker discovered for IPF, NSIP, and other lung-related conditions. The 32-mRNA signature shows promise as a gene set for IPF and NSIP and as a driver for treatments with the ability to predict and manage patients' survival rates accurately.
引用
收藏
页码:5721 / 5749
页数:28
相关论文
共 360 条
[1]  
Adams J(1993)Extracellular matrix: the thrombospondin family Curr Biol 3 188-190
[2]  
Lawler J(2018)Quantitative proteomic characterization of the lung extracellular matrix in chronic obstructive pulmonary disease and idiopathic pulmonary fibrosis J Proteom 189 23-33
[3]  
Åhrman E(2015)Versican in inflammation and tissue remodeling: the impact on lung disorders Glycobiology 25 243-251
[4]  
Andersson-Sjöland A(2023)Diagnostic value of connective tissue disease related CT signs in usual interstitial pneumonia pattern of interstitial lung disease Indian J Radiol Imaging 33 70-75
[5]  
Augustine A(2008)Genomic markers for malignant progression in pulmonary adenocarcinoma with bronchioloalveolar features Proc Natl Acad Sci USA 105 10155-10160
[6]  
Robinson Vimala L(2015)Ribonucleotide reductase and cancer: biological mechanisms and targeted therapies Oncogene 34 2011-2021
[7]  
Irodi A(2018)A potential risk factor of essential hypertension in case-control study: circular RNA hsa_circ_0037911 Biochem Biophys Res Commun 498 789-794
[8]  
Mathew J(2014)A novel genomic signature with translational significance for human idiopathic pulmonary fibrosis Am J Respir Cell Mol Biol 52 217-231
[9]  
Pakkal M(2005)FasL (CD95L/APO-1L) resistance of neurons mediated by phosphatidylinositol 3-kinase-Akt/protein kinase B-dependent expression of lifeguard/neuronal membrane protein 35 J Neurosci 25 6765-6774
[10]  
Aviel-Ronen S(2019)Calgranulin B and KL-6 in bronchoalveolar lavage of patients with IPF and NSIP Inflammation 42 463-470