Identification and verification of four candidate biomarkers for early diagnosis of osteoarthritis by machine learning

被引:4
作者
Wang, Xinyu [1 ,2 ]
Liu, Tianyi [1 ,3 ,4 ]
Sheng, Yueyang [1 ]
Zhang, Yanzhuo [1 ]
Qiu, Cheng [5 ]
Li, Manyu [6 ]
Cheng, Yuxi [7 ,8 ]
Li, Shan [1 ]
Wang, Ying [1 ]
Wu, Chengai [1 ]
机构
[1] Capital Med Univ, Beijing Jishuitan Hosp, Beijing Res Inst Traumatol & Orthopaed, Natl Ctr Orthopaed,Dept Mol Orthopaed, Beijing 100035, Peoples R China
[2] Capital Med Univ, Beijing Jishuitan Hosp, Natl Ctr Orthopaed, Dept Anesthesiol, Beijing 100035, Peoples R China
[3] Chinese Acad Med Sci & Peking Union Med Coll, Canc Hosp, Natl Canc Ctr, Natl Clin Res Ctr Canc,Dept Med Oncol, Beijing 100021, Peoples R China
[4] Chinese Acad Med Sci & Peking Union Med Coll, Canc Hosp, Natl Canc Ctr, Natl Clin Res Ctr Canc,Dept Hepatobiliary Surg, Beijing 100021, Peoples R China
[5] Shandong Univ, Qilu Hosp, Cheeloo Coll Med, Dept Orthopaed Surg, Jinan 250012, Shandong, Peoples R China
[6] Shandong Univ, Qilu Hosp, Dept Gastroenterol, Jinan 250012, Shandong, Peoples R China
[7] Cent South Univ, Xiangya Stomatol Hosp, Changsha 410008, Hunan, Peoples R China
[8] Cent South Univ, Xiangya Sch Stomatol, Changsha 410008, Hunan, Peoples R China
基金
北京市自然科学基金;
关键词
Osteoarthritis; Early diagnosis; Chondrocyte; Biomarker; Machine learning; B3GALNT1; GRB10; KLF9; SCRG1; ZNF423; GENE-EXPRESSION OMNIBUS; CLASSIFICATION; SYMPTOMS;
D O I
10.1016/j.heliyon.2024.e35121
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: Osteoarthritis (OA) is a common chronic joint disease. This study aimed to investigate possible OA diagnostic biomarkers and to verify their significance in clinical samples. Methods: We exploited three datasets from the Gene Expression Omnibus (GEO) database, serving as the training set. We first determined differentially expressed genes and screened candidate diagnostic biomarkers by applying three machine learning algorithms (Random Forest, Least Absolute Shrinkage and Selection Operator logistic regression, Support Vector Machine-Recursive Feature Elimination). Another GEO dataset was used as the validation set. The test set consisted of RNA-sequenced peripheral blood samples collected from patients and healthy donors. Blood samples and chondrocytes were collected for quantitative real-time PCR to confirm expression levels. Receiver operating characteristic curves were generated for individual and combined biomarkers. Results: In total, 251 DEGs were screened, where B3GALNT1, SCRG1 and ZNF423 were screened by all three algorithms. The area under the curve (AUC) of various biomarkers in our test set did not reach as high as that in public datasets. GRB10 exhibited highest AUC of 0.947 in the training set but 0.691 in our test set, while the favorable combined model comprising B3GALNT1, GRB10, KLF9 and SCRG1 demonstrated an AUC of 0.986 in the training set, 1.000 in the validation set and 0.836 in our test set. Conclusion: We identified a combined model for early diagnosis of OA that includes B3GALNT1, GRB10, KLF9 and SCRG1. This finding offers new avenues for further exploration of mechanisms underlying OA.
引用
收藏
页数:17
相关论文
共 34 条
[1]   DEVELOPMENT OF CRITERIA FOR THE CLASSIFICATION AND REPORTING OF OSTEOARTHRITIS - CLASSIFICATION OF OSTEOARTHRITIS OF THE KNEE [J].
ALTMAN, R ;
ASCH, E ;
BLOCH, D ;
BOLE, G ;
BORENSTEIN, D ;
BRANDT, K ;
CHRISTY, W ;
COOKE, TD ;
GREENWALD, R ;
HOCHBERG, M ;
HOWELL, D ;
KAPLAN, D ;
KOOPMAN, W ;
LONGLEY, S ;
MANKIN, H ;
MCSHANE, DJ ;
MEDSGER, T ;
MEENAN, R ;
MIKKELSEN, W ;
MOSKOWITZ, R ;
MURPHY, W ;
ROTHSCHILD, B ;
SEGAL, M ;
SOKOLOFF, L ;
WOLFE, F .
ARTHRITIS AND RHEUMATISM, 1986, 29 (08) :1039-1049
[2]   Deciphering osteoarthritis genetics across 826,690 individuals from 9 populations [J].
Boer, Cindy G. ;
Hatzikotoulas, Konstantinos ;
Southam, Lorraine ;
Stefansdottir, Lilja ;
Zhang, Yanfei ;
de Almeida, Rodrigo Coutinho ;
Wu, Tian T. ;
Zheng, Jie ;
Hartley, April ;
Teder-Laving, Maris ;
Skogholt, Anne Heidi ;
Terao, Chikashi ;
Zengini, Eleni ;
Alexiadis, George ;
Barysenka, Andrei ;
Bjornsdottir, Gyda ;
Gabrielsen, Maiken E. ;
Gilly, Arthur ;
Ingvarsson, Thorvaldur ;
Johnsen, Marianne B. ;
Jonsson, Helgi ;
Kloppenburg, Margreet ;
Luetge, Almut ;
Lund, Sigrun H. ;
Magi, Reedik ;
Mangino, Massimo ;
Nelissen, Rob R. G. H. H. ;
Shivakumar, Manu ;
Steinberg, Julia ;
Takuwa, Hiroshi ;
Thomas, Laurent F. ;
Tuerlings, Margo ;
Babis, George C. ;
Cheung, Jason Pui Yin ;
Kang, Jae Hee ;
Kraft, Peter ;
Lietman, Steven A. ;
Samartzis, Dino ;
Slagboom, P. Eline ;
Stefansson, Kari ;
Thorsteinsdottir, Unnur ;
Tobias, Jonathan H. ;
Uitterlinden, Andre G. ;
Winsvold, Bendik ;
Zwart, John-Anker ;
Smith, George Davey ;
Sham, Pak Chung ;
Thorleifsson, Gudmar ;
Gaunt, Tom R. ;
Morris, Andrew P. .
CELL, 2021, 184 (18) :4784-+
[3]   Functional Tissue Analysis Reveals Successful Cryopreservation of Human Osteoarthritic Synovium [J].
Broeren, Mathijs G. A. ;
de Vries, Marieke ;
Bennink, Miranda B. ;
van Lent, Peter L. E. M. ;
van der Kraan, Peter M. ;
Koenders, Marije I. ;
Thurlings, Rogier M. ;
van de Loo, Fons A. J. .
PLOS ONE, 2016, 11 (11)
[4]   Global estimates of the need for rehabilitation based on the Global Burden of Disease study 2019: a systematic analysis for the Global Burden of Disease Study 2019 [J].
Cieza, Alarcos ;
Causey, Kate ;
Kamenov, Kaloyan ;
Hanson, Sarah Wulf ;
Chatterji, Somnath ;
Vos, Theo .
LANCET, 2020, 396 (10267) :2006-2017
[5]   GRB10 and E2F3 as Diagnostic Markers of Osteoarthritis and Their Correlation with Immune Infiltration [J].
Deng, Ya-Jun ;
Ren, En-Hui ;
Yuan, Wen-Hua ;
Zhang, Guang-Zhi ;
Wu, Zuo-Long ;
Xie, Qi-Qi .
DIAGNOSTICS, 2020, 10 (03)
[6]  
Dieppe PA, 2004, J RHEUMATOL, V31, P50
[7]   A comprehensive evaluation of normalization methods for Illumina high-throughput RNA sequencing data analysis [J].
Dillies, Marie-Agnes ;
Rau, Andrea ;
Aubert, Julie ;
Hennequet-Antier, Christelle ;
Jeanmougin, Marine ;
Servant, Nicolas ;
Keime, Celine ;
Marot, Guillemette ;
Castel, David ;
Estelle, Jordi ;
Guernec, Gregory ;
Jagla, Bernd ;
Jouneau, Luc ;
Laloe, Denis ;
Le Gall, Caroline ;
Schaeffer, Brigitte ;
Le Crom, Stephane ;
Guedj, Mickael ;
Jaffrezic, Florence .
BRIEFINGS IN BIOINFORMATICS, 2013, 14 (06) :671-683
[8]   Gene Expression Omnibus: NCBI gene expression and hybridization array data repository [J].
Edgar, R ;
Domrachev, M ;
Lash, AE .
NUCLEIC ACIDS RESEARCH, 2002, 30 (01) :207-210
[9]   Osteoarthritis, part of life or a curable disease? A bird's-eye view [J].
Englund, Martin .
JOURNAL OF INTERNAL MEDICINE, 2023, 293 (06) :681-693
[10]   Identification of transcription factors responsible for dysregulated networks in human osteoarthritis cartilage by global gene expression analysis [J].
Fisch, K. M. ;
Gamini, R. ;
Alvarez-Garcia, O. ;
Akagi, R. ;
Saito, M. ;
Muramatsu, Y. ;
Sasho, T. ;
Koziol, J. A. ;
Su, A., I ;
Lotz, M. K. .
OSTEOARTHRITIS AND CARTILAGE, 2018, 26 (11) :1531-1538