Development and validation of machine learning-based risk prediction models of oral squamous cell carcinoma using salivary autoantibody biomarkers

被引：7

作者：

Tseng, Yi-Ju ^{[1
,2
]}

Wang, Yi-Cheng ^{[3
]}

Hsueh, Pei-Chun ^{[4
,5
]}

Wu, Chih-Ching ^{[6
,7
,8
,9
,10
]}

机构：

[1] Natl Yang Ming Chiao Tung Univ, Dept Comp Sci, Hsinchu, Taiwan

[2] Boston Childrens Hosp, Computat Hlth Informat Program, Boston, MA USA

[3] Chang Gung Univ, Dept Informat Management, Taoyuan, Taiwan

[4] Univ Lausanne, Dept Fundamental Oncol, Lausanne, Switzerland

[5] Univ Lausanne, Ludwig Inst Canc Res, Epalinges, Switzerland

[6] Chang Gung Univ, Grad Inst Biomed Sci, Taoyuan, Taiwan

[7] Chang Gung Univ, Coll Med, Dept Med Biotechnol & Lab Sci, 259,Wenhua 1st Rd, Taoyuan 33302, Taiwan

[8] Chang Gung Mem Hosp, Dept Otolaryngol Head & Neck Surg, Taoyuan, Taiwan

[9] Chang Gung Univ, Mol Med Res Ctr, Taoyuan, Taiwan

[10] Chang Gung Univ, Res Ctr Emerging Viral Infect, Coll Med, Taoyuan, Taiwan

来源：

BMC ORAL HEALTH | 2022年 / 22卷 / 01期

关键词：

Oral cavity squamous cell carcinoma; Autoantibodies; Biomarker; Machine learning; POTENTIALLY MALIGNANT DISORDERS; PROGNOSTIC-SIGNIFICANCE; DIAGNOSTIC MARKERS; CANCER; EPIDEMIOLOGY; EXPRESSION; OUTCOMES; MUCOSA; HEAD;

D O I：

10.1186/s12903-022-02607-2

中图分类号：

R78 [口腔科学];

学科分类号：

1003 ;

摘要：

Introduction The incidence of oral cavity squamous cell carcinoma (OSCC) continues to rise. OSCC is associated with a low average survival rate, and most patients have a poor disease prognosis because of delayed diagnosis. We used machine learning techniques to predict high-risk cases of OSCC by using salivary autoantibody levels and demographic and behavioral data. Methods We collected the salivary samples of patients recruited from a teaching hospital between September 2008 and December 2012. Ten salivary autoantibodies, sex, age, smoking, alcohol consumption, and betel nut chewing were used to build prediction models for identifying patients with a high risk of OSCC. The machine learning algorithms applied in the study were logistic regression, random forest, support vector machine with the radial basis function kernel, eXtreme Gradient Boosting (XGBoost), and a stacking model. We evaluated the performance of the models by using the area under the receiver operating characteristic curve (AUC), with simulations conducted 100 times. Results A total of 337 participants were enrolled in this study. The best predictive model was constructed using a stacking algorithm with original forms of age and logarithmic levels of autoantibodies (AUC = 0.795 +/- 0.055). Adding autoantibody levels as a data source significantly improved the prediction capability (from 0.698 +/- 0.06 to 0.795 +/- 0.055, p < 0.001). Conclusions We successfully established a prediction model for high-risk cases of OSCC. This model can be applied clinically through an online calculator to provide additional personalized information for OSCC diagnosis, thereby reducing the disease morbidity and mortality rates.

引用

页数：10

共 61 条

[1] Machine learning in oral squamous cell carcinoma: Current status, clinical concerns and prospects for future-A systematic review
Alabi, Rasheed Omobolaji
Youssef, Omar
Pirinen, Matti
Elmusrati, Mohammed
Makitie, Antti A.
Leivo, Ilmo
Almangush, Alhadi
[J]. ARTIFICIAL INTELLIGENCE IN MEDICINE, 2021, 115
[2] Protein Microarray Signature of Autoantibody Biomarkers for the Early Detection of Breast Cancer
Anderson, Karen S.
Sibani, Sahar
Wallstrom, Garrick
Qiu, Ji
Mendoza, Eliseo A.
Raphael, Jacob
Hainsworth, Eugenie
Montor, Wagner R.
Wong, Jessica
Park, Jin G.
Lokko, Naa
Logvinenko, Tanya
Ramachandran, Niroshan
Godwin, Andrew K.
Marks, Jeffrey
Engstrom, Paul
LaBaer, Joshua
[J]. JOURNAL OF PROTEOME RESEARCH, 2011, 10 (01) : 85 - 96
[3] [Anonymous], 2018, Taiwan Cancer Registry Annual Report
[4] Biecek PL, 2018, J MACH LEARN RES, V19
[5] Random forests
Breiman, L
[J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
[6] Artificial Intelligence for the Otolaryngologist: A State of the Art Review
Bur, Andres M.
Shew, Matthew
New, Jacob
[J]. OTOLARYNGOLOGY-HEAD AND NECK SURGERY, 2019, 160 (04) : 603 - 611
[7] The need to separate the wheat from the chaff in medical informatics Introducing a comprehensive checklist for the (self)-assessment of medical AI studies
Cabitza, Federico
Campagner, Andrea
[J]. INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2021, 153
[8] Verification of Saliva Matrix Metalloproteinase-1 as a Strong Diagnostic Marker of Oral Cavity Cancer
Chang, Ya-Ting
Chu, Lichieh Julie
Liu, Yen-Chun
Chen, Chih-Jou
Wu, Shu-Fang
Chen, Chien-Hua
Chang, Ian Yi-Feng
Wang, Jun-Sheng
Wu, Tzong-Yuan
Dash, Srinivas
Chiang, Wei-Fan
Chiu, Sheng-Fu
Gou, Shin-Bin
Chien, Chih-Yen
Chang, Kai-Ping
Yu, Jau-Song
[J]. CANCERS, 2020, 12 (08) : 1 - 18
[9] Immunobiomarkers in Small Cell Lung Cancer: Potential Early Cancer Signals
Chapman, Caroline J.
Thorpe, Alison J.
Murray, Andrea
Parsy-Kowalska, Celine B.
Allen, Jared
Stafford, Kelly M.
Chauhan, Alok S.
Kite, Thomas A.
Maddison, Paul
Robertson, John F. R.
[J]. CLINICAL CANCER RESEARCH, 2011, 17 (06) : 1474 - 1480
[10] XGBoost: A Scalable Tree Boosting System
Chen, Tianqi
Guestrin, Carlos
[J]. KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 785 - 794

← 1 2 3 4 5 6 7 →