Development and validation of machine learning-based risk prediction models of oral squamous cell carcinoma using salivary autoantibody biomarkers

被引:7
作者
Tseng, Yi-Ju [1 ,2 ]
Wang, Yi-Cheng [3 ]
Hsueh, Pei-Chun [4 ,5 ]
Wu, Chih-Ching [6 ,7 ,8 ,9 ,10 ]
机构
[1] Natl Yang Ming Chiao Tung Univ, Dept Comp Sci, Hsinchu, Taiwan
[2] Boston Childrens Hosp, Computat Hlth Informat Program, Boston, MA USA
[3] Chang Gung Univ, Dept Informat Management, Taoyuan, Taiwan
[4] Univ Lausanne, Dept Fundamental Oncol, Lausanne, Switzerland
[5] Univ Lausanne, Ludwig Inst Canc Res, Epalinges, Switzerland
[6] Chang Gung Univ, Grad Inst Biomed Sci, Taoyuan, Taiwan
[7] Chang Gung Univ, Coll Med, Dept Med Biotechnol & Lab Sci, 259,Wenhua 1st Rd, Taoyuan 33302, Taiwan
[8] Chang Gung Mem Hosp, Dept Otolaryngol Head & Neck Surg, Taoyuan, Taiwan
[9] Chang Gung Univ, Mol Med Res Ctr, Taoyuan, Taiwan
[10] Chang Gung Univ, Res Ctr Emerging Viral Infect, Coll Med, Taoyuan, Taiwan
关键词
Oral cavity squamous cell carcinoma; Autoantibodies; Biomarker; Machine learning; POTENTIALLY MALIGNANT DISORDERS; PROGNOSTIC-SIGNIFICANCE; DIAGNOSTIC MARKERS; CANCER; EPIDEMIOLOGY; EXPRESSION; OUTCOMES; MUCOSA; HEAD;
D O I
10.1186/s12903-022-02607-2
中图分类号
R78 [口腔科学];
学科分类号
1003 ;
摘要
Introduction The incidence of oral cavity squamous cell carcinoma (OSCC) continues to rise. OSCC is associated with a low average survival rate, and most patients have a poor disease prognosis because of delayed diagnosis. We used machine learning techniques to predict high-risk cases of OSCC by using salivary autoantibody levels and demographic and behavioral data. Methods We collected the salivary samples of patients recruited from a teaching hospital between September 2008 and December 2012. Ten salivary autoantibodies, sex, age, smoking, alcohol consumption, and betel nut chewing were used to build prediction models for identifying patients with a high risk of OSCC. The machine learning algorithms applied in the study were logistic regression, random forest, support vector machine with the radial basis function kernel, eXtreme Gradient Boosting (XGBoost), and a stacking model. We evaluated the performance of the models by using the area under the receiver operating characteristic curve (AUC), with simulations conducted 100 times. Results A total of 337 participants were enrolled in this study. The best predictive model was constructed using a stacking algorithm with original forms of age and logarithmic levels of autoantibodies (AUC = 0.795 +/- 0.055). Adding autoantibody levels as a data source significantly improved the prediction capability (from 0.698 +/- 0.06 to 0.795 +/- 0.055, p < 0.001). Conclusions We successfully established a prediction model for high-risk cases of OSCC. This model can be applied clinically through an online calculator to provide additional personalized information for OSCC diagnosis, thereby reducing the disease morbidity and mortality rates.
引用
收藏
页数:10
相关论文
共 61 条
  • [1] Machine learning in oral squamous cell carcinoma: Current status, clinical concerns and prospects for future-A systematic review
    Alabi, Rasheed Omobolaji
    Youssef, Omar
    Pirinen, Matti
    Elmusrati, Mohammed
    Makitie, Antti A.
    Leivo, Ilmo
    Almangush, Alhadi
    [J]. ARTIFICIAL INTELLIGENCE IN MEDICINE, 2021, 115
  • [2] Protein Microarray Signature of Autoantibody Biomarkers for the Early Detection of Breast Cancer
    Anderson, Karen S.
    Sibani, Sahar
    Wallstrom, Garrick
    Qiu, Ji
    Mendoza, Eliseo A.
    Raphael, Jacob
    Hainsworth, Eugenie
    Montor, Wagner R.
    Wong, Jessica
    Park, Jin G.
    Lokko, Naa
    Logvinenko, Tanya
    Ramachandran, Niroshan
    Godwin, Andrew K.
    Marks, Jeffrey
    Engstrom, Paul
    LaBaer, Joshua
    [J]. JOURNAL OF PROTEOME RESEARCH, 2011, 10 (01) : 85 - 96
  • [3] [Anonymous], 2018, Taiwan Cancer Registry Annual Report
  • [4] Biecek PL, 2018, J MACH LEARN RES, V19
  • [5] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [6] Artificial Intelligence for the Otolaryngologist: A State of the Art Review
    Bur, Andres M.
    Shew, Matthew
    New, Jacob
    [J]. OTOLARYNGOLOGY-HEAD AND NECK SURGERY, 2019, 160 (04) : 603 - 611
  • [7] The need to separate the wheat from the chaff in medical informatics Introducing a comprehensive checklist for the (self)-assessment of medical AI studies
    Cabitza, Federico
    Campagner, Andrea
    [J]. INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2021, 153
  • [8] Verification of Saliva Matrix Metalloproteinase-1 as a Strong Diagnostic Marker of Oral Cavity Cancer
    Chang, Ya-Ting
    Chu, Lichieh Julie
    Liu, Yen-Chun
    Chen, Chih-Jou
    Wu, Shu-Fang
    Chen, Chien-Hua
    Chang, Ian Yi-Feng
    Wang, Jun-Sheng
    Wu, Tzong-Yuan
    Dash, Srinivas
    Chiang, Wei-Fan
    Chiu, Sheng-Fu
    Gou, Shin-Bin
    Chien, Chih-Yen
    Chang, Kai-Ping
    Yu, Jau-Song
    [J]. CANCERS, 2020, 12 (08) : 1 - 18
  • [9] Immunobiomarkers in Small Cell Lung Cancer: Potential Early Cancer Signals
    Chapman, Caroline J.
    Thorpe, Alison J.
    Murray, Andrea
    Parsy-Kowalska, Celine B.
    Allen, Jared
    Stafford, Kelly M.
    Chauhan, Alok S.
    Kite, Thomas A.
    Maddison, Paul
    Robertson, John F. R.
    [J]. CLINICAL CANCER RESEARCH, 2011, 17 (06) : 1474 - 1480
  • [10] XGBoost: A Scalable Tree Boosting System
    Chen, Tianqi
    Guestrin, Carlos
    [J]. KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 785 - 794