Oral cancer prognosis based on clinicopathologic and genomic markers using a hybrid of feature selection and machine learning methods

被引:99
作者
Chang, Siow-Wee [1 ,2 ]
Abdul-Kareem, Sameem [2 ]
Merican, Amir Feisal [1 ]
Zain, Rosnah Binti [3 ]
机构
[1] Univ Malaya, Fac Sci, Inst Biol Sci, Kuala Lumpur, Malaysia
[2] Univ Malaya, Fac Comp Sci & Informat Technol, Dept Artificial Intelligence, Kuala Lumpur, Malaysia
[3] Univ Malaya, Fac Dent, OCRCC, Dept Oral Pathol & Oral Med & Periodontol, Kuala Lumpur, Malaysia
来源
BMC BIOINFORMATICS | 2013年 / 14卷
关键词
Oral cancer prognosis; Clinicopathologic; Genomic; Feature selection; Machine learning; SQUAMOUS-CELL CARCINOMA; DIFFERENTIAL EXPRESSION; SURVIVAL ANALYSIS; TUMOR THICKNESS; BREAST-CANCER; P63; PREDICTION; HEAD; P53; SAMPLE;
D O I
10.1186/1471-2105-14-170
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Machine learning techniques are becoming useful as an alternative approach to conventional medical diagnosis or prognosis as they are good for handling noisy and incomplete data, and significant results can be attained despite a small sample size. Traditionally, clinicians make prognostic decisions based on clinicopathologic markers. However, it is not easy for the most skilful clinician to come out with an accurate prognosis by using these markers alone. Thus, there is a need to use genomic markers to improve the accuracy of prognosis. The main aim of this research is to apply a hybrid of feature selection and machine learning methods in oral cancer prognosis based on the parameters of the correlation of clinicopathologic and genomic markers. Results: In the first stage of this research, five feature selection methods have been proposed and experimented on the oral cancer prognosis dataset. In the second stage, the model with the features selected from each feature selection methods are tested on the proposed classifiers. Four types of classifiers are chosen; these are namely, ANFIS, artificial neural network, support vector machine and logistic regression. A k-fold cross-validation is implemented on all types of classifiers due to the small sample size. The hybrid model of ReliefF-GA-ANFIS with 3- input features of drink, invasion and p63 achieved the best accuracy (accuracy = 93.81%; AUC = 0.90) for the oral cancer prognosis. Conclusions: The results revealed that the prognosis is superior with the presence of both clinicopathologic and genomic markers. The selected features can be investigated further to validate the potential of becoming as significant prognostic signature in the oral cancer studies.
引用
收藏
页数:15
相关论文
共 41 条
  • [1] Asakage T, 1998, CANCER, V82, P1443, DOI 10.1002/(SICI)1097-0142(19980415)82:8<1443::AID-CNCR2>3.0.CO
  • [2] 2-A
  • [3] Neuro-fuzzy modeling: An accurate and interpretable method for predicting bladder cancer progression
    Catto, JWF
    Abbod, MF
    Linkens, DA
    Hamdy, FC
    [J]. JOURNAL OF UROLOGY, 2006, 175 (02) : 474 - 479
  • [4] Differential expression of p53, p63 and p73 proteins in human buccal squamous-cell carcinomas
    Chen, YK
    Huse, SS
    Lin, LM
    [J]. CLINICAL OTOLARYNGOLOGY, 2003, 28 (05) : 451 - 455
  • [5] Chih-Jen L., 2011, ACM T INTELLIGENT SY, V2
  • [6] CHIHWEI H, 2010, TECHNICAL REPORT
  • [7] Differential expression of p53 gene family members p63 and p73 in head and neck squamous tumorigenesis
    Choi, HR
    Batsakis, JG
    Zhan, F
    Sturgis, E
    Luna, MA
    El-Naggar, AK
    [J]. HUMAN PATHOLOGY, 2002, 33 (02) : 158 - 164
  • [8] Cruz JA, 2006, CANCER INFORM, V2, P59
  • [9] Dom RM., 2008, Austral-Asian Journal of Cancer, V7, P209