Support vector machine and principal component analysis for microarray data classification

被引:8
作者
Astuti, Widi [1 ]
Adiwijaya [1 ]
机构
[1] Telkom Univ, Sch Comp, Bandung, Indonesia
来源
INTERNATIONAL CONFERENCE ON DATA AND INFORMATION SCIENCE (ICODIS) | 2018年 / 971卷
关键词
D O I
10.1088/1742-6596/971/1/012003
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Cancer is a leading cause of death worldwide although a significant proportion of it can be cured if it is detected early. In recent decades, technology called microarray takes an important role in the diagnosis of cancer. By using data mining technique, microarray data classification can be performed to improve the accuracy of cancer diagnosis compared to traditional techniques. The characteristic of microarray data is small sample but it has huge dimension. Since that, there is a challenge for researcher to provide solutions for microarray data classification with high performance in both accuracy and running time. This research proposed the usage of Principal Component Analysis (PCA) as a dimension reduction method along with Support Vector Method (SVM) optimized by kernel functions as a classifier for microarray data classification. The proposed scheme was applied on seven data sets using 5 fold cross validation and then evaluation and analysis conducted on term of both accuracy and running time. The result showed that the scheme can obtained 100% accuracy for Ovarian and Lung Cancer data when Linear and Cubic kernel functions are used. In term of running time, PCA greatly reduced the running time for every data sets.
引用
收藏
页数:7
相关论文
共 9 条
[1]  
Adiwijaya Nurfalah A, 2016, FAR E J ELECT COMMUN, V16, P269
[2]   Gene selection in cancer classification using sparse logistic regression with Bayesian regularization [J].
Cawley, Gavin C. ;
Talbot, Nicola L. C. .
BIOINFORMATICS, 2006, 22 (19) :2348-2355
[3]   A novel feature extraction approach based on ensemble feature selection and modified discriminant independent component analysis for microarray data classification [J].
Mollaee, Maryam ;
Moattar, Mohammad Hossein .
BIOCYBERNETICS AND BIOMEDICAL ENGINEERING, 2016, 36 (03) :521-529
[4]  
Sahu B., 2012, PROCEDIA ENG
[5]  
Sarhan Ahmad M., 2009, J THEORETICAL APPL I
[6]   A simple and efficient algorithm for gene selection using sparse logistic regression [J].
Shevade, SK ;
Keerthi, SS .
BIOINFORMATICS, 2003, 19 (17) :2246-2253
[7]  
Tran B., 2014, LECT NOTES COMPUTER, V8886
[8]   Gene Expression Data Classification using Support Vector Machine and Mutual Information-based Gene Selection [J].
Vanitha, Devi Arockia C. ;
Devaraj, D. ;
Venkatesulu, M. .
GRAPH ALGORITHMS, HIGH PERFORMANCE IMPLEMENTATIONS AND ITS APPLICATIONS (ICGHIA 2014), 2015, 47 :13-21
[9]  
World Health Organization, Cancer Fact Sheet