Use of LDA combined with PLS for classification of lung cancer gene expression data

被引:0
|
作者
Ghadiri, Keyghobad [1 ]
Rezaei, Mansour [2 ]
Tabatabaei, Seyyed Mohammad [3 ]
Shahsavari, Meisam [4 ]
Shahsavari, Soodeh [5 ]
机构
[1] Kermanshah Univ Med Sci, Nosocomial Infect Res Ctr, Kermanshah, Iran
[2] Kermanshah Univ Med Sci, Dept Biostat & Epidemiol, Fac Publ Hlth, Kermanshah, Iran
[3] Shahid Beheshti Univ Med Sci, Fac Paramed Sci, Dept Med Informat, Tehran, Iran
[4] Shahid Beheshti Univ Med Sci, Dept Nursing, Tehran, Iran
[5] Kermanshah Univ Med Sci, Fac Paramed Sci, Hlth Informat Technol Dept, Kermanshah, Iran
来源
INTERNATIONAL JOURNAL OF MEDICAL RESEARCH & HEALTH SCIENCES | 2016年 / 5卷 / 09期
关键词
lung cancer; gene expression data; linear discriminant analysis; partial least squares; classification;
D O I
暂无
中图分类号
R-3 [医学研究方法]; R3 [基础医学];
学科分类号
1001 ;
摘要
Reliable and precise classification is essential for successful diagnosis and treatment of cancer. Thus, improvements in cancer classification are increasingly sought. Linear discriminant analysis (LDA) is the most effective method of cancer classification in high-dimensional prediction, but there are drawbacks to tumor classification by a formal method such as LDA. We propose a method for lung cancer gene microarray classification that combines a feature reduction approach, partial least squares (PLS), and discriminate method, LDA, for improving classification performance. The real dataset used related to lung cancer gene expression. After bioinformatics data preprocessing, data reduction and feature selection were carried out using PLS and then LDA was used for classification. The results were validated using the accuracy index and gene ontology analysis. Of the total of more than 50,000 genes, 214 genes were shown to have relevance. The classification accuracy of this method was 94.5% and gene ontology analysis results were good. It can be said that the LDA classifier combined with PLS is powerful method. This method can identify gene relationships warranting further biological investigation.
引用
收藏
页码:500 / 506
页数:7
相关论文
共 50 条
  • [1] CANCER CLASSIFICATION FROM THE GENE EXPRESSION PROFILES BY DISCRIMINANT KERNEL-PLS
    Tang, Kai-Lin
    Yao, Wei-Jia
    Li, Tong-Hua
    Li, Yi-Xue
    Cao, Zhi-Wei
    JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2010, 8 : 147 - 160
  • [2] Lung cancer classification based on enhanced deep learning using gene expression data
    Yuvaraj V.
    Maheswari D.
    Measurement: Sensors, 2023, 30
  • [3] Classification of lung cancer severity using gene expression data based on deep learning
    Ali Bou Nassif
    Nour Ayman Abujabal
    Aya Alchikh Omar
    BMC Medical Informatics and Decision Making, 25 (1)
  • [4] Feature Selection and Classification in gene expression cancer data
    Pavithra, D.
    Lakshmanan, B.
    2017 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN DATA SCIENCE (ICCIDS), 2017,
  • [5] A hybrid LDA and genetic algorithm for gene selection and classification of microarray data
    Bonilla Huerta, Edmundo
    Duval, Beatrice
    Hao, Jin-Kao
    NEUROCOMPUTING, 2010, 73 (13-15) : 2375 - 2383
  • [6] Cancer classification using gene expression data
    Lu, Y
    Han, JW
    INFORMATION SYSTEMS, 2003, 28 (04) : 243 - 268
  • [7] Selecting significant genes by randomization test for cancer classification using gene expression data
    Mao, Zhiyi
    Cai, Wensheng
    Shao, Xueguang
    JOURNAL OF BIOMEDICAL INFORMATICS, 2013, 46 (04) : 594 - 601
  • [8] Analysis of complexity indices for classification problems: Cancer gene expression data
    Lorena, Ana C.
    Costa, Ivan G.
    Spolaor, Newton
    de Souto, Marcilio C. P.
    NEUROCOMPUTING, 2012, 75 (01) : 33 - 42
  • [9] A genetic filter for cancer classification on gene expression data
    Kim, Yong-Hyuk
    Yoon, Yourim
    BIO-MEDICAL MATERIALS AND ENGINEERING, 2015, 26 : S1993 - S2002
  • [10] Classification of Cancer Types based on Gene Expression Data
    He, Yinchao
    Bockmon, Ryan
    Modey, Miracle
    Roscoe, Sarah
    2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 2175 - 2182