iPcc: a novel feature extraction method for accurate disease class discovery and prediction

被引:16
|
作者
Ren, Xianwen [1 ,2 ]
Wang, Yong [3 ,4 ]
Zhang, Xiang-Sun [3 ,4 ]
Jin, Qi [1 ,2 ]
机构
[1] Chinese Acad Med Sci, MOH Key Lab Syst Biol Pathogens, Inst Pathogen Biol, Beijing 100730, Peoples R China
[2] Peking Union Med Coll, Beijing 100730, Peoples R China
[3] Chinese Acad Sci, Acad Math & Syst Sci, Beijing 100190, Peoples R China
[4] Chinese Acad Sci, Natl Ctr Math & Interdisciplinary Sci, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
GENE-EXPRESSION PROFILES; MICROARRAY ANALYSIS; FEATURE-SELECTION; CLASSIFICATION; CANCER; MODEL; ALGORITHM; CLUSTERS; SUBTYPES; RULES;
D O I
10.1093/nar/gkt343
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Gene expression profiling has gradually become a routine procedure for disease diagnosis and classification. In the past decade, many computational methods have been proposed, resulting in great improvements on various levels, including feature selection and algorithms for classification and clustering. In this study, we present iPcc, a novel method from the feature extraction perspective to further propel gene expression profiling technologies from bench to bedside. We define 'correlation feature space' for samples based on the gene expression profiles by iterative employment of Pearson's correlation coefficient. Numerical experiments on both simulated and real gene expression data sets demonstrate that iPcc can greatly highlight the latent patterns underlying noisy gene expression data and thus greatly improve the robustness and accuracy of the algorithms currently available for disease diagnosis and classification based on gene expression profiles.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Optimal and Novel Hybrid Feature Selector for Accurate Prediction of Heart Disease
    Amarnath, B.
    Balamurugan, S. A. A.
    JOURNAL OF SCIENTIFIC & INDUSTRIAL RESEARCH, 2017, 76 (11): : 720 - 724
  • [2] Prediction of Recombination Spots Using Novel Hybrid Feature Extraction Method via Deep Learning Approach
    Khan, Fatima
    Khan, Mukhtaj
    Iqbal, Nadeem
    Khan, Salman
    Muhammad Khan, Dost
    Khan, Abbas
    Wei, Dong-Qing
    FRONTIERS IN GENETICS, 2020, 11
  • [3] A novel class dependent feature selection method for cancer biomarker discovery
    Zhou, Wengang
    Dickerson, Julie A.
    COMPUTERS IN BIOLOGY AND MEDICINE, 2014, 47 : 66 - 75
  • [4] A novel feature selection method to predict protein structural class
    Yuan, Mingshun
    Yang, Zijiang
    Huang, Guangzao
    Ji, Guoli
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2018, 76 : 118 - 129
  • [5] A Combination of Feature Extraction Methods with an Ensemble of Different Classifiers for Protein Structural Class Prediction Problem
    Dehzangi, Abdollah
    Paliwal, Kuldip
    Sharma, Alok
    Dehzangi, Omid
    Sattar, Abdul
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2013, 10 (03) : 564 - 575
  • [6] A novel feature extraction method using chemosensory EEG for Parkinson's disease classification
    Gulay, Begum Kara
    Demirel, Neslihan
    Vahaplar, Alper
    Guducu, Cagdas
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 79
  • [7] A Novel Feature Extraction Method for Mechanical Part Recognition
    Cao, Su-Qun
    Yang, Ge-Lan
    Zhu, Quan-Yin
    Zhai, Hai-Fei
    COMPUTER-AIDED DESIGN, MANUFACTURING, MODELING AND SIMULATION, PTS 1-2, 2011, 88-89 : 116 - +
  • [8] A NOVEL FEATURE EXTRACTION METHOD FOR THE CLASSIFICATION OF SAR IMAGES
    Aytekin, Orsan
    Koc, Mehmet
    Ulusoy, Ilkay
    2012 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2012, : 3482 - 3485
  • [9] Enhanced Evolutionary Feature Selection and Ensemble Method for Cardiovascular Disease Prediction
    Jothi Prakash, V.
    Karthikeyan, N. K.
    INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES, 2021, 13 (03) : 389 - 412
  • [10] Federated learning-based disease prediction: A fusion approach with feature selection and extraction
    Kapila, Ramdas
    Saleti, Sumalatha
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 100