Local-structure-preservation and redundancy-removal-based feature selection method and its application to the identification of biomarkers for schizophrenia

被引:0
|
作者
Xing, Ying [1 ]
Pearlson, Godfrey D. [2 ,3 ]
Kochunov, Peter [4 ,5 ]
Calhoun, Vince D. [6 ]
Du, Yuhui [1 ]
机构
[1] Shanxi Univ, Sch Comp & Informat Technol, Taiyuan, Peoples R China
[2] Yale Univ, Dept Psychiat, New Haven, CT USA
[3] Yale Univ, Dept Neurobiol, New Haven, CT USA
[4] Univ Maryland, Sch Med, Maryland Psychiat Res Ctr, Baltimore, MD USA
[5] Univ Maryland, Sch Med, Dept Psychiat, Baltimore, MD USA
[6] Emory Univ, Georgia State Univ, Georgia Inst Technol, Triinst Ctr Translat Res Neuroimaging & Data Sci, Atlanta, GA USA
基金
美国国家卫生研究院; 中国国家自然科学基金;
关键词
Regularization; Classification; Similarity; Redundancy; Mental disorder; Neuroimaging; CLASSIFIERS; STATE;
D O I
10.1016/j.neuroimage.2024.120839
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
Accurate diagnosis of mental disorders is expected to be achieved through the identification of reliable neuroimaging biomarkers with the help of cutting-edge feature selection techniques. However, existing feature selection methods often fall short in capturing the local structural characteristics among samples and effectively eliminating redundant features, resulting in inadequate performance in disorder prediction. To address this gap, we propose a novel supervised method named local-structure-preservation and redundancy-removal-based feature selection (LRFS), and then apply it to the identification of meaningful biomarkers for schizophrenia (SZ). LRFS method leverages graph-based regularization to preserve original sample similarity relationships during data transformation, thus retaining crucial local structure information. Additionally, it introduces redundancy-removal regularization based on interrelationships among features to exclude similar and redundant features from high-dimensional data. Moreover, LRFS method incorporates l2,1 sparse regularization that enables selecting a sparse and noise-robust feature subset. Experimental evaluations on eight public datasets with diverse properties demonstrate the superior performance of our method over nine popular feature selection methods in identifying discriminative features, with average classification accuracy gains ranging from 1.30 % to 9.11 %. Furthermore, the LRFS method demonstrates superior discriminability in four functional magnetic resonance imaging (fMRI) datasets from 708 healthy controls (HCs) and 537 SZ patients, with an average increase in classification accuracy ranging from 1.89 % to 9.24 % compared to other nine methods. Notably, our method reveals reproducible and significant changes in SZ patients relative to HCs across the four datasets, predominantly in the thalamus-related functional network connectivity, which exhibit a significant correlation with clinical symptoms. Convergence analysis, parameter sensitivity analysis, and ablation studies further demonstrate the effectiveness and robustness of our method. In short, our proposed feature selection method effectively identifies discriminative and reliable features that hold the potential to be biomarkers, paving the way for the elucidation of brain abnormalities and the advancement of precise diagnosis of mental disorders.
引用
收藏
页数:14
相关论文
共 33 条
  • [1] Joint local structure preservation and redundancy minimization for unsupervised feature selection
    Li, Hao
    Wang, Yongli
    Li, Yanchao
    Hu, Peng
    Zhao, Ruxin
    APPLIED INTELLIGENCE, 2020, 50 (12) : 4394 - 4411
  • [2] Joint local structure preservation and redundancy minimization for unsupervised feature selection
    Hao Li
    Yongli Wang
    Yanchao Li
    Peng Hu
    Ruxin Zhao
    Applied Intelligence, 2020, 50 : 4394 - 4411
  • [3] A Novel Neighborhood Rough Set-Based Feature Selection Method and Its Application to Biomarker Identification of Schizophrenia
    Xing, Ying
    Kochunov, Peter
    van Erp, Theo G. M.
    Ma, Tianzhou
    Calhoun, Vince D.
    Du, Yuhui
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2023, 27 (01) : 215 - 226
  • [4] Sparse low-redundancy multilabel feature selection based on dynamic local structure preservation and triple graphs exploration
    Yang, Yong
    Chen, Hongmei
    Mi, Yong
    Luo, Chuan
    Horng, Shi-Jinn
    Li, Tianrui
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 242
  • [5] Spectral Clustering-based Local and Global Structure Preservation for Feature Selection
    Zhou, Sihang
    Liu, Xinwang
    Zhu, Chengzhang
    Liu, Qiang
    Yin, Jianping
    PROCEEDINGS OF THE 2014 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2014, : 550 - 557
  • [6] Local Structure Preservation Based Discriminant Projection Method for Feature Reduction
    Imani, Maryam
    Ghassemian, Hassan
    2015 23RD IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2015, : 162 - 167
  • [7] Robust sparse and low-redundancy multi-label feature selection with dynamic local and global structure preservation
    Li, Yonghao
    Hu, Liang
    Gao, Wanfu
    PATTERN RECOGNITION, 2023, 134
  • [8] NONPARAMETRIC FEATURE-SELECTION METHOD BASED ON LOCAL INTERCLASS STRUCTURE
    ICHINO, M
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1981, 11 (04): : 289 - 296
  • [9] MiRFFS: a functional group-based feature selection method for the identification of microRNA biomarkers
    Yang, Yang
    Xiao, Yiqun
    Cao, Tianyu
    Kong, Wei
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2017, 18 (01) : 40 - 55
  • [10] Graph autoencoder-based unsupervised feature selection with broad and local data structure preservation
    Feng, Siwei
    Duarte, Marco F.
    NEUROCOMPUTING, 2018, 312 : 310 - 323