A Fast Hybrid Feature Selection Method Based on Dynamic Clustering and Improved Particle Swarm Optimization for High-Dimensional Health Care Data

被引:3
作者
Kang, Yan [1 ]
Peng, Luhan [1 ]
Guo, Jing [1 ]
Lu, Yuhuan [2 ]
Yang, Yun [1 ]
Fan, Baochen [1 ]
Pu, Bin [2 ]
机构
[1] Yunnan Univ, Natl Pilot Sch Software, Yunnan Key Lab Software Engn, Kunming 650106, Peoples R China
[2] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha 410082, Peoples R China
基金
中国国家自然科学基金;
关键词
Heuristic algorithms; Clustering algorithms; Feature extraction; Medical services; Filtering algorithms; Classification algorithms; Biomedical monitoring; Feature selection; health care data; high-dimensional data; correlation-guided clustering; particle swarm optimization; CLASSIFICATION; COLONY;
D O I
10.1109/TCE.2023.3334373
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The ubiquity and commoditization of wearable sensors have generated a deluge of user-generated health care data and played a key role in clinical utility, particularly when incorporated into personalized prediction models. The "curse of dimensionality" and enormous computational costs are still the main challenges faced by the existing algorithms as the number of wearable datasets exponentially increases. We propose a novel method by hybridizing a clustering method and a wrapper method to reduce the dimensionality of raw wearable datasets while preserving health care information. In the clustering stage, a dynamic correlation-guided feature clustering method reduces the search space by designing a dynamic threshold to filter unrelated high-dimensional features. In the wrapper stage, we obtain the optimal feature subset by improving the powerful search capability of the particle swarm optimization algorithm. A crossover operator based on normalized mutual information similarity is proposed to match particles, which effectively improves the diversity of the offspring swarm to prevent premature convergence. In addition, we propose a dynamic swarm strategy to mutate the duplicate particles in the swarm to enhance the efficiency of the particle search process. Our method is evaluated on ten real public datasets, and the experimental results demonstrate its superior performance.
引用
收藏
页码:2447 / 2459
页数:13
相关论文
共 54 条
  • [1] A new fusion of grey wolf optimizer algorithm with a two-phase mutation for feature selection
    Abdel-Basset, Mohamed
    El-Shahat, Doaa
    El-henawy, Ibrahim
    de Albuquerque, Victor Hugo C.
    Mirjalili, Seyedali
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2020, 139
  • [2] Toward a gold standard for promoter prediction evaluation
    Abeel, Thomas
    Van de Peer, Yves
    Saeys, Yvan
    [J]. BIOINFORMATICS, 2009, 25 (12) : I313 - I320
  • [3] A novel approach based on integration of convolutional neural networks and deep feature selection for short-term solar radiation forecasting
    Acikgoz, Hakan
    [J]. APPLIED ENERGY, 2022, 305
  • [4] Prominent feature extraction for review analysis: an empirical study
    Agarwal, Basant
    Mittal, Namita
    [J]. JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2016, 28 (03) : 485 - 498
  • [5] Type-2 fuzzy ontology-aided recommendation systems for IoT-based healthcare
    Ali, Farman
    Islam, S. M. Riazul
    Kwak, Daehan
    Khand, Pervez
    Ullah, Niamat
    Yoo, Sang-jo
    Kwak, K. S.
    [J]. COMPUTER COMMUNICATIONS, 2018, 119 : 138 - 155
  • [6] A framework for feature selection through boosting
    Alsahaf, Ahmad
    Petkov, Nicolai
    Shenoy, Vikram
    Azzopardi, George
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2022, 187
  • [7] [Anonymous], 1992, Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence, DOI DOI 10.7551/MITPRESS/1090.001.0001
  • [8] Two hybrid wrapper-filter feature selection algorithms applied to high-dimensional microarray experiments
    Apolloni, Javier
    Leguizamon, Guillermo
    Alba, Enrique
    [J]. APPLIED SOFT COMPUTING, 2016, 38 : 922 - 932
  • [9] PREDICTING DIABETES FROM PHOTOPLETHYSMOGRAPHY USING DEEP LEARNING
    Avram, Robert
    Tison, Geoffrey
    Kuhar, Peter
    Marcus, Gregory
    Pletcher, Mark
    Olgin, Jeffrey E.
    Aschbacher, Kirstin
    [J]. JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2019, 73 (09) : 16 - 16
  • [10] Opinion Mining of Movie Review using Hybrid Method of Support Vector Machine and Particle Swarm Optimization
    Basari, Abd Samad Hasan
    Hussin, Burairah
    Ananta, I. Gede Pramudya
    Zeniarja, Junta
    [J]. MALAYSIAN TECHNICAL UNIVERSITIES CONFERENCE ON ENGINEERING & TECHNOLOGY 2012 (MUCET 2012), 2013, 53 : 453 - 462