Feature Selection Algorithms for High-dimensional Unbalanced Medical Data

被引:0
作者
Liu, Jiaxuan [1 ]
Li, Daiwei [1 ]
Ren, Lijuan [1 ]
Zhang, Haiqing [1 ]
Tang, Xin [1 ]
Xiang, Xiaoming [2 ]
机构
[1] Chengdu Univ Informat Technol, Chengdu, Peoples R China
[2] Sichuan Meteorol Observat Data & Ctr, Chengdu, Peoples R China
来源
2024 4TH INTERNATIONAL CONFERENCE ON INDUSTRIAL AUTOMATION, ROBOTICS AND CONTROL ENGINEERING, IARCE | 2024年
基金
中国国家自然科学基金;
关键词
feature selection; medical data; high-dimensional; unbalanced;
D O I
10.1109/IARCE64300.2024.00099
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
High-dimensional feature selection is a difficult issue in medical field, while the class imbalance problem existing in medical data can also seriously affect the classification performance of algorithms. To address the joint problem of high-dimensional feature selection and class imbalance, a new objective function and the SMOTE imbalance approach are performed on the Simple, Fast and Efficient (SFE) based high-dimensional feature selection algorithm, as well as incorporating the Metropolis criterion in the SFE algorithm to help the algorithm jump out of the local optimal solution. Finally, a feature selection algorithm based on high-dimensional unbalanced medical data (GOSFE) is proposed. Experiments are conducted on six public datasets, and the results show that compared with the SFE algorithm, GOSFE improves the G-means and F1 metrics by 6.75% and 2.53% on the SMK_CAN_187 dataset, and the G-means and F1 metrics by5.95% and 3.42% on the Leukemia dataset, respectively. Meanwhile, the experiments show that the GOSFE algorithm can quickly search the subset of features with the highest classification accuracy, reduce the redundancy of high-dimensional data, and has a good potential for improving the high-dimensional imbalance problem of medical data.
引用
收藏
页码:511 / 514
页数:4
相关论文
共 19 条
[1]   An Efficient Marine Predators Algorithm for Feature Selection [J].
Abd Elminaam, Diaa Salama ;
Nabil, Ayman ;
Ibraheem, Shimaa A. ;
Houssein, Essam H. .
IEEE ACCESS, 2021, 9 :60136-60153
[2]  
Ahadzadeh B, 2023, IEEE Transactions on Evolutionary Computation
[3]   Feature selection in machine learning: A new perspective [J].
Cai, Jie ;
Luo, Jiawei ;
Wang, Shulin ;
Yang, Sheng .
NEUROCOMPUTING, 2018, 300 :70-79
[4]   A survey on feature selection methods [J].
Chandrashekar, Girish ;
Sahin, Ferat .
COMPUTERS & ELECTRICAL ENGINEERING, 2014, 40 (01) :16-28
[5]   SMOTE: Synthetic minority over-sampling technique [J].
Chawla, Nitesh V. ;
Bowyer, Kevin W. ;
Hall, Lawrence O. ;
Kegelmeyer, W. Philip .
2002, American Association for Artificial Intelligence (16)
[6]  
Du L, 2015, Annals of Data Science, V2, P293
[7]   Improved Reptile Search Optimization Algorithm Using Chaotic Map and Simulated Annealing for Feature Selection in Medical Field [J].
Elgamal, Zenab ;
Sabri, Aznul Qalid Md ;
Tubishat, Mohammad ;
Tbaishat, Dina ;
Makhadmeh, Sharif Naser ;
Alomari, Osama Ahmad .
IEEE ACCESS, 2022, 10 :51428-51446
[8]   High-Efficient Memristive Genetic Algorithm for Feature Selection [J].
Fang, Chiming ;
Zhou, Houji ;
Yang, Ling ;
Cheng, Weiming ;
Li, Yi ;
Miao, Xiangshui .
IEEE TRANSACTIONS ON ELECTRON DEVICES, 2023, 70 (08) :4163-4169
[9]   Logistic discrimination based on G-mean and F-measure for imbalanced problem [J].
Guo, Huaping ;
Liu, Hongbing ;
Wu, Changan ;
Zhi, Weimei ;
Xiao, Yan ;
She, Wei .
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2016, 31 (03) :1155-1166
[10]   Survey Paper Multi-objective particle swarm optimization with adaptive strategies for feature selection [J].
Han, Fei ;
Chen, Wen-Tao ;
Ling, Qing-Hua ;
Han, Henry .
SWARM AND EVOLUTIONARY COMPUTATION, 2021, 62