A Deep Reinforcement Learning-Based Feature Selection Method for Invasive Disease Event Prediction Using Imbalanced Follow-Up Data

被引:0
作者
Du, Yangyi [1 ]
Zhou, Xiaojun [1 ]
Gao, Qian [2 ]
Yang, Chunhua [1 ]
Huang, Tingwen [3 ]
机构
[1] Cent South Univ, Sch Automat, Changsha 410083, Peoples R China
[2] Cent South Univ, Xiangya Hosp, Dept Dermatol, Changsha 410008, Hunan, Peoples R China
[3] Texas A&M Univ Qatar, Doha 23874, Qatar
基金
中国国家自然科学基金;
关键词
Training; Hospitals; Reinforcement learning; Predictive models; Feature extraction; Prediction algorithms; Breast cancer; Iterative methods; Prognostics and health management; Diseases; Deep reinforcement learning; feature selection; breast cancer; invasive disease events; imbalanced data;
D O I
10.1109/JBHI.2024.3497325
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The machine learning-based model is a promising paradigm for predicting invasive disease events (iDEs) in breast cancer. Feature selection (FS) is an essential preprocessing technique employed to identify the pertinent features for the prediction model. However, conventional FS methods often fail with imbalanced clinical data due to the bias towards the majority class. In this paper, a novel FS framework based on reinforcement learning (RLFS) is developed to identify the optimal feature subset for the imbalanced data. The RLFS employs an iterative methodology, wherein data resampling technique generates a balanced dataset before each iteration. A decision network is trained using a deep RL algorithm to identify the relevant features for the dataset in the current iteration. With such an iterative training strategy, numerous constructed datasets gradually boost the FS capacity of the decision network, resulting in a robust performance for imbalanced data. Finally, a weighted model is proposed to determine the most suitable FS solution. The RLFS is employed to predict breast cancer iDEs using real follow-up data. The comparison results demonstrated that RLFS effectively reduces the number of features while outperforming several state-of-the-art FS algorithms.
引用
收藏
页码:1472 / 1483
页数:12
相关论文
共 40 条
[1]   SFE: A Simple, Fast, and Efficient Feature Selection Algorithm for High-Dimensional Data [J].
Ahadzadeh, Behrouz ;
Abdar, Moloud ;
Safara, Fatemeh ;
Khosravi, Abbas ;
Menhaj, Mohammad Bagher ;
Suganthan, Ponnuthurai Nagaratnam .
IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2023, 27 (06) :1896-1911
[2]   A fast Branch-and-Bound algorithm for U-curve feature selection [J].
Atashpaz-Gargari, Esmaeil ;
Reis, Marcelo S. ;
Braga-Neto, Ulisses M. ;
Barrera, Junior ;
Dougherty, Edward R. .
PATTERN RECOGNITION, 2018, 73 :172-188
[3]   A MARKOVIAN DECISION PROCESS [J].
BELLMAN, R .
JOURNAL OF MATHEMATICS AND MECHANICS, 1957, 6 (05) :679-684
[4]  
Bray F, 2018, CA-CANCER J CLIN, V68, P394, DOI [10.3322/caac.21492, 10.3322/caac.21609]
[5]   SMOTE: Synthetic minority over-sampling technique [J].
Chawla, Nitesh V. ;
Bowyer, Kevin W. ;
Hall, Lawrence O. ;
Kegelmeyer, W. Philip .
2002, American Association for Artificial Intelligence (16)
[6]   An Evolutionary Multitasking-Based Feature Selection Method for High-Dimensional Classification [J].
Chen, Ke ;
Xue, Bing ;
Zhang, Mengjie ;
Zhou, Fengyu .
IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (07) :7172-7186
[7]   The Research Progress on the Prognostic Value of the Common Hematological Parameters in Peripheral Venous Blood in Breast Cancer [J].
Chen, Li ;
Kong, Xiangyi ;
Yan, Chengrui ;
Fang, Yi ;
Wang, Jing .
ONCOTARGETS AND THERAPY, 2020, 13 :1397-1412
[8]   A Steering-Matrix-Based Multiobjective Evolutionary Algorithm for High-Dimensional Feature Selection [J].
Cheng, Fan ;
Chu, Feixiang ;
Xu, Yi ;
Zhang, Lei .
IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (09) :9695-9708
[9]   An interactive feature selection method based on multi-step state transition algorithm for high-dimensional data [J].
Du, Yangyi ;
Zhou, Xiaojun ;
Yang, Chunhua ;
Huang, Tingwen .
KNOWLEDGE-BASED SYSTEMS, 2023, 282
[10]   Predicting Invasive Disease-Free Survival for Early Stage Breast Cancer Patients Using Follow-Up Clinical Data [J].
Fu, Bo ;
Liu, Pei ;
Lin, Jie ;
Deng, Ling ;
Hu, Kejia ;
Zheng, Hong .
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2019, 66 (07) :2053-2064