Two-Stage Prediction of Comorbid Cancer Patient Survivability Based on Improved Infinite Feature Selection

被引:5
|
作者
Liu, Peng [1 ]
Fei, Shumin [1 ]
机构
[1] Southeast Univ, Minist Educ, Sch Automat, Key Lab Measurement & Control CSE, Nanjing 210096, Peoples R China
关键词
Cancer; Feature extraction; Machine learning; Predictive models; Databases; Breast; Medical diagnostic imaging; Cancer comorbidity; SEER; survival prediction; infinite feature selection; data balancing; unsupervised feature selection; BREAST; MODEL; LUNG;
D O I
10.1109/ACCESS.2020.3016998
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The modeling of comorbid cancer patients' survivability has theoretical significance and practical needs. Cancer survivability prediction may provide guidance for clinical decision making and personalized medicine. The Surveillance, Epidemiology, and End Results(SEER) program provides large data sets suitable for analysis with machine learning methods. In this study, we consider survival prediction to be a two-stage problem. The first is to predict the five-year survivability of patients. For those whom the predicted outcome is 'death', the second stage predicts the remaining survival time. Male and female comorbid cancer cases(male-genital and urinary cancer for men and breast and female-genital cancer for women) were identified from the SEER database and labeled. In the classification stage,the dataset was processed with improved infinite feature selection(Iinf-FS) and random undersampling-based data balancing. These two methods resolved the issues of biased data set and poor classification accuracy. In the lifespan prediction stage, unsupervised infinite feature selection (UinfFS) was applied. The results indicate that the proposed method is effective.
引用
收藏
页码:169559 / 169567
页数:9
相关论文
共 50 条
  • [1] Protein sumoylation sites prediction based on two-stage feature selection
    Lu, Lin
    Shi, Xiao-He
    Li, Su-Jun
    Xie, Zhi-Qun
    Feng, Yong-Li
    Lu, Wen-Cong
    Li, Yi-Xue
    Li, Haipeng
    Cai, Yu-Dong
    MOLECULAR DIVERSITY, 2010, 14 (01) : 81 - 86
  • [2] Protein sumoylation sites prediction based on two-stage feature selection
    Lin Lu
    Xiao-He Shi
    Su-Jun Li
    Zhi-Qun Xie
    Yong-Li Feng
    Wen-Cong Lu
    Yi-Xue Li
    Haipeng Li
    Yu-Dong Cai
    Molecular Diversity, 2010, 14 : 81 - 86
  • [3] A two-stage modeling approach for breast cancer survivability prediction
    Sedighi-Maman, Zahra
    Mondello, Alexa
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2021, 149
  • [4] Prediction of Second Primary Lung Cancer Patient's Survivability Based on Improved Eigenvector Centrality-Based Feature Selection
    Liu, Peng
    Jin, Kexin
    Jiao, Yiping
    He, Mutian
    Fei, Shumin
    IEEE ACCESS, 2021, 9 (09): : 55663 - 55672
  • [5] A TWO-STAGE IMPROVED ANT COLONY OPTIMIZATION BASED FEATURE SELECTION FOR WEB CLASSIFICATION
    Xu, Jun
    Li, Guangyao
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2016, 12 (06): : 1851 - 1863
  • [6] A Two-Stage Feature Selection Algorithm Based on Redundancy and Relevance
    Antioquia, Arren Matthew C.
    Azcarraga, Arnulfo P.
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [7] Two-Stage Feature Selection with Unsupervised Second Stage
    Xu, Ke
    Arai, Hiromasa
    Maung, Crystal
    Schweitzer, Haim
    2017 IEEE 29TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2017), 2017, : 153 - 159
  • [8] Two-Stage Feature Selection with Unsupervised Second Stage
    Xu, Ke
    Maung, Crystal
    Arai, Hiromasa
    Schweitzer, Haim
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2018, 27 (07)
  • [9] Two-Stage Feature Selection for Text Classification
    Ozgur, Levent
    Gungor, Tunga
    INFORMATION SCIENCES AND SYSTEMS 2015, 2016, 363 : 329 - 337
  • [10] Two-stage feature selection for classification of gene expression data based on an improved Salp Swarm Algorithm
    Qin, Xiwen
    Zhang, Shuang
    Yin, Dongmei
    Chen, Dongxue
    Dong, Xiaogang
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2022, 19 (12) : 13747 - 13781