A hybrid Bayesian network and tensor factorization approach for missing value imputation to improve breast cancer recurrence prediction

被引:31
|
作者
Vazifehdan, Mahin [1 ]
Moattar, Mohammad Hossein [1 ]
Jalali, Mehrdad [1 ]
机构
[1] Islamic Azad Univ, Mashhad Branch, Dept Software Engn, Mashhad, Iran
关键词
Breast cancer recurrence; Missing value imputation; Classification; Tensor factorization; Bayesian network; MODEL; REGRESSION;
D O I
10.1016/j.jksuci.2018.01.002
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data mining and machine learning approaches can be used to predict breast cancer recurrence. However, real datasets often include missing values for various reasons. In this paper, a hybrid imputation method is proposed with respect to the dependency between the attributes and the type of incomplete attributes in order to especially improve the prediction of breast cancer recurrence. After splitting the dataset into two discrete and numerical subsets, first missing values of the discrete fields are imputed using Bayesian network. Then, using Tensor factorization, the integrated dataset, which comprises of the filled-subset of the previous stage and numerical missing values subset, is constructed so that both continuous missing values are imputed and the accuracy of imputation is enhanced. We evaluated the proposed method versus six imputation methods i.e. mean, Hot-deck, K-NN, Weighted K-NN, Tensor factorization and Bayesian network on three datasets and used three classifiers, namely decision tree, K-Nearest Neighbor and Support Vector Machine for recurrence prediction. Experimental results show that the proposed method has as average 0.26 prediction improvement. Also, the prediction performance of the proposed approach outperforms all other imputation-classifier pairs in terms of specificity, sensitivity and accuracy. (C) 2018 The Authors. Production and hosting by Elsevier B.V. on behalf of King Saud University.
引用
收藏
页码:175 / 184
页数:10
相关论文
共 50 条
  • [31] Missing data imputation on the 5-year survival prediction of breast cancer patients with unknown discrete values
    Garcia-Laencina, Pedro J.
    Abreu, Pedro Henriques
    Abreu, Miguel Henriques
    Afonoso, Noemia
    COMPUTERS IN BIOLOGY AND MEDICINE, 2015, 59 : 125 - 133
  • [32] Performance assessment of hybrid machine learning approaches for breast cancer and recurrence prediction
    Pati, Abhilash
    Panigrahi, Amrutanshu
    Parhi, Manoranjan
    Giri, Jayant
    Qin, Hong
    Mallik, Saurav
    Pattanayak, Sambit Ranjan
    Agrawal, Umang Kumar
    PLOS ONE, 2024, 19 (08):
  • [33] Prediction of docetaxel toxicity in older cancer patients: a Bayesian network approach
    Khayi, Fouzy
    Lafarge, Laurent
    Terret, Catherine
    Albrand, Gilles
    Falquet, Benoit
    Culine, Stephane
    Gourgou, Sophie
    Ducher, Michel
    Bourguignon, Laurent
    FUNDAMENTAL & CLINICAL PHARMACOLOGY, 2019, 33 (06) : 679 - 686
  • [34] Regularized Neural Network to Identify Potential Breast Cancer: A Bayesian Approach
    Rodrigo, Hansapani S.
    Tsokos, Chris P.
    Sharaf, Taysseer
    JOURNAL OF MODERN APPLIED STATISTICAL METHODS, 2016, 15 (02) : 563 - 579
  • [35] A hybrid neural network/genetic algorithm applied to breast cancer detection and recurrence
    Belciug, Smaranda
    Gorunescu, Florin
    EXPERT SYSTEMS, 2013, 30 (03) : 243 - 254
  • [36] Prognostic models for breast cancer: based on logistics regression and Hybrid Bayesian Network
    Su, Fan
    Chao, Jianqian
    Liu, Pei
    Zhang, Bowen
    Zhang, Na
    Luo, Zongyu
    Han, Jiaying
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2023, 23 (01)
  • [37] Prognostic models for breast cancer: based on logistics regression and Hybrid Bayesian Network
    Fan Su
    Jianqian Chao
    Pei Liu
    Bowen Zhang
    Na Zhang
    Zongyu Luo
    Jiaying Han
    BMC Medical Informatics and Decision Making, 23
  • [38] A Novel Aggregated Multiple Imputation Approach for Enhanced Survival Prediction and Classification on Breast Cancer and Lung Cancer Data
    Deepa, P.
    Gunavathi, C.
    IEEE ACCESS, 2024, 12 : 189102 - 189121
  • [39] Prediction of Near-term Breast Cancer Risk using a Bayesian Belief Network
    Zheng, Bin
    Ramalingam, Pandiyarajan
    Hariharan, Harishwaran
    Leader, Joseph K.
    Gur, David
    MEDICAL IMAGING 2013: IMAGE PERCEPTION, OBSERVER PERFORMANCE, AND TECHNOLOGY ASSESSMENT, 2013, 8673
  • [40] Breast Cancer Surgery Survivability Prediction Using Bayesian Network and Support Vector Machines
    Aljawad, Dania Abed
    Alqahtani, Ebtesam
    AL-Kuhaili, Ghaidaa
    Qamhan, Nada
    Alghamdi, Noof
    Alrashed, Saleh
    Alhiyafi, Jamal
    Olatunji, Sunday O.
    2017 INTERNATIONAL CONFERENCE ON INFORMATICS, HEALTH & TECHNOLOGY (ICIHT), 2017,