A hybrid Bayesian network and tensor factorization approach for missing value imputation to improve breast cancer recurrence prediction

被引:31
|
作者
Vazifehdan, Mahin [1 ]
Moattar, Mohammad Hossein [1 ]
Jalali, Mehrdad [1 ]
机构
[1] Islamic Azad Univ, Mashhad Branch, Dept Software Engn, Mashhad, Iran
关键词
Breast cancer recurrence; Missing value imputation; Classification; Tensor factorization; Bayesian network; MODEL; REGRESSION;
D O I
10.1016/j.jksuci.2018.01.002
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data mining and machine learning approaches can be used to predict breast cancer recurrence. However, real datasets often include missing values for various reasons. In this paper, a hybrid imputation method is proposed with respect to the dependency between the attributes and the type of incomplete attributes in order to especially improve the prediction of breast cancer recurrence. After splitting the dataset into two discrete and numerical subsets, first missing values of the discrete fields are imputed using Bayesian network. Then, using Tensor factorization, the integrated dataset, which comprises of the filled-subset of the previous stage and numerical missing values subset, is constructed so that both continuous missing values are imputed and the accuracy of imputation is enhanced. We evaluated the proposed method versus six imputation methods i.e. mean, Hot-deck, K-NN, Weighted K-NN, Tensor factorization and Bayesian network on three datasets and used three classifiers, namely decision tree, K-Nearest Neighbor and Support Vector Machine for recurrence prediction. Experimental results show that the proposed method has as average 0.26 prediction improvement. Also, the prediction performance of the proposed approach outperforms all other imputation-classifier pairs in terms of specificity, sensitivity and accuracy. (C) 2018 The Authors. Production and hosting by Elsevier B.V. on behalf of King Saud University.
引用
收藏
页码:175 / 184
页数:10
相关论文
共 50 条
  • [21] A Hybrid Bayesian Network Model for Predicting Breast Cancer Prognosis
    Choi, Jong Pill
    Han, Tae Hwa
    Park, Rae Woong
    HEALTHCARE INFORMATICS RESEARCH, 2009, 15 (01) : 49 - 57
  • [22] A Bayesian network approach incorporating imputation of missing data enables exploratory analysis of complex causal biological relationships
    Howey, Richard
    Clark, Alexander D.
    Naamane, Najib
    Reynard, Louise N.
    Pratt, Arthur G.
    Cordell, Heather J.
    PLOS GENETICS, 2021, 17 (09):
  • [23] Low-Rank Tensor and Hybrid Smoothness Regularization-Based Approach for Traffic Data Imputation With Multimodal Missing
    Zeng, Zeyu
    Liu, Bin
    Feng, Jun
    Yang, Xiaolin
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (10) : 13014 - 13026
  • [24] Bayesian tensor factorization-drive breast cancer subtyping by integrating multi-omics data
    Liu, Qian
    Cheng, Bowen
    Jin, Yongwon
    Hu, Pingzhao
    JOURNAL OF BIOMEDICAL INFORMATICS, 2022, 125
  • [25] Comparison of Logistic Regression and Bayesian Networks for Risk Prediction of Breast Cancer Recurrence
    Witteveen, Annemieke
    Nane, Gabriela F.
    Vliegen, Ingrid M. H.
    Siesling, Sabine
    IJzerman, Maarten J.
    MEDICAL DECISION MAKING, 2018, 38 (07) : 822 - 833
  • [26] Prognostic value of routine laboratory variables in prediction of breast cancer recurrence
    Zhu Zhu
    Ling Li
    Zhong Ye
    Tong Fu
    Ye Du
    Aiping Shi
    Di Wu
    Ke Li
    Yifan Zhu
    Chun Wang
    Zhimin Fan
    Scientific Reports, 7
  • [27] Prognostic value of routine laboratory variables in prediction of breast cancer recurrence
    Zhu, Zhu
    Li, Ling
    Ye, Zhong
    Fu, Tong
    Du, Ye
    Shi, Aiping
    Wu, Di
    Li, Ke
    Zhu, Yifan
    Wang, Chun
    Fan, Zhimin
    SCIENTIFIC REPORTS, 2017, 7
  • [28] A Novel Hybrid Spatiotemporal Missing Value Imputation Approach for Rainfall Data: An Application to the Ratnapura Area, Sri Lanka
    Saubhagya, Shanthi
    Tilakaratne, Chandima
    Lakraj, Pemantha
    Mammadov, Musa
    APPLIED SCIENCES-BASEL, 2024, 14 (03):
  • [29] Breast cancer recurrence prediction with deep neural network and feature optimization
    Chandran, R. I. Arathi
    Bai, V. Mary Amala
    AUTOMATIKA, 2024, 65 (01) : 343 - 360
  • [30] Prediction of distant recurrence in breast cancer using a deep neural network
    Azman, Balqis Mohd
    Hussain, Saiful Izzuan
    Azmi, Nor Aniza
    Abd Ghani, Muhammad Zahin Athir
    Norlen, Nor Irfan Danial
    REVISTA INTERNACIONAL DE METODOS NUMERICOS PARA CALCULO Y DISENO EN INGENIERIA, 2022, 38 (01):