WIMP: Web server tool for missing data imputation

被引:4
作者
Urda, D. [1 ]
Subirats, J. L. [1 ]
Garcia-Laencina, P. J.
Franco, L. [1 ]
Sancho-Gomez, J. L. [2 ]
Jerez, J. M. [1 ]
机构
[1] Univ Malaga, Dept Lenguajes & Ciencias Comp, ETSI Informat, E-29071 Malaga, Spain
[2] Univ Politecn Cartagena, Dept Tecnol Informac & Comunicac, Cartagena, Spain
关键词
Imputation; Missing data; Machine learning; Web application; EMPIRICAL LIKELIHOOD; MICROARRAY DATA; LINEAR-MODELS; REGRESSION; CLASSIFICATION; ALGORITHM; VALUES;
D O I
10.1016/j.cmpb.2012.08.006
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The imputation of unknown or missing data is a crucial task on the analysis of biomedical datasets. There are several situations where it is necessary to classify or identify instances given incomplete vectors, and the existence of missing values can much degrade the performance of the algorithms used for the classification/recognition. The task of learning accurately from incomplete data raises a number of issues some of which have not been completely solved in machine learning applications. In this sense, effective missing value estimation methods are required. Different methods for missing data imputations exist but most of the times the selection of the appropriate technique involves testing several methods, comparing them and choosing the right one. Furthermore, applying these methods, in most cases, is not straightforward, as they involve several technical details, and in particular in cases such as when dealing with microarray datasets, the application of the methods requires huge computational resources. As far as we know, there is not a public software application that can provide the computing capabilities required for carrying the task of data imputation. This paper presents a new public tool for missing data imputation that is attached to a computer cluster in order to execute high computational tasks. The software WIMP (Web IMPutation) is a public available web site where registered users can create, execute, analyze and store their simulations related to missing data imputation. (C) 2012 Elsevier Ireland Ltd. All rights reserved.
引用
收藏
页码:1247 / 1254
页数:8
相关论文
共 50 条
  • [31] Missing Data Imputation based on Unsupervised Simple Competitive Learning
    Lee, Byoung Jik
    PROCEEDINGS OF THE 9TH WSEAS INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, KNOWLEDGE ENGINEERING AND DATA BASES, 2010, : 292 - +
  • [32] Missing data imputation with fuzzy feature selection for diabetes dataset
    Dzulkalnine, Mohamad Faiz
    Sallehuddin, Roselina
    SN APPLIED SCIENCES, 2019, 1 (04):
  • [33] A HYBRID SELF ORGANIZING MAP IMPUTATION (SOMI) WITH NAIVE BAYES FOR IMPUTATION MISSING DATA CLASSIFICATION
    Khotimah, Bain Khusnul
    Miswanto
    Suprajitno, Herry
    INTERNATIONAL JOURNAL OF GEOMATE, 2019, 17 (62): : 195 - 202
  • [34] Missing data imputation: focusing on single imputation
    Zhang, Zhongheng
    ANNALS OF TRANSLATIONAL MEDICINE, 2016, 4 (01)
  • [35] Imputation of missing daily rainfall data; A comparison between artificial intelligence and statistical techniques
    Wangwongchai, Angkool
    Waqas, Muhammad
    Dechpichai, Porntip
    Hlaing, Phyo Thandar
    Ahmad, Shakeel
    Humphries, Usa Wannasingha
    METHODSX, 2023, 11
  • [36] Missing data imputation with fuzzy feature selection for diabetes dataset
    Mohamad Faiz Dzulkalnine
    Roselina Sallehuddin
    SN Applied Sciences, 2019, 1
  • [37] A survey on missing data in machine learning
    Emmanuel, Tlamelo
    Maupong, Thabiso
    Mpoeleng, Dimane
    Semong, Thabo
    Mphago, Banyatsang
    Tabona, Oteng
    JOURNAL OF BIG DATA, 2021, 8 (01)
  • [38] Exploring the Effects of Data Distribution in Missing Data Imputation
    Soares, Jastin Pompeu
    Santos, Miriam Seoane
    Abreu, Pedro Henriques
    Araujo, Helder
    Santos, Joao
    ADVANCES IN INTELLIGENT DATA ANALYSIS XVII, IDA 2018, 2018, 11191 : 251 - 263
  • [39] Evaluation of different approaches for missing data imputation on features associated to genomic data
    Ben Omega Petrazzini
    Hugo Naya
    Fernando Lopez-Bello
    Gustavo Vazquez
    Lucía Spangenberg
    BioData Mining, 14
  • [40] A systematic review of generative adversarial imputation network in missing data imputation
    Zhang, Yuqing
    Zhang, Runtong
    Zhao, Butian
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (27) : 19685 - 19705