WIMP: Web server tool for missing data imputation

Cited by: 4
Authors
Urda, D. [1]
Subirats, J. L. [1]
Garcia-Laencina, P. J.
Franco, L. [1]
Sancho-Gomez, J. L. [2]
Jerez, J. M. [1]
Affiliations
[1] Univ Malaga, Dept Lenguajes & Ciencias Comp, ETSI Informat, E-29071 Malaga, Spain
[2] Univ Politecn Cartagena, Dept Tecnol Informac & Comunicac, Cartagena, Spain
Keywords
Imputation; Missing data; Machine learning; Web application; EMPIRICAL LIKELIHOOD; MICROARRAY DATA; LINEAR-MODELS; REGRESSION; CLASSIFICATION; ALGORITHM; VALUES;
DOI
10.1016/j.cmpb.2012.08.006
Chinese Library Classification
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
The imputation of unknown or missing data is a crucial task in the analysis of biomedical datasets. In several situations, instances must be classified or identified from incomplete vectors, and missing values can substantially degrade the performance of the classification or recognition algorithms. Learning accurately from incomplete data raises a number of issues, some of which have not been completely solved in machine learning applications. Effective missing value estimation methods are therefore required. Different methods for missing data imputation exist, but most of the time selecting the appropriate technique involves testing several methods, comparing them, and choosing the best one. Furthermore, applying these methods is in most cases not straightforward, as they involve several technical details, and in some cases, such as microarray datasets, they require huge computational resources. As far as we know, there is no public software application that provides the computing capabilities required for carrying out the task of data imputation. This paper presents a new public tool for missing data imputation that is attached to a computer cluster in order to execute computationally intensive tasks. The software, WIMP (Web IMPutation), is a publicly available web site where registered users can create, execute, analyze, and store their simulations related to missing data imputation. (C) 2012 Elsevier Ireland Ltd. All rights reserved.
Pages: 1247-1254
Page count: 8
Related papers
50 in total
  • [41] Evaluation of different approaches for missing data imputation on features associated to genomic data
    Petrazzini, Ben Omega
    Naya, Hugo
    Lopez-Bello, Fernando
    Vazquez, Gustavo
    Spangenberg, Lucia
    BIODATA MINING, 2021, 14 (01)
  • [42] An Unsupervised Data-Mining and Generative-Based Multiple Missing Data Imputation Network for Energy Dataset
    Kim, Hyung Joon
    Kim, Mun Kyeom
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (11) : 13429 - 13440
  • [43] A systematic review of generative adversarial imputation network in missing data imputation
    Yuqing Zhang
    Runtong Zhang
    Butian Zhao
    Neural Computing and Applications, 2023, 35 : 19685 - 19705
  • [44] Four Factors Affecting Missing Data Imputation
    Hackl, Andreas
    Zeindl, Juergen
    Ehrlinger, Lisa
    35TH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, SSDBM 2023, 2023
  • [45] Imputation of missing longitudinal data: a comparison of methods
    Engels, JM
    Diehr, P
    JOURNAL OF CLINICAL EPIDEMIOLOGY, 2003, 56 (10) : 968 - 976
  • [46] Imputation of missing information in worldwide patent data
    de Rassenfosse, Gaetan
    Seliger, Florian
    DATA IN BRIEF, 2021, 34
  • [47] Imputation of missing data with neural networks for classification
    Choudhury, Suyra Jyoti
    Pal, Nikhil R.
    KNOWLEDGE-BASED SYSTEMS, 2019, 182
  • [48] Some Imputation Algorithms for Restoration of Missing Data
    Ryazanov, Vladimir
    PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, 2011, 7042 : 372 - 379
  • [49] A MISSING DATA IMPUTATION METHOD WITH DISTANCE FUNCTION
    Jea, Kuen-Fang
    Hsu, Chin-Wei
    Tang, Li-You
    PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOL 2, 2018 : 450 - 455
  • [50] An Experimental Survey of Missing Data Imputation Algorithms
    Miao, Xiaoye
    Wu, Yangyang
    Chen, Lu
    Gao, Yunjun
    Yin, Jianwei
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (07) : 6630 - 6650