Hybrid Model for Data Imputation: Using Fuzzy c means and Multi Layer Perceptron

Cited by: 0
Authors
Azim, Shambeel [1]
Aggarwal, Swati [1]
Affiliations
[1] Inst Technol & Management, Dept Comp Sci, Gurgaon 122017, Haryana, India
Source
SOUVENIR OF THE 2014 IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE (IACC) | 2014
Keywords
Imputation; Fuzzy c-Means; Missing data; MLP; k-means; MISSING VALUES; NEURAL-NETWORK;
DOI
Not available
Chinese Library Classification (CLC) number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
Databases store datasets that are not always complete. Some records contain missing fields, which may result from human or system error during data collection. Data imputation is the process of filling in missing values to produce complete records. Complete databases can be analyzed more accurately than incomplete ones. This paper proposes a two-stage hybrid model for filling in missing values that applies fuzzy c-means (FCM) clustering and a multilayer perceptron (MLP) in sequence, and compares it with k-means imputation and FCM imputation. The accuracy of the model is measured using the Mean Absolute Percentage Error (MAPE). The MAPE values obtained show that the proposed model is more accurate at filling multiple missing values in a record than stage 1 alone.
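The abstract does not give implementation details, so the following is only a minimal sketch of the two ideas it names: an FCM-based fill for missing values and the MAPE score used to judge it. It assumes a common variant of FCM imputation (cluster the complete records, then fill each gap with the membership-weighted average of the cluster centers); the paper's actual stage 1 may differ, and the MLP stage is not shown. The function names `mape` and `fcm_impute` and all parameters are hypothetical.

```python
import numpy as np

def mape(actual, predicted):
    """Mean Absolute Percentage Error, in percent (actual values must be nonzero)."""
    actual = np.asarray(actual, dtype=float)
    predicted = np.asarray(predicted, dtype=float)
    return 100.0 * np.mean(np.abs((actual - predicted) / actual))

def fcm_impute(X, n_clusters=2, m=2.0, n_iter=100, seed=0):
    """Fill NaN entries using fuzzy c-means centers (assumed variant, see lead-in)."""
    X = np.asarray(X, dtype=float)
    mask = np.isnan(X)
    complete = X[~mask.any(axis=1)]          # rows with no missing fields

    # Random initial memberships, each row normalized to sum to 1.
    rng = np.random.default_rng(seed)
    U = rng.random((len(complete), n_clusters))
    U /= U.sum(axis=1, keepdims=True)

    for _ in range(n_iter):
        W = U ** m                           # fuzzified memberships
        centers = (W.T @ complete) / W.sum(axis=0)[:, None]
        # Distance of every complete row to every center (small offset avoids /0).
        d = np.linalg.norm(complete[:, None, :] - centers[None, :, :], axis=2) + 1e-12
        inv = d ** (-2.0 / (m - 1.0))
        U = inv / inv.sum(axis=1, keepdims=True)

    filled = X.copy()
    for i in np.where(mask.any(axis=1))[0]:
        obs = ~mask[i]
        # Memberships of the incomplete row, computed on its observed fields only.
        d = np.linalg.norm(centers[:, obs] - X[i, obs], axis=1) + 1e-12
        inv = d ** (-2.0 / (m - 1.0))
        u = inv / inv.sum()
        # Missing fields become the membership-weighted average of the centers.
        filled[i, mask[i]] = u @ centers[:, mask[i]]
    return filled

# Illustrative data only (not from the paper): two loose groups, one hidden value.
data = np.array([[1.0, 2.0], [1.2, 1.9], [0.9, 2.1],
                 [8.0, 9.0], [8.2, 9.1], [7.9, np.nan]])
completed = fcm_impute(data)
print("imputed value:", completed[5, 1])
print("MAPE vs. a hypothetical true value of 8.9:", mape([8.9], [completed[5, 1]]))
```

In a two-stage setup like the one the abstract describes, the output of such a fill could serve as the starting point that a second model refines, with MAPE computed against held-out true values for both stages.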
Pages: 1281-1285
Number of pages: 5