Missing data analysis with fuzzy C-Means: A study of its application in a psychological scenario

被引:48
作者
Di Nuovo, Alessandro G. [1 ]
机构
[1] Univ Catania, I-95125 Catania, Italy
关键词
Psychodiagnostic tools; Missing data analysis; Fuzzy C-Means clustering;
D O I
10.1016/j.eswa.2010.12.067
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In scientific research, and particularly in psychological studies, data for some variables in the database to be analyzed may well be missing. If not dealt with in the correct way, the missing values may weaken or even compromise the validity of research into the database, especially if it is a small one. In this paper we introduce the most common solutions to this problem offered by the most popular statistical software and a technique based on the most famous fuzzy clustering algorithm: Fuzzy C-Means (FCM). Then we compare these methodologies in order to highlight the peculiar characteristics of each solution. The comparison was made in a psychological research environment, using a database of in-patients who have a diagnosis of mental retardation. The results demonstrate that completion techniques, and in particular the one based on FCM, lead to effective data imputation, avoiding the deletion of elements with missing data, which diminishes the power of the research. (C) 2010 Elsevier Ltd. All rights reserved.
引用
收藏
页码:6793 / 6797
页数:5
相关论文
共 22 条
[1]  
[Anonymous], 1997, WISC R CONTRIBUTO TA
[2]  
[Anonymous], 1981, WECHSLER ADULT INTEL
[3]  
[Anonymous], 2001, MISSING DATA
[4]  
[Anonymous], 2012, MMWR-MORBID MORTAL W
[5]  
[Anonymous], 1988, STAT POWER ANAL BEHA
[6]  
[Anonymous], Pattern Recognition with Fuzzy Objective Function Algorithms
[7]  
Bezdek J.C., 1999, The Handbooks of Fuzzy Sets Series
[8]   A comparison of inclusive and restrictive strategies in modern missing data procedures [J].
Collins, LM ;
Schafer, JL ;
Kam, CM .
PSYCHOLOGICAL METHODS, 2001, 6 (04) :330-351
[9]  
DINUOVO S, 2002, STRUMENTI PSICODIAGN
[10]   Review: A gentle introduction to imputation of missing values [J].
Donders, A. Rogier T. ;
van der Heijden, Geert J. M. G. ;
Stijnen, Theo ;
Moons, Karel G. M. .
JOURNAL OF CLINICAL EPIDEMIOLOGY, 2006, 59 (10) :1087-1091